ftime-trace: break up the template instance span into sub-phases by suyashkumar102 · Pull Request #22825 · dlang/dmd

suyashkumar102 · 2026-03-27T06:49:20Z

I was looking at -ftime-trace output in perfetto, and the sema1 template instance block was just one big blob with nothing inside it. That makes it really hard to tell whether the time is going into arg evaluation, overload resolution, expanding members, sema2 or sema3.

Added spans for:

- semanticTiargs (arg semantic)
- findBestMatch (overload resolution)
- expandMembers (sema1 on members)
- semantic2 on the instance
- trySemantic3 on the instance
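
As a rough illustration of the idea (not the actual dmd change), nested sub-spans can be modelled with a scope guard that opens a span on construction and closes it on scope exit. Everything here is hypothetical: the `Span` and `Event` types, the `events` buffer, and the `instantiate` stand-in are invented for the sketch, and the span names other than the ones quoted in this thread are assumptions.

```d
import core.time : Duration, MonoTime;

/// One completed span, recorded in the order spans finish.
struct Event
{
    string name;
    int depth;        // nesting level, 0 = outermost
    Duration elapsed;
}

Event[] events;       // hypothetical in-memory trace buffer (thread-local)
int currentDepth;     // how many spans are currently open

/// Hypothetical scope guard: opens a span when constructed, closes it on scope exit.
struct Span
{
    private string name;
    private MonoTime start;

    @disable this(this); // a span must not be copied

    this(string name)
    {
        this.name = name;
        start = MonoTime.currTime;
        ++currentDepth;
    }

    ~this()
    {
        --currentDepth;
        events ~= Event(name, currentDepth, MonoTime.currTime - start);
    }
}

/// Stand-in for a template instantiation; the phase bodies are left empty.
void instantiate(string what)
{
    auto whole = Span("Template Instance: " ~ what);
    { auto s = Span("Template Arg Semantic"); /* evaluate arguments */ }
    { auto s = Span("Overload Resolution");   /* pick the best match */ }
    { auto s = Span("Template Members");      /* sema1 on members */ }
}

void main()
{
    import std.array : replicate;
    import std.stdio : writefln;

    instantiate("Repeat!(float, 4)");
    // Inner spans finish first, so the enclosing span is printed last.
    foreach (e; events)
        writefln("%s%s: %s", replicate("  ", e.depth), e.name, e.elapsed);
}
```

With a guard like this, each phase shows up as a child of the enclosing span, which is what makes the breakdown visible in a flame chart.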

fixes #22711

dlang-bot · 2026-03-27T06:49:24Z

Thanks for your pull request and interest in making D better, @suyashkumar102! We are looking forward to reviewing it, and you should be hearing from a maintainer soon.
Please verify that your PR follows this checklist:

My PR is fully covered with tests (you can see the coverage diff by visiting the details link of the codecov check)
My PR is as minimal as possible (smaller, focused PRs are easier to review than big ones)
I have provided a detailed rationale explaining my changes
New or modified functions have Ddoc comments (with Params: and Returns:)

Please see CONTRIBUTING.md for more information.

If you have addressed all reviews or aren't sure how to proceed, don't hesitate to ping us with a simple comment.

Bugzilla references

Your PR doesn't reference any Bugzilla issue.

If your PR contains non-trivial changes, please reference a Bugzilla issue or create a manual changelog.

⚠️⚠️⚠️ Warnings ⚠️⚠️⚠️

In preparation for migrating from Bugzilla to GitHub Issues, the issue reference syntax has changed. Please add the word "Bugzilla" to issue references. For example, Fix Bugzilla Issue 12345 or Fix Bugzilla 12345. (Reminder: the edit needs to be done in the Git commit message, not the GitHub pull request.)

Testing this PR locally

If you don't have a local development environment setup, you can use Digger to test this PR:

dub run digger -- build "master + dmd#22825"

suyashkumar102 · 2026-03-27T09:18:00Z

Tested locally on Windows — here's what the trace looks like in perfetto after this PR. The Sema1: Template Instance blocks now have visible sub-spans nested inside them. Before this, the entire semantic analysis block was flat with no sub-breakdown, making it impossible to attribute compile-time regressions to specific phases. The selected slice at the bottom confirms Sema1: Template Arg Semantic: Repeat!(float, 4) is showing correctly.

dkorpel · 2026-03-27T11:06:18Z

How were these spans chosen? It's important to weigh the added granularity vs. the added overhead. If a span is always a constant time, or always a fixed percentage of the parent span, then it would only make the flame chart larger without adding new information. If you could show how the spans vary in different scenarios, that would be great!

suyashkumar102 · 2026-03-27T13:41:29Z

@dkorpel
The spans were chosen to line up with the main phases involved in template instantiation in the compiler — argument semantic, overload resolution, member expansion, and the semantic2/3 passes. The idea was to break the template instance into a small number of meaningful steps rather than adding very fine-grained instrumentation.

To check whether they actually add useful information, I tried two contrasting cases:

Scenario A: overload-heavy (select!bool with multiple constrained candidates):

- Template instance ≈ 643 µs
- Overload resolution ≈ 609 µs (~95%)
- Members: negligible

Scenario B: recursive template (Fib!10):

- Template instance ≈ 3.3 ms
- Members ≈ 3.25 ms (~98%)
- Overload resolution: negligible

So the dominant cost shifts depending on the pattern — overload resolution in one case, member expansion in the other. Without the sub-spans both would just appear as “template instantiation”, even though the underlying work is very different.
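
The benchmark sources are not included in this thread, so the following is only a guess at what the two scenarios could look like. The constraint set on `select` and the exact definition of `Fib` are assumptions made for illustration.

```d
// Scenario A (assumed shape): several constrained overloads share one name,
// so resolving select!bool means evaluating each candidate's constraint.
string select(T)() if (is(T == bool))   { return "bool"; }
string select(T)() if (is(T == int))    { return "int"; }
string select(T)() if (is(T == double)) { return "double"; }
string select(T)() if (is(T == string)) { return "string"; }

// Scenario B (assumed shape): a recursive eponymous template, where almost all
// of the work happens while analysing the members of each instance.
template Fib(int n)
{
    static if (n < 2)
        enum Fib = n;
    else
        enum Fib = Fib!(n - 1) + Fib!(n - 2);
}

void main()
{
    // Only the bool overload's constraint is satisfied for T = bool.
    assert(select!bool() == "bool");

    // Fib!10 instantiates Fib!9 down to Fib!0 at compile time.
    static assert(Fib!10 == 55);
}
```

Compiling a file like this with -ftime-trace and comparing the two instances in the trace viewer is one way to check whether the dominant sub-span really differs between the patterns.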

For granularity vs overhead, I tried to keep it at a level where each span corresponds to a distinct kind of work, rather than splitting things further. That keeps the trace readable while still making it possible to see where time is going. Also, this is only active with -ftime-trace, so it doesn’t affect normal compilation.

was looking at a -ftime-trace output in perfetto and the sema1 template instance block was just one big blob with nothing inside it. makes it really hard to tell if the time is going into arg evaluation, overload resolution, expanding members, sema2 or sema3.

added spans for:

- semanticTiargs (arg semantic)
- findBestMatch (overload resolution)
- expandMembers (sema1 on members)
- semantic2 on the instance
- trySemantic3 on the instance

updated compilable/ftimetrace.d to expect the new span names.

fixes Bugzilla 22711

thewilsonator · 2026-03-27T14:04:33Z

compiler/src/dmd/templatesem.d

+    if (false)
    {
    Lerror:


Please make Lerror a nested function and then use return Lerror();

compiler/src/dmd/templatesem.d

The previous commit converted goto Lerror to return Lerror() across the file, but Lerror() as a nested function only exists inside templateInstanceSemantic. The four call sites inside functionResolve (two in applyTemplate, two in the outer body) still have their own Lerror: labels in scope, so they should stay as goto Lerror.
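
To illustrate the scoping point in general terms, here is a small sketch that is not taken from templatesem.d. The functions `parseDigit` and `parseSign` are hypothetical; they only show that a nested `Lerror()` is visible inside the function that declares it, while another function with its own `Lerror:` label has to keep using `goto`.

```d
/// Hypothetical example: error handling through a nested function.
int parseDigit(char c, ref int errors)
{
    // Replaces an `Lerror:` label; as a nested function it can access `errors`.
    int Lerror()
    {
        ++errors;
        return -1;
    }

    if (c < '0')
        return Lerror(); // was: goto Lerror;
    if (c > '9')
        return Lerror();
    return c - '0';
}

/// A different function that still has its own label. `return Lerror()` would
/// not compile here, because the nested function above is out of scope.
int parseSign(char c, ref int errors)
{
    if (c == '+')
        return 1;
    if (c == '-')
        return -1;
    goto Lerror;

Lerror:
    ++errors;
    return 0;
}

void main()
{
    int errors;
    assert(parseDigit('7', errors) == 7);
    assert(parseDigit('x', errors) == -1);
    assert(parseSign('?', errors) == 0);
    assert(errors == 2);
}
```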

suyashkumar102 · 2026-03-28T18:29:37Z

@dkorpel just following up on PR #22825 — answered your question about the span selection with some benchmark data. Let me know if you'd like any changes.

dkorpel · 2026-03-28T18:35:44Z

changelog/dmd.ftime-trace-sema-subspans.dd

+$(CHANGELOG_NAV_INJECT)
+
+$(VERSION $(CHANGELOG_VERSION_NIGHTLY), =================================================,
+
+$(BUGSTITLE_TEXT_HEADER Compiler changes,
+
+$(LI $(RELATIVE_LINK2 ftime-trace-sema-subspans,`-ftime-trace`: add sub-spans for template argument semantic and overload resolution))
+
+)


Please check the documentation on how to structure changelog entries, and proof-read the text

Fixed: I used the correct format from the README and matched the span names to the actual trace output.

suyashkumar102 force-pushed the fix/ftime-trace-sema-subspans branch 2 times, most recently from ae08b16 to 3802242 (March 27, 2026 08:07)

suyashkumar102 force-pushed the fix/ftime-trace-sema-subspans branch from 3802242 to 511f335 (March 27, 2026 13:59)

thewilsonator reviewed Mar 27, 2026

suyashkumar102 added 2 commits March 27, 2026 20:07

refactor: replace goto Lerror with nested function

b1e6333

refactor: replace goto Lerror with nested helper function

83feeb3

thewilsonator reviewed Mar 27, 2026

dkorpel requested changes Mar 28, 2026

suyashkumar102 added 2 commits March 29, 2026 09:22

fix changelog format for ftime-trace sema subspans

6ffe798

fix span name in changelog: Expand Members -> Template Members

8d32379

dkorpel approved these changes Mar 30, 2026

suyashkumar102 wants to merge 6 commits into dlang:master from suyashkumar102:fix/ftime-trace-sema-subspans
