Show expansion of C++ macros
Categories
(Webtools :: Searchfox, enhancement)
Tracking
(firefox130 fixed)
Tracking | Status | |
---|---|---|
firefox130 | --- | fixed |
People
(Reporter: asuth, Assigned: nicolas.guichard)
References
(Regressed 1 open bug)
Details
Attachments
(2 files)
Trying to figure out what ISUPPORTS and related cycle-collecting macros do could be made easier by showing what the macros expand to (in places where compilation expands the macro).
Reporter | ||
Comment 1•5 years ago
|
||
Shelving for now. Various in-progress attempts all muddled together are at https://github.com/asutherland/mozsearch/tree/expand-macros
The main complication seemed to be that:
- Macro expansion hooks fired prior to the actual work of the expansion.
- The preprocessing record that was potentially supposed to exist no longer existed at the end of the file analysis pass.
- I didn't quite intuitively understand what the sourcemanager was up to.
The branch has some hacks that know how to produce a post-macro expansion pretty print of impacted code, but that doesn't really suffice for our use-cases.
I think a way that might work would basically be to:
- Scan the sourcemanager's data structure for the file and for any line that experienced a macro expansion, emit a new record type that provides the post-expansion version of the line.
- When emitting any token that was also present in macro expansions, provide the post-expansion location(s?) of the token as secondary metadata on the normal records. Right now we normalize to pre-expansion locations (so the argument), but I think we'd want to be able to provide at least clickability even if we can't syntax highlight. (Although we probably could syntax highlight if we view macro expansions as an alternate parse stream. Also, it's probably fine if macro expansion syntax highlighting is super buggy.)
- When rendering to HTML, put the macro expanded lines in collapsible divs with an alternate presentation.
Reporter | ||
Comment 2•4 years ago
|
||
One thing I don't think I looked at sufficiently is if there's existing machinery relating to the clang -E
output that just stops after the preprocessor stage that can be reused. Even if that output is fundamentally lossy, it seems likely a correspondence could be hackily re-established between that output and the SourceManager's canonical locations.
Reporter | ||
Comment 3•2 years ago
|
||
It turns out clang has an HTMLRewriter class which has a method clang::html::HighlightMacros that knows how to produce an HTML formatted version of the macro expansions. A simple starting point might be to just try and have it generate that HTML for us and include that as a monolithic "diagnostic" record or some other kind of meta-data. With significant extra work we could create a modified version of the logic that would emit a token and analysis record stream, I guess nested inside a single JSON record again for sanity purposes which output-file would know how to handle (semi-recursively).
Assignee | ||
Comment 4•9 months ago
|
||
Reporter | ||
Comment 5•7 months ago
|
||
Unfortunately we ran out of funding for Nicolas to continue his fantastic work here so I am going to land the work from 1583635 (on a hobby basis) to try and get this landed as an alpha-gated feature[1] given how useful this capability already is.
1: Blockers to advancing to beta would be:
- Accessibility verification with NVDA. The general plan has been that the screen reader UX should resemble blame.
- Handle macro expansions that have been expanded multiple times but with different underlying definitions as part of an ".inc" file that is itself multiply included.
- Improved matching of tokens in the expanded text. As noted in https://github.com/mozsearch/mozsearch/pull/679#issuecomment-1866207558 a really amazing aspect of the existing implementation is that it attempts to match up records from within the expansion with tokens in the expansion so that the expansion is more than just the expanded, syntax-highlighted text (which would itself still be incredibly useful).
The feature gate rationale is documented in https://searchfox.org/mozilla-central/pages/settings.html although my rationale here is more like:
- Accessibility needs to be tested with NVDA before reaching beta (and honestly before reaching alpha).
- If there's stuff people would file correctness bugs for, it definitely needs to be alpha. This is partially covered by the existing "brittle" phrasing.
Reporter | ||
Updated•6 months ago
|
Assignee | ||
Comment 6•2 months ago
|
||
This updates the mozsearch clang plugin from the GitHub repository to
add support for macros.
This also includes changes that happened since the last sync and were
not propagated to mozilla-central yet.
Updated•2 months ago
|
Pushed by bugmail@asutherland.org: https://hg.mozilla.org/integration/autoland/rev/b07fcbaaaa48 Update MozsearchIndexer to export macro expansions. r=asuth
Comment 8•2 months ago
|
||
bugherder |
Reporter | ||
Comment 9•2 months ago
•
|
||
It's worth noting that, as observed in https://bugzilla.mozilla.org/show_bug.cgi?id=1910304#c1, it appears that we end up seeing a ~one-third increase in clang build times with the extra macro expansions. This is definitely a worthwhile trade-off, and it's also worth keeping in mind that there are really two cost factors potentially going on:
- The time required to generate the macro data.
- The time required for the analysis merge step; we acquire a lock on the ~source file, read in the existing analysis data, then merge and de-duplicate the contents of what we want to write to it and what was already there, write it out, then release the lock. (This is potentially subject to lock contention, which can limit the benefits of increased parallelism.) An increase in data size can then have cumulative cost impacts rather than one-off cost impacts.
If we add a small server component to the indexing process, it potentially becomes possible for us to improve our performance across both aspects. In particular:
- For 2, one could imagine a redis-ish server where the clang indexer simply streams all the records it wants to a server which is responsible for the merging/deduplication and flushing to disk as appropriate.
- The kythe indexer has a concept of "claiming" symbols which potentially covers both 1 and 2, as if you can ask the server if someone has already done the processing for something, you can potentially skip some work in addition to not needing to do the I/O related to it.
Description
•