195262 - static atom tables

Assignee

Description

22 years ago

After some conversations with brendan, darin, and doug, I started to wonder how hard it would be to use gperf's generated code to make static atom tables. Turns out not to be too hard. Here's how it works.

XPCOM Support
-------------
- In XPCOM, we define a new concrete class, nsStaticAtom:

    class nsStaticAtom : nsIAtom {
      ...
      const char* mValue;
      ...
    }

  In this class, AddRef/Release are no-ops (they return 2 and 1 respectively).

- We define a new function NS_RegisterAtomLookupFunction(...) which takes a function that looks up an atom value. The function looks like:

    nsStaticAtom* lookup(const char* str, unsigned int len);

  (this is the signature of the function generated by gperf)

- When an atom is looked up, we look in the global hashtable for the atom. If it is not found, then we call each lookup function that has been registered. If a match is found in a lookup function, we return that. If it is not found, we create the atom and add it to the global hashtable.

Clients
-------
- The client has a wordlist, like html-atoms.txt, that looks like:

    class nsStaticAtom {};
    %%
    atom1
    atom2
    atom3
    .. etc ..

- gperf is used to generate the C++:

    % gperf --language=C++ --class-name=nsHTMLAtoms --duplicates --key-positions=* --struct-type --slot-name=GetValue() --omit-struct-type --global < html-atom-list.txt > nsHTMLAtomsImpl.cpp

- The generated code is included in some larger file, which handles the registration:

    #include "nsAtomTable.h"
    #include "nsHTMLAtomsImpl.cpp"

    nsresult NS_RegisterHTMLAtoms() {
      return NS_RegisterStaticAtoms(nsHTMLAtoms::in_word_set);
    }

And that's about it. A few details:

- I initially thought that for performance, we'd call the lookup functions first. Actually, the global hashtable should always be consulted first, because it is possible for someone to look up an atom before all the lookup functions are registered.

- So then I thought, what's the gain here? Is there really a performance win if we have to consult the dynamic structure first? The real win is that we aren't creating hundreds of atoms on the heap; instead they're permanent, read-only structures mapped directly into memory. Currently all dynamic atoms make copies of the strings that created them, which is bad. I see 2019 atoms created in the tinderbox logs just for startup stuff, so even if the average string size of an atom is 8 characters * 2 bytes/char + the 4 byte overhead of an atom (20 bytes per atom), that's a potential savings of about 40k of heap (that lives for the lifetime of the product).

- The one problem I've run into is nsIAtom::GetUnicode(), which wants shared access to the internal unicode buffer, which there wouldn't be for these atoms: they hold raw char* strings. My thought was that since nsIAtom isn't frozen, we just dump GetUnicode() or even provide an

    attribute AUTF8String value;

  Most atoms are just ASCII inflated into PRUnichars, so the conversion would be cheap. (And for nsStaticAtoms, we could decide that they are ASCII by definition, making the GetValue() implementation easy.)
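The lookup order described above (global hashtable first, then each registered lookup function, then create a new heap atom) can be sketched roughly as follows. This is only an illustration under assumptions, not the code from the patches: Atom, StaticAtom, DynamicAtom, AtomTable, and LookupHtmlAtom are hypothetical stand-ins for nsIAtom, nsStaticAtom, the global atom table, and a gperf-generated in_word_set function, and the hand-written linear search stands in for gperf's perfect hash.

```cpp
#include <cstddef>
#include <cstring>
#include <memory>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for nsIAtom.
struct Atom {
    virtual ~Atom() = default;
    virtual const char* Value() const = 0;
};

// Static atom: points at a string literal and never copies it.
struct StaticAtom : Atom {
    const char* mValue;
    explicit StaticAtom(const char* v) : mValue(v) {}
    const char* Value() const override { return mValue; }
};

// Dynamic atom: owns a heap copy of the string that created it.
struct DynamicAtom : Atom {
    std::string mCopy;
    DynamicAtom(const char* s, unsigned int len) : mCopy(s, len) {}
    const char* Value() const override { return mCopy.c_str(); }
};

// Same shape as the lookup function signature given in the description.
using LookupFn = StaticAtom* (*)(const char* str, unsigned int len);

class AtomTable {
public:
    void RegisterLookupFunction(LookupFn fn) { mLookups.push_back(fn); }

    Atom* GetAtom(const char* str) {
        const unsigned int len = static_cast<unsigned int>(std::strlen(str));

        // 1. Always consult the global (dynamic) table first: an atom may
        //    have been created before its lookup function was registered.
        auto it = mDynamic.find(str);
        if (it != mDynamic.end()) return it->second.get();

        // 2. Ask each registered static lookup function.
        for (LookupFn fn : mLookups) {
            if (StaticAtom* atom = fn(str, len)) return atom;
        }

        // 3. Not found anywhere: create a heap atom and remember it.
        std::unique_ptr<Atom> atom(new DynamicAtom(str, len));
        Atom* raw = atom.get();
        mDynamic.emplace(std::string(str, len), std::move(atom));
        return raw;
    }

    std::size_t DynamicCount() const { return mDynamic.size(); }

private:
    std::unordered_map<std::string, std::unique_ptr<Atom>> mDynamic;
    std::vector<LookupFn> mLookups;
};

// Hand-written stand-in for a gperf-generated lookup function.
// A real generated function would use a perfect hash, not a linear scan.
StaticAtom* LookupHtmlAtom(const char* str, unsigned int len) {
    static StaticAtom atoms[] = {
        StaticAtom("atom1"), StaticAtom("atom2"), StaticAtom("atom3")};
    for (StaticAtom& atom : atoms) {
        if (std::strlen(atom.mValue) == len &&
            std::memcmp(atom.mValue, str, len) == 0) {
            return &atom;
        }
    }
    return nullptr;
}
```

In this sketch, an atom requested before LookupHtmlAtom is registered stays in the dynamic table and keeps being returned from there afterwards, which is why the dynamic table has to be checked first. Atoms found through a lookup function never touch the heap.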

- update the nsIAtom API and callers (Alec Flett, 22 years ago; 75.48 KB, patch)
- update atom API and callers, and add nsStaticAtom support (Alec Flett, 22 years ago; 89.85 KB, patch; darin.moz: superreview-)
- update atom API and callers, and add nsStaticAtomSupport v1.1 (Alec Flett, 22 years ago; 90.48 KB, patch)
- update atom API and callers, and add nsStaticAtomSupport v1.11 (Alec Flett, 22 years ago; 89.19 KB, patch)
- update atom API and callers, nsStaticAtomSupport v1.2 (Alec Flett, 22 years ago; 90.70 KB, patch; darin.moz: superreview-)
- review comments (Darin Fisher, 22 years ago; 4.42 KB, text/plain)
- update atom API and callers, nsStaticAtom support v1.3 (Alec Flett, 22 years ago; 90.87 KB, patch; dbaron: review-, darin.moz: superreview+)
- updated patch (Alec Flett, 22 years ago; 91.96 KB, patch; dbaron: review-, alecf: superreview+)
- updated patch, v1.5 (Alec Flett, 22 years ago; 93.74 KB, patch)
- fix performance (Fragment) (Alec Flett, 22 years ago; 10.26 KB, patch)
- 8 day Tp graph from btek (Brendan Eich [:brendan], 22 years ago; 5.46 KB, image/png)