798172 - add mfbt/Endian.h

Reporter

Description

•

13 years ago

Attached file strawman Endian.h file (obsolete) — Details

There's quite a bit of gnarly byte-swapping code lurking in the code base, most of it reliant on some form of IS_BIG_ENDIAN or IS_LITTLE_ENDIAN. As those macros come from #include'ing prtypes.h (prtypes.h includes MDCPUCFG, which provides said macros), and we want prtypes.h to go away, we need to consolidate all of these definitions somehow, into something that doesn't rely on prtypes.h. There's also offhand comments in mfbt/SHA1.h about how we should more carefully define MOZ_LITTLE_ENDIAN and/or MOZ_BIG_ENDIAN in an Endian.h header. As such macros are still useful for defining structure layouts, some low-level gfx bits, and so forth, we should provide them. I've attached a strawman Endian.h file; there may be bugs. It needs a little more documentation and possibly some optimization of the actual read/write of different endiannesses (clang will optimize the Read and Write functions fine; gcc has trouble; I'm not sure what MSVC does with them). The MOZ_{LITTLE,BIG}_ENDIAN bits I plan to borrow liberally from jscpucfg.h. I'm not entirely happy with the API yet. The idealistic part of me (http://commandcenter.blogspot.com/2012/04/byte-order-fallacy.html) would like people to just use the Read/Write functions: m16Bit = LittleEndian::ReadUInt16(buffer); m32Bit = BigEndian::ReadUInt32(buffer + sizeof(m16Bit)); and so forth. But for the sake of obviousness and not tediously rewriting a bunch of code, the Swap* functions may actually be more useful: m16Bit = NativeEndian::SwapFromLittleEndian(m16Bit); m32Bit = NativeEndian::SwapFromBigEndian(m32Bit); or somesuch. Suggestions welcome.

Joshua Cranmer [:jcranmer]

Comment 1

•

13 years ago

Your Write function has a nasty overflow bug lying in wait: (UINT8_MAX << (8 * i)); C99 appears to guarantee that UINT8_MAX would be only an int, which means the left-shift will produce undefined behavior for 32-bit (0xFF << 24 is actually undefined behavior!). Not to mention you forgot to right-shift the values back into 8 bits :-P. Thinking harder, your Read function also has a subtle undefined behavior bug as well too (hint: left-shifting a 1 into the sign bit of a signed number is undefined behavior). As for your mention of the blog post, I'll point out that some of his points are just plain wrong: 1. Hide the code in a standard function/macro (you should do this with any code which is mind-numbingly simple and also very easy to mess up), and there's no code-difference either way. 2. 95% of the use cases are already in situations where things are properly aligned (at least for 32-bit values). 3. uint32_t. 4. Modern optimizers can probably spot the byte-swap pattern anyways, and reduce it to explicit byte swap instructions on architectures that have them. Granted, they should probably also be able to optimize the native-endian-read pattern as well. 5. I do agree here, actually. 6. I don't do enough network/binary file programming to comment here. I will point out, expounding on my point #4, it's probably better not to rely on the compiler to first unroll the loop and then do pattern-recognition for byte swapping (this may be why gcc doesn't pick up on optimization).

strawman Endian.h file 13 years ago Nathan Froyd [:froydnj] 6.01 KB, text/plain		Details
strawman Endian.h file 13 years ago Nathan Froyd [:froydnj] 6.12 KB, text/plain		Details
strawman Endian.h file 13 years ago Nathan Froyd [:froydnj] 7.03 KB, text/plain		Details
strawman Endian.h file, v2 13 years ago Nathan Froyd [:froydnj] 14.64 KB, text/plain	Waldo : feedback+	Details
part 1 - add mfbt/Endian.h 12 years ago Nathan Froyd [:froydnj] 16.03 KB, patch		Details \| Diff \| Splinter Review
part 2 - add tests for mfbt/Endian.h 12 years ago Nathan Froyd [:froydnj] 16.80 KB, patch		Details \| Diff \| Splinter Review
part 3 - convert SHA1.cpp to use Endian.h 12 years ago Nathan Froyd [:froydnj] 1.55 KB, patch		Details \| Diff \| Splinter Review
part 4 - convert the jsclone bits to use Endian.h 12 years ago Nathan Froyd [:froydnj] 7.04 KB, patch		Details \| Diff \| Splinter Review
part 5 - convert xdr bits to use Endian.h 12 years ago Nathan Froyd [:froydnj] 5.42 KB, patch		Details \| Diff \| Splinter Review
part 1 - add mfbt/Endian.h 12 years ago Nathan Froyd [:froydnj] 17.05 KB, patch	Waldo : review-	Details \| Diff \| Splinter Review
part 2 - add tests for mfbt/Endian.h 12 years ago Nathan Froyd [:froydnj] 16.39 KB, patch		Details \| Diff \| Splinter Review
part 1 - add mfbt/Endian.h 12 years ago Nathan Froyd [:froydnj] 20.77 KB, patch		Details \| Diff \| Splinter Review
part 2 - add tests for mfbt/Endian.h 12 years ago Nathan Froyd [:froydnj] 16.53 KB, patch	Waldo : review+	Details \| Diff \| Splinter Review
Part 1, revised - add mfbt/Endian.h 12 years ago Jeff Walden [:Waldo] 20.41 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
part 3 - convert SHA1.cpp to use Endian.h 12 years ago Nathan Froyd [:froydnj] 3.46 KB, patch	Waldo : review+	Details \| Diff \| Splinter Review
part 4 - convert the jsclone bits to use Endian.h 12 years ago Nathan Froyd [:froydnj] 7.38 KB, patch	Waldo : review+	Details \| Diff \| Splinter Review
part 5 - convert xdr bits to use Endian.h 12 years ago Nathan Froyd [:froydnj] 5.49 KB, patch	Waldo : review+	Details \| Diff \| Splinter Review
workaround-clang-__builtin_bswap16.patch 12 years ago Chris Peterson [:cpeterson] 1.69 KB, patch	Waldo : review+	Details \| Diff \| Splinter Review