[whatwg] Endianness of typed arrays

Fri Apr 27 05:40:52 PDT 2012

----- Original Message -----
> Sorry for joining the conversation late.
> 
> On Mar 28, 2012, at 1:39 PM, Kenneth Russell wrote:
> 
> > On Wed, Mar 28, 2012 at 12:34 PM, Benoit Jacob <bjacob at mozilla.com>
> > wrote:
> >> 
> >> 1. In webgl.bufferData implementation, don't call glBufferData,
> >> instead just cache the buffer data.
> >> 
> >> 2. In webgl.vertexAttribPointer, record the attributes structure
> >> (their types, how they use buffer data). Do not convert/upload
> >> buffers yet.
> >> 
> >> 3. In the first WebGL draw call (like webgl.drawArrays) since the
> >> last bufferData/vertexAttribPointer call, do the conversion of
> >> buffers and the glBufferData calls. Use some heuristics to drop
> >> the buffer data cache, as most WebGL apps will not have a use for
> >> it anymore.
> > 
> > It would never be possible to drop the CPU side buffer data cache.
> > A
> > subsequent draw call may set up the vertex attribute pointers
> > differently for the same buffer object, which would necessitate
> > going
> > back through the buffer's data and generating new, appropriately
> > byte-swapped data for the GPU.

I wanted to reply to the above that while indeed in theory it's never possible to drop these caches, in practice some heuristics might work well enough that it doesn't matter for real content; and in the worst case where we effectively can't ever drop caches, we have a +10% or +20% memory usage increase for typical WebGL applications (as buffers typically aren't the majority of a WebGL application's memory usage), which is bad but still better than not running WebGL at all.

> 
> That's true. But there are other plausible approaches. There's
> GL_PACK_SWAP_BYTES:
> 
>     http://www.opengl.org/sdk/docs/man/xhtml/glPixelStore.xml

This seems specific to desktop OpenGL and doesn't seem to exist in the core OpenGL ES 2.0 specification. Maybe as an extension?

> 
> Or code generation: translate the shaders to do the byte-swapping
> explicitly in GLSL. For floats you should be able to cast back and
> forth to ints via intBitsToFloat/floatBitsToInt.

Interesting; one would have to measure the performance impact of this. What makes me hopeful is that this should only slow down vertex shaders, not fragment shaders (which are the performance critical shaders in most applications). So this could well be the most practical solution so far. WebGL implementations must have a shader compiler anyways, for multiple reasons (validation, shading language differences, working around driver bugs).

This only applies to ARRAY_BUFFER, not ELEMENT_ARRAY_BUFFER buffers, but for the latter, we already have to keep a permanent CPU-side copy anyway for validation purposes, so the approach of swapping bytes in the implementation of drawElements should work well and not have any major downside (since having to keep a CPU-side copy was the main downside).

Cheers,
Benoit

> 
> But these days more and more big-endian systems have support for
> little-endian mode, which is probably the simplest approach. And
> honestly, there just don't seem to be WebGL-enabled user agents on
> big-endian systems. We've left a specification hole in a place
> that's easy to trip over, only out of concern for hypothetical
> systems -- in an era when little-endian has clearly won.
> 
> If the web isn't already de facto little-endian -- and I believe my
> colleagues have seen evidence that sites are beginning to depend on
> it -- then typed arrays force developers to test on big-endian
> systems to make sure their code is portable, when it's quite likely
> they don't have any big-endian systems to test on. That's a tax on
> developers they may not be willing or able to pay. I should know, I
> am one! :)
> 
>     https://github.com/dherman/float.js/blob/master/float.js
> 
> In a hilariously ironic twist of fate, I recently noticed that the
> endianness-testing logic originally had a stupid bug that made
> LITTLE_ENDIAN always true. It's now fixed, but I didn't detect the
> bug because I didn't have a big-endian JS engine to test on.
> 
> > Our emails certainly crossed, but please refer to my other email.
> > WebGL applications that assemble vertex data for the GPU using
> > typed
> > arrays will already work correctly on big-endian architectures.
> > This
> > was a key consideration when these APIs were being designed. The
> > problems occur when binary data is loaded via XHR and uploaded to
> > WebGL directly. DataView is supposed to be used in such cases to
> > load
> > the binary data, because the endianness of the file format must
> > necessarily be known.
> 
> I'm afraid this is wishful thinking. API's have more than a fixed set
> of use cases. The beautiful thing about platforms is that people
> invent new uses the designers didn't think of. Typed arrays are
> simple, powerful, and general-purpose, and people will use them for
> all kinds of purposes. Take my "float explorer":
> 
>     http://dherman.github.com/float.js/
> 
> There's no XHR and no WebGL involved in that code. (And I didn't
> invent that to make a point here -- I wrote it months ago when I
> wanted to visualize the bit patterns of floating-point numbers.)
> 
> Or imagine writing a crypto algorithm that optimizes bit
> manipulations by working on int-sized chunks via casts:
> 
>     http://mxr.mozilla.org/mozilla-central/source/security/nss/lib/freebl/arcfour.c#347
> 
> If you don't carefully endian-detect, the hashing logic will produce
> gibberish.
> 
> > Once DataView is available everywhere then the top priority should
> > be
> > to write educational materials regarding binary I/O. It should be
> > possible to educate the web development community about correct
> > practices with only a few high profile articles.
> 
> DataView is great and we're working on it in SpiderMonkey, but as I
> say that won't solve this issue. There are all kinds of
> possibilities for using typed arrays with casts purely in-memory.
> Where there's incentive to write unportable code, people will write
> unportable code. A few articles or tutorials won't change that.
> That's not a knock against webdevs; it's entirely rational behavior.
> 
> We shouldn't tax web developers by forcing them to implement
> portability logic they can't realistically test, nor should we
> expect that they will. Particularly when it's only for potential
> performance problems on hypothetical big-endian user agents. If
> worse comes to worst, and big-endian browsers crop up with serious
> performance problems, we can extend the API's to allow an optional
> boolean bigEndian flag (that defaults to false), and a way to
> feature-detect the system's native endianness (navigator.isBigEndian
> or something like that). That way, developers who don't opt in to
> writing portable logic get *performance degradation* instead of
> *incorrect behavior*.
> 
> Dave
> 
>