<html><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div>[Speaking with my Apple hat on.]</div><div><br></div><div>We agree that upgrading the hardware is not an acceptable answer to this question. Many web-enabled devices have much less CPU power than the average laptop or desktop (think mobile devices), but many have quite capable GPUs. On desktop machines too, hardware acceleration of video playback is essential for optimum performance, and for acceptable CPU usage with large videos in non-trivial content (e.g. content with CSS effects on the video, or lots of expensive-to-render content under or over the video).</div><div><br></div><div>Ignoring rotations, overlays etc. in order to get reasonable performance is an option, but should be considered a bug in the implementation.</div><div><br></div><div>You could also imagine an attribute on the video element akin to the "wmode" attribute for plug-ins, which allow the page author to opt-in to faster rendering, with the expectation that the visual rendering will suffer. However, we feel strongly that the video spec should not be encumbered by an attribute of this kind.</div><div><br></div><div><div>Taking the video full-screen is an approach that makes a lot of sense for mobile devices. It's unfortunate that the spec shies away from the full-screen issue.</div><div><br></div><div><div>In an ideal world, hardware acceleration of video playback would "just work", and the spec would have to say no more about it. We believe it's possible to do hardware acceleration of video (and other animating web content) and preserve rendering in many cases. WebKit actually has some experimental code for this.</div><div><br></div><div><div>I'm not sure how much an HTML spec will be able to say about triggers that may cause the video rendering to fall off the hardware-accelerated path, since many of those triggers will be CSS-related, and implementation details will differ between browsers. From our experience in WebKit, those triggers may include:</div><div>* clipping (via overflow on the video or an ancestor)</div><div>* overlapping elements</div><div>* blending (opacity)</div><div>* being affected by transforms (via SVG, or CSS transforms on video or an ancestor)</div><div>* CSS masks or reflections</div><div>* CSS box decorations (border, background)</div><div><br></div></div></div><div>If the spec does say something about performance of <video>, I think it should be no more than a note that performance may differ across browsers, and can be affected in various ways that may be non-obvious to the page author, related to the layout and styling of the video and other elements on the page.</div><div><br></div><div>Simon</div><div><br></div></div><div><div>On Apr 28, 2009, at 6:07 PM, Ian Fette wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite">Upgrade the hardware is not an acceptable answer. Video acceleration is meant to offload work from CPU (especially on constrained devices, e.g. mobile). You want to be able to do compositing on video card, so that you don't have to read the video out of the video card's memory, transfer it over the bus, to the CPU, do some transforms/overlays/..., and then send it back to the video card for display. Doing that absolutely kills framerate.<div> <br></div><div>As we (browsers) implement <video> I think a lot of us are starting with software rendering. Certainly we want to be able to do hardware acceleration at some point. Perhaps some things we will still be able to do in hardware, e.g. overlays of HTML or certain transforms (if the video device supports saying "take this, translate it, and composite" and the rendering engine only needs geometry data.) Other things we might not be able to do in hardware (e.g. if you have "transparent" flash video on top, and Flash wants to know what pixels are underneath it, then we would have to read that data off of the video card, send it to CPU, ...)</div> <div><br></div><div>I think what would be helpful is for browsers who are implementing <video> with hardware acceleration to publish information on what would make them fall back to software rendering. If it turns out that list is roughly similar across implementations, perhaps it could be added as a note in the spec that doing the following certain things may cause performance implications. We're probably not ready to do that yet given that we don't have enough implementation experience, but that would be my suggestion for how to move forward.</div> <div><br></div><div>-Ian<br><br><div class="gmail_quote">On Tue, Apr 28, 2009 at 5:59 PM, Ian Hickson <span dir="ltr"><<a href="mailto:ian@hixie.ch">ian@hixie.ch</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;"> <div><div></div><div class="h5">On Sat, 28 Mar 2009, Benjamin M. Schwartz wrote:<br> ><br> > The <video> tag has great potential to be useful on low-powered<br> > computers and computing devices, where current internet video streaming<br> > solutions (such as Adobe's Flash) are too computationally expensive.<br> > My personal experience is with OLPC XO-1, on which Flash (and Gnash) are<br> > terribly slow for any purpose, but Theora+Vorbis playback is quite<br> > smooth at reasonable resolutions and bitrates.<br> ><br> > The <video> standard allows arbitrary manipulations of the video stream<br> > within the HTML renderer. To permit this, the initial implementations<br> > (such as the one in Firefox 3.5) will perform all video decoding<br> > operations on the CPU, including the tremendously expensive YUV->RGB<br> > conversion and scaling. This is viable only for moderate resolutions<br> > and extremely fast processors.<br> ><br> > Recognizing this, the Firefox developers expect that the decoding<br> > process will eventually be accelerated. However, an accelerated<br> > implementation of the <video> spec inevitably requires a 3D GPU, in<br> > order to permit transparent video, blended overlays, and arbitrary<br> > rotations.<br> ><br> > Pure software playback of video looks like a slideshow on the XO, or any<br> > device with similar CPU power, achieving 1 or 2 fps. However, these<br> > devices typically have a 2D graphics chip that provides "video overlay"<br> > acceleration: 1-bit alpha, YUV->RGB, and simple scaling, all in<br> > special-purpose hardware. Using the overlay (via XVideo on Linux) allows<br> > smooth, full-speed playback.<br> ><br> > THE QUESTION:<br> > What is the recommended way to handle the <video> tag on such hardware?<br> <br> </div></div>Upgrade the hardware.<br> <div class="im"><br> <br> > There are two obvious solutions:<br> > 0. Implement the spec, and just let it be really slow.<br> > 1. Attempt to approximate the correct behavior, given the limitations of<br> > the hardware. Make the video appear where it's supposed to appear, and<br> > use the 1-bit alpha (dithered?) to blend static items over it. Ignore<br> > transparency of the video. Ignore rotations, etc.<br> > 2. Ignore the HTML context. Show the video "in manners more suitable to<br> > the user (e.g. full-screen or in an independent resizable window)".<br> ><br> > Which is preferable? Is it worth specifying a preferred behavior?<br> <br> </div>>From HTML's point of view, all are acceptable. From the user's point of<br> view, 1 and 2 are preferable, probably at the user's option.<br> <br> I don't know what else to tell you. :-)<br> <font color="#888888"><br> --<br> Ian Hickson U+1047E )\._.,--....,'``. fL<br> <a href="http://ln.hixie.ch/" target="_blank">http://ln.hixie.ch/</a> U+263A /, _.. \ _\ ;`._ ,.<br> Things that are impossible just take longer. `._.-(,_..'--(,_..'`-.;.'<br> </font></blockquote></div><br></div></blockquote></div><br></body></html>