YouTube, HTML5, and Comparing H.264 With Theora
David Gerard writes "Google Chrome includes Ogg support for the <video> element. It also includes support for the hideously encumbered H.264 format. Nice as an extra, but ... they're also testing HTML5 YouTube only for H.264 — meaning the largest video provider on the Net will make H.264 the primary codec and relegate the equally good open format Ogg/Theora firmly to the sidelines. Mike Shaver from Mozilla has fairly unambiguously asked Chris DiBona from Google what the heck Google thinks it's doing."
DiBona responded with concerns that switching to Theora while maintaining quality would take up an incredible amount of bandwidth for a site like YouTube, though he made clear his support for the continued improvement of the project. Greg Maxwell jumped into the debate by comparing the quality of Ogg/Theora+Vorbis with the current YouTube implementations using H.263+MP3 and H.264+AAC. At the lower bitrate, Theora seems to have the clear edge, while the higher bitrate may slightly favor H.264. He concludes that YouTube's adoption of "an open unencumbered format in addition to or instead of their current offerings would not cause problems on the basis of quality or bitrate."
Understanding TFA linked from your "equally good" link to a slashdot story? YOU FAIL IT!!! From TFA:
So just to recap, you have suggested that Ogg Theora video provides quality comparable to H.264 based on a study using a specific development-version Ogg Theora video codec and a specific H.264 encoder (x264) which is NOT the best encoder around, when it in fact has inferior SnR (the only thing the study was meant to test) as compared to x264, which has inferior SnR as compared to other H.264 encoders?
I don't know who failed bigger, you, Soulskill, or the peoples of slashdot who actually use the firehose... but you have all failed miserably.
With all that said; is there any reason they can't add Theora support later?
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Superior in objective PSNR Quality. OK.
How about CPU utilization? Are there any ultra-low-power decoding chips that play Theora?
H.264 already has a large install base of devices that play it. Is there enough of an advantage to Theora to warrant dumping all of those for new ones?
I remember when ogg first came out. I read slashdot regularly, saw all the information about how great it was, how since it was free it would be easily adopted by hardware makers who didn't need to pay for the privilege. I bought into the hype. I ripped my cd's to ogg files, paid extra money for a neuros player because it was one of the few players that handled ogg files.
Now, 5 years later I have a large collection of ogg files that are essentially useless. No one in the mainstream uses ogg, despite the superiority and price. Whenever I get a new player, I have to carefully read the specs to see if it will play my oggs. Few do. Luckily I have the cds and I can simply re-rip them to mp3s as I find the time/care too.
My guess is that the same thing will happen with theora. It may be superior. It may be cheaper. But I just don't think it will catch on. It's another example of the slashdot echo chamber.
Everyone has made a mythology about VHS somehow losing to Sony Beta despite being inferior. If you lived in that day, and walked into a store, there was really no significant difference between picture quality between VHS and Beta on the average TV of the day. There just wasn't. And, everyone forgets that the superiority of Beta was achieved by making the tapes only an hour long. VHS vs Beta was a silly argument. Beta claimed superior picture quality on TV's nobody had, but, VHS could store entire movies. To most people, Beta's claims sounded a lot like BS, while VHS was clearly better.
This is my sig.
html was never really designed to do much more than have a single "document" that can link to other "documents" on the internet. over time dynamic ideas were tacked on such as javascript but it still has never been designed in such a way that 'app-y' ideas can be created without hacking up the 'document' model.
Thus html 5 attempts to correct this by modifying the original 'document' model so that it now supports 'documents' and 'app-y' ideas. its not evil, its progress.
Are those sarcasm tags part of the HTML5 standard?
Stop misquoting the motto! It's "don't be evil", not "do no evil". Google is just saying that they don't intend to screw over their own customers, not that they intend to become the moral custodian of justice for the entire world.
An open-source browser cannot legally read h264 video, that is the real issue that people seem to have trouble to understand. That is why the HTML standard only mandates a format that is not impaired by any legal restrictions: Theora.
Not being able to legally play DVDs, Blurays, connect your ipod, etc. on linux are already big problems, we don't need another one.
Pirates have the advantage that they don't have to pay for patent licenses, so H.264 and Theora are both "free". But for law-abiding companies like Mozilla and Google, Theora is free and H.264 isn't.
Why would they install another browser when they could just click the "Click here to install silverlight and watch this video" button?
Reread Shaver's methadology:
A keyframe interval of 250 frames was used for the Theora encoding.
10 seconds is absurdly short for any kind of codec test. That's almost as long as the buffer would be, and current Thusndela builds don't include full buffer management. Plus he picked a pretty low motion section of the clip. He should the full clip. Current Theora builds are plenty fast; it'd be faster than realtime on a laptop.
In a real codec compare, CBR is often the best way to see differences between codecs and implementations, since that's where rate distortion really shows its stuff. How well a codec can preserve quality with high motion in a fixed buffer is a key differentiatior.
That said, I believe that the Theora+Vorbis results are substantially better than the YouTube 327kbit/sec. Several other people have expressed the same view to me, and I expect you'll also reach the same conclusion. This is unsurprising since we've been telling people that Theora is better than H.263
His primary quality comparison is between Theora and H.263, not H.264. H.263 is even older than VP3 which Theora is based on. As to H.264 he says:
In the case of the 499kbit/sec H.264 I believe that under careful comparison many people would prefer the H.264 video.
Yep. And it would be a huge differential if he'd picked a more challenging section of the source.
And while it doesn't have any impact on the comparison, no compressionist would use those frame sizes. We always try to round to the nearest mod16 value, so that we have macroblock alignment.
Thus 480x272 and 400x224 would be more efficient choices in both cases. 400x226 is particulary egregious, as it means the codec is really encoding at 400x240 internally with 14 lines of padding.
My video compression blog
This document describes the terms of the H.264 license. The license seems to cover both encoding and decoding.
WTFV. The keyframe interval is 10 seconds but the clip is nearly 5 minutes long.