Browser Vendors Force W3C To Scrap HTML 5 Codecs
snydeq writes "Major browser vendors have been unable to agree on an encoding format they will support in their products, forcing the W3C to drop audio and video codecs from HTML 5, the forthcoming W3C spec that has been viewed as a threat to Flash, Silverlight, and similar technologies. 'After an inordinate amount of discussions on the situation, I have reluctantly come to the conclusion that there is no suitable codec that all vendors are willing to implement and ship,' HTML 5 editor Ian Hickson wrote to the whatwg mailing list. Apple, for its part, won't support Ogg Theora in QuickTime, expressing concerns over patents despite the fact that the codec can be used royalty-free. Opera and Mozilla oppose using H.264 due to licensing and distribution issues. Google has similar reservations, despite already using H.264 and Ogg Theora in Chrome. Microsoft has made no commitment to support <video>."
See, this is something that open source accomplishes that stupid fucking arrogant businesses will never get. When something is obsolete or no longer needed, it gets ditched or replaced by something better. Don't keep it around because someone thinks that they have the right to continue being in business even though their shit is a decade out of date. Its a hard and cold life for the developer whose project gets ditched (And sometimes I feel bad for them), but in the end, the user wins big and things evolve.
But of course, the rest of the world lives in reality, so the user loses.
Fuck you Microsoft. Die already!
Fuck you Adobe. Die already!
Fuck you Java. Die already!
Fuck you too Realnetworks. Just because.
Perhaps it is a stupid question but why do the vendors have a say what goes into the spec and what doesn't? Isn't it up to them to choose to implement the spec fully or not? FFS just make it Ogg Vorbis/Theora and if Apple doesn't want to support it then Safari can just not support that part of the spec. It isn't like any of the browser are 100% complient anyway.
Apple, for its part, won't support Ogg Theora in QuickTime, expressing concerns over patents despite the fact that the codec can be used royalty-free.
Or perhaps their concern is precisely because of this fact?
What's with Apple? They had no problem paying Sorenson Media in the past. What, specifically, is wrong with Theora?
sig: sauer
So not counting Microsoft (which has had nothing to say on the matter, and therefore cannot be counted one way or another), the only party blocking this is Apple, and they're blocking it based solely on a trumped-up and prima facie invalid argument, and furthermore, an argument that has never once impeded any of Apple's past actions. In other words, "BAWWWWW they din pik my pet codec BAWWWWW i wants every1 usin only my codec BAWWWWW BAWWWWW BAWWWWW!"
Seriously, folks; QuickTime uses a plug-in architecture for a reason. If Apple were truly concerned about Theora and patents, all they'd need to do is implement it as a plug-in -something they should have absolutely no trouble doing, as it's their own architecture- which could then be trivially removed if the need ever arose. But no; this is a step back towards the bad old days of Not-Invented-Here syndrome at Apple.
It's not just Apple, though. MS will probably not implement Theora either. Google will not be using it for anything substantial because of substandard quality per bit. The fact is that nothing is gained by making it a spec requirement. Either vendors will implement Theora or they won't, having it in the spec won't change anything. So why even have it, if that's the case?
They don't know who to pay.
First...
Have you even *used* Safari, Webkit, or any Webkit derived browsers?
Why would they care what Apple/Webkit supports? Um, besides the fact that 65% of mobile browsing is currently with a Webkit based browser, golly, I can't think of any.
Someone please mod this idiot Troll.
Second...
But, I agree with others ... that they shouldn't care what *any* browsers currently support. Make it part of the spec and the users will decide. FireFox users will use ogg, Webkit based browsers will use h.264... I really don't see the issue here.
Seems to be more of a 'if you won't play my game, we just won't play ... I'm taking my ball and going home' behavior that really isn't helping the situation to me...
How about making the browser use system (DirectShow on Windows, whatever-it's-called on Linux) codecs, so everybody could be using whatever codec they want. Look, a lot of media players on Windows (like WMP and MPC) use DirectShow, so thew users can install additional codecs.
Why they want to include the codecs in the browsers. This way is worse. If system codecs were used, then the sites could choose whether to use h.264, ogg or some other codec, like XviD.
Also, this way all of the patent/license/whatever issues for the browser vendors would go away. And if the users are watching video files on their computers they most likely have codecs already installed.
Fuck Apple too. They are as bad as it comes. No less than microsoft.
If no browser will support the codecs then webmasters wont use html 5 and stick with html4. When IE owned a significant marketshare a couple of years ago the web evolution slowed down to a halt. Firefox can't adopt H.264 because its patented and Firefox can be shutdown if a lawsuit over infringement takes place.
And Firefox does not have a significant enough marketshare for developers to care about Ogg Vorbis/Theora. Besides all the professional tools do not support it so it wont ever be used. It wont ever be used because professional tools do not support. Its a catch-22 just like Microsoft Windows and Office. You can't ever leave the platform.
If silverlight and flash work on 95% of the market why switch?
http://saveie6.com/
My understanding is that Apple doesn't want to work on QuickTime because it is buggy and no one wants to fix it.
This is essentially what is happening. FF3.5 shipped with support for Theora.
Nerd rage is the funniest rage.
Vendors never actually mean what they say. Here are the real reasons:
Apple won't support a codec that's incompatible with its huge installed base of ipods and iphones. They don't care about royalty fees because most Safari users pay for an OS X licence, and they want the free browsers to look sub-par compared with theirs.
Microsoft won't support a codec that makes the web more reliable for non-Windows users - especially Linux users. They don't care about royalty fees because all IE users pay for a WIndows licence, and they want the free browsers to look sub-par compared with theirs.
Google, Opera and Mozilla won't support anything that puts them at risk of needing to pay royalties on the huge number of free downloads they give away.
Nobody actually cares about end users or developers. If you think they do, you're kidding yourself.
You can still make use of the tag in a cross platform way. Video For Everybody Is a simple set of code that uses the video tag with only two input files - an ogg and an mp4 - and lets the tag work for, well, everyone. IE6? Check. Safari? Check. iPhone? Yep.
It falls back to whatever method works for playback - including using Flash to play the h.264 if it needs to.
It's pretty funny to see so many people bitching about Apple not supporting ogg when Microsoft ignores the tag altogether. Everyone, start supporting the video tag today as widespread use is the only way to get big companies to fully adopt it - perhaps that will motivate Apple to someday support ogg.
"There is more worth loving than we have strength to love." - Brian Jay Stanley
Fuck you Microsoft. Die already!
Fuck you Adobe. Die already!
Fuck you Java. Die already!
Fuck you too Realnetworks. Just because.
Not "Just because". Fuck Real for producing crappy software that doesn't fit in anywhere at makes it annoyingly non-trivial to download things I want to watch.
Fuck Adobe for Flash. Seriously, I don't need vector graphics in my web browser. I'd love to have embedded .wmv/.avi/.mpeg files, whatever, because I can play those with mplayer which DOES NOT SUCK. As opposed to flash.
Fuck Microsoft for being the great browser market retardant. And in general for writing shitty software which doesn't do what I want it to (heck, I can't even get XP to install; epic fail).
And fuck Apple for being such control freaks. Well, first, fuck 'em for not helping fix this browser shit. Secondly, fuck them for being a worse control freak than Microsoft could ever be. I recently played with an iPhone (display/sales demo); among the top 25 apps in the store is one that displays scantily clad women, which are "as naked as Apple will let us get away with". FFS, Apple. Don't decide whether I'm going to watch porn on my phone. And you include a web browser---is that porn-filtered too? Assholes.
But don't fuck with Java. It's free software. It works for what it does: sorting algorithm animations and interactive Rubik's cube algorithm display. Java is OK, when used in moderation.
Flame on ;-)
what they did, is just one brain fart out of this quote from:
http://lists.whatwg.org/htdig.cgi/whatwg-whatwg.org/2009-June/020620.html
"I considered requiring Ogg Theora support in the spec, since we do have
three implementations that are willing to implement it, but it wouldn't
help get us true interoperabiliy, since the people who are willing to
implement it are willing to do so regardless of the spec, and the people
who aren't are not going to be swayed by what the spec says."
There's no word about "cutting theora" just considerations that some companies won't comply with the spec.
But I guess this is somehow normal with new specs...
You realise that Snow Leopard, shipping in September, comes with a new version of QuickTime, right? QuickTime 7 is not 64-bit clean, which is a large part of the reason for the rewrite.
I am TheRaven on Soylent News
In a way, that's what's happening. Vendors are flat-out declaring they will not cooperate on open-source unpatented web code. What's getting dropped here is not just HTML5 but the W3C's reason for existence, and thus the document neutrality of the Web. This is a bad day.
What we really need in HTML standarization:
Ignoring the tremendous improvements in the Thusnelda branch, if YouTube suddenly switched from severe H.26whatever overcompression to stock Theora with optimal settings (and everyone had libtheora and HTML 5 browsers), no one would notice the difference.
Untrue. Xiph has made heroic progress with Theora, but it's still a decade-old codec design and bitstream, and it's hard to imagine it catching up with xvid, let alone a good H.264 implementation.
YouTube certainly has quality issues, but things can be bad in more than one way at a time. There's nothing that less efficient codec would help them with. Note their top bitrate is 1280x720p30 at 2 Mbps.
Some samples compared Xiph's latest demo clips, with the same source encoded with VC-1 and x264 are here:
http://cid-bee3c9ac9541c85b.skydrive.live.com/browse.aspx/.Public/BBB%7C_Compare
x264 can do 640x352 with higher per pixel-quality than Theora can do at 400x224 at the same bitrate.
My video compression blog
Audio video codecs are outside the scope of HTML. Whatever it says in the HTML 5 spec about video codecs, that will not magically change the last 20 years of digital audio video away from MPEG to something else.
The current audio video standard is ISO MPEG-4 (2001) and the codecs are H.264/AAC. Supporting this standard is not an academic issue because the world is full of content as well as hardware and software players and authoring tools that conform to this standard. It's also the video in Flash and in YouTube, which is considered the de facto standard in "Web video". When people talk about Web video they usually mean YouTube or something very like it. They are talking about MPEG-4.
The MPEG-4 content that you find in the world and on the Web today includes:
- every song ever offered for sale in or purchased from iTunes Store
- every song ripped from a CD by iTunes since 2002
- every video ever made on a cell phone (3GPP is part of MPEG-4) including the iPhone's recent shoot, edit, upload to YouTube feature which is H.264/AAC
- every video on YouTube is stored as MPEG-4 (no matter what format you originally uploaded)
- almost all of the video that runs in Adobe Flash, excluding 320x240 movies which may be the old codec
- all of the consumer video shot on solid state storage, and most of it from a few years before that
- all Podcast video is H.264 and most Podcast audio is AAC
- Blu-Ray
Nobody has explained how all of this content would be transcoded to Ogg or other non-standard format in order to be published on the Web. Where would the computing time come from? How would it be practically done? What are you going to tell someone who wants to upload a video from their camera or phone directly to the Web? That they should transcode it into a non-standard audio video codec first?
The players are very important also, because they have H.264/AAC decoding HARDWARE, which enables them to work efficiently enough to run on batteries. You can't drop a new software codec into these, you have to drop in a replacement audio video decoder chip. These include:
- every iPod and all of their competitors, except for the ones that only play MP3 which is part of MPEG-2
- every PC with a recent NVIDIA GPU can decode H.264/AAC without breaking a sweat or busting its batteries because it happens in the GPU
- Internet set-top boxes such as AppleTV and Netflix
- PlayStation3 and other game boxes
- even the Zune has MPEG-4 hardware in it, although somewhat underutilized from what I hear
Even software players cannot so easily be modified to support a non-standard codec, because of the scope of the MPEG-4 support. We're talking about every Mac and every PC in the world, because they all have one or both of these:
- every QuickTime/iTunes since 2002 is MPEG-4
- every Adobe FlashPlayer version 9 or 10 is MPEG-4
The reason those 2 match both each other and all the hardware players is because of the benefits of standardization, which took place almost a decade ago for MPEG-4 and goes back further to previous MPEG versions. If you, or Mozilla, or anyone, wants to make an audio video player, they only need to conform to the MPEG-4 standard to enable their player to play all of the content from QuickTime/iTunes and Flash. You can come along in 2009 and decide to get your feet wet in audio video players and simply by following a published ISO specification you can have instant equality with QuickTime and Flash and others. Again, the benefit of standardization.
A very important consideration that is often completely ignored by Web-centric people as they talk about audio video is the authoring tools! People who make audio and video all day long also want to publish their work on the Web. MPEG-4 is standardized QuickTime, so there is not just 8 years of MPEG-4 authoring tools right now, there is almost 18 years of digital audio video practice realized in MPEG-4. A key feature here is that these tools must not make content that has a "content tax" on it, like
From the HTML 4.01 Spec:
src = uri [CT]
This attribute specifies the location of the image resource. Examples of widely recognized image formats include GIF, JPEG, and PNG.
Now true, that doesn't say that any formats are recommended, well at least not until you head to the W3C PNG specification:
http://www.w3.org/Graphics/PNG/
They also have a nice section on SVG:
http://www.w3.org/Graphics/SVG/
Microsoft, Apple, Google, Amazon what's the difference? All steal money from devs and control with walled gardens.
What's really funny is, Youtube has pretty poor H.264 quality.
By tweaking x264 settings(B-frames and motion detection in particular), I've encoded videos to the same quality as Youtube at 1mbit.
(Mostly FRAPS vids of me playing games)
Well, it's not like YouTube sells more ads with better looking video, and I doubt 90% of the uploads get watched more than a dozen times. They probably have some pretty deep metrics about the watts/cents per minute of video encoding and tune for that.
YouTube is also really only a good example of YouTube, since they're a massively money-losing operation funded by a very rich company. No one else does it like YouTube, and ever other video site is going to average a lot higher views/clip, so they can afford more CPU time to improve quality.
Or maybe they're just not very good at video compression :).
Beyond B-frames, they're not using 8x8 blocks or CABAC entropy coding, both of which can offer substantial efficiency improvements.
My video compression blog
From the HTML5 spec:
"This specification does not specify which image types are to be supported."
Does Ogg even have hardware acceleration at this point?
Nope. I don't know if anyone's even scoped a hardware implementation of VP3. There have been some VP6/7 DSP implementations, but no ASIC ones (ASIC have better power consumption).
Now, Theora is a pretty simple codec, so doing it in hardware would be a lot simpler than H.264 and probably simpler than VC-1. But it can take quite a few engineering years to refine a decoder for performant playback.
Of couree, performance isn't just the video decoder. It's the video and the audio decoder, and the whole pipeline to make sure you get smooth in-sync playback. Media pipelines are really hard, particularly if you're trying to implement them as part of a browser rendering model that never had to worry about timestamps, decoder buffers, etcetera.
My video compression blog
Well, YouTube has three sets of settings:
Low bitrate H.263 + MP3
HQ bitrate H.264 + AAC-LC
HD bitrate H.264 + AAC-LC
The low bitrate, for whatever reason is keeping to the specs they've been using since launch, which are using the xvid implementation of old Sorenson Spark H.263 v1/MPEG-4 Part 2 Short Header. Maybe for device compatibility? Anyway, That's a codec about as old as the Theora bitstream, so we wouldn't expect it to be much better.
But I don't know that YouTube thinks it's "good enough" - they're offering higher quality modes, and that's what you get by default on the iPhone and other platforms. For whatever reason they're keeping around a legacy version, likely backwards comaptibility with some clients that don't do H.264 for whatever reason.
For the their high quality streams, Theora isn't competitive in quality. And for the highly compatible streams, Theora isn't competitive in compatibility.
So YouTube saying that Theora doesn't make sense for them makes sense to me. Therora doesn't an advantage in quality or compatibility for the streams they're doing.
Also, Big Buck Bunny isn't the best clip to extrapolate from, as it's really high quality lossless animation. To really see what YouTube needs to handle, try some lousy webcam, DV, and VOB rips. That's where H.264's in-loop deblocking filter give it a big advantage over other codecs, because it just gets smoother intead of blocky as the content gets more challenging.
Not to dismiss the excellent development work Xiph has done on Theora. The posts have been a fascinating read. But it's not plausible to me that anyone can make a business case for Theora over H.264, VC-1, or ASP licensing is available; the reduced bandwidth costs would be bigger than the actual real-world licensing fees for the real world examples I've thought of.
Theora's sweet spot would be in cases where MPEG-LA codec licenses simply aren't available for whatever reason. I imagine a fully refined Theora decoder would need fewer MIPS/pixel than H.264 High Profile, and perhaps even Baseline. But even in those cases, VC-1 Main Profile will probably offer similar performance with significantly better efficiency.
My video compression blog
What are your mobile devices, and what's your media player? All the current ones that are meant for media playback include H.264 ASICs. And those are getting crazy good; the Zune HD's going to support 720p HD playback using the NVidia Tegra.
Lacking an ASIC, any Theora on devices would need to be done in sofware, and even a simple codec can be extremely taxing on a 400-600 MHz ARM. Even if it's playable it's going to eat battery like no tomorrow. I can imagine a really good implementation being able to maybe do 320x240 30p 500 Kbps Theora in software on a 600 MHz ARM, but that'd require a whole lot of tuning.
The VP3 bitstream predates device media, so I doubt it has any particular design tuning for them. It's very much a codec designed for x86, with a PPC port.
My video compression blog
Interesting, considering that I don't remember ever hearing about ASP or AVC hardware decoders until after those formats became popular. It would seem that the popularity of the codec defines whether a hardware decoder exists, not the other way around.
Since I said CPU, I thought I was clear that I meant decoding in software.
Using mplayer, h264 will eat almost exactly twice the CPU time that theora uses with similarly encoded files.
Also, about the battery life -- I can get about 5-6 hours of playback time decoding purely on the CPU with an Atom N280. That's certainly not "eating up battery like no tomorrow."
The browsers need to start supporting free codecs now. Streaming h.264 is free for now, but that party is going to end at the end of 2010. If YouTube has to start paying royalties for every h.264 stream they serve up you better bet the whole game is going to change.
Theora/Dirac/Whatever start looking real good when consider that it keeps the web "free". Imagine if you had to pay everytime you served up a jpeg on your website? If you want to serve video from your site in a couple years, you may have to. I say we pick an open format now, to avoid all that headache now.
You sure do put a lot of energy into slagging Ogg, and you consistently neglect mention the advantage Ogg has over H.264: it is unencumbered by patents and therefore free for anybody to encode and/or play, on any hardware they wish.
Oh, I don't have anything against Theora per se, nor Ogg in general. It's just people keep having highly unrealistic hopes for what it can do in terms of compression efficiency and ecossytem.
Codecs are hard, and it does no one any benefit to assume they're capable of things they simply aren't.
The Xiph blog posts on their optimization process for Theora have been excellent reading, and they've done really good work. But the bitstream itself simply isn't capable of what modern codecs are capable of already. I'm sure Theroa will continue to improve, but I don't see any reaosn why H.264 won't see improvements at least as quickly.
And H.265 is already in development, targeting new bistream features that will add further substantial efficiency improvements.
I for one, am perfectly happy to burn a little extra bandwidth for that, and anyway I not buy your assertion that Ogg cannot close the bandwidth gap over time. .
After all, you are a Microsoftie with a vested interest in keeping video proprietary
Eh, I work on Silverlight, where we have the Raw AV pipeline for managed code decoders. It's be trivial for any customer add Theora support to Silverlight if they want it. If anything, Theora would be a competitive advantage for Silverlight.
Also, I don't think anyone is talking about propritary codecs here, except for perhaps VP6. VC-1 and H.264 are both international standards, with licensing handled by MPEG-LA. They are patent encumbered, but are not propritary any more than MP3 or ASP are.
My video compression blog
When 60+ percent* and increasing of all mobile web journeys come from iPhones, the other platforms fade away. You're mistaking the United States as a proxy for the entire world.
Da Blog
Patented means proprietary, please to not try to make words mean things that they do not.
No, patent means patented. There's a qualitative difference between a format for which there are public and publshed interoperable standard, and one where the implementation details are private or only avialable under a specific license. You may not care for either model, but it's certainly a meaningful distinction, and a longstanding one in the digital media world. When people in the vieo industry speak of an "open standard" they mean publically available specifications and patent licenses available under RAND terms. A propritary codec would be used to describe, say, RealVideo 10 or Apple's ProRes, for which there isn't bitstream documentation or RAND licensing availble.
With an open standard, everyone's on equal ground in building interoperable implementations without any reverse engineering, and has equal abilty and pricing for licensing the patents.
The issue of whether those patents have a fee or not is obviously important, but somewhat orthogonal to openness. One could certainly have a free-to-implement technology that isn't documented, and hence wouldn't be considered "open."
And Theora is certainly patented as well; On2 has released their patents under an extremely flexible license, but they're still valid.
I repeat my assertion that Ogg Theora is already good enough for me, and likely is good enough for many besides myself, who do not care much about 3 DB more or less of streaming bandwidth, and who do care about freedom from proprietary restrictions and patent fees for video codecs.
I've never heard streaming bandwidth described in dB. Interesting metric; so 3 dB would be ~2x bandwidth difference at the same quality? Kind of elegent; I normally talk about that in terms of percentage, but since improvements are measured in dB, it could apply either way.
FWIW, codec engineers sweat blood for a 0.1 dB improvement. The cable industry has spent multiple billions of dollars to upgrade to H.264 set top boxes and infrastructure to get that ~3-4 dB improvement of H.264 over MPEG-2, expecting a much bigger payoff due to additioanl channels/services they can sell with those savings.
Anyway, if Theora does what you want it do, use it with my blessing. Good enough is by definition good enough. Like I said earlier, I work on Silverlight, and we've already got the infrastructure in Silverlight for 3rd parties to add new codec and format support in managed code.
My concern is mainly that a lot of people seem to be thinking that Theora is capable of things it isn't and won't be capable of. To whit:
Theora isn't ever going to be competitive with H.264 High Profile in compression efficiency. While it's certainly capable of futher improvement, H.264 implementatiosn are improving rapidly as well, so I doubt it'd ever need less than 2x the bandwidth for a particular quality level compared to best H.264 implementations at the time.
For the business models I've run some quikc numbers on, the extra bandwidth cost of Theora would cost more than any H.264 license fees saved.
Thus mainstream media sites, like YouTube, don't have any business reasons to adopt Thera; it'd be a net negative on their profitability.
If you think I'm mistaken on any of the above, I'd be very interested in disucussing your perspective. If you're asserting that there are markets where the above factors don't matter much, then I agree with you.
But if it's really important for this community to have a competitive codec without patent licensing requirements, then Theora (at least a Theora 1.0 bistream compatible version) may be a distraction.
I don't have a lot of hope for Dirac either; I've not seen any indication of a new approach to the challenges of marrying wavelts with motion estimation; once your intra and inter block sizes are radically different, things get quite challenging. Theora is likely to remain a superior choice than Dirac.
My video compression blog