AV1 is Well On Its Way To Becoming a Viable Alternative To Patented Video Codecs, Mozilla Says (mozilla.org)
Here's a surprising fact: It costs money to watch video online, even on free sites like YouTube. That's because about 4 in 5 videos on the web today rely on a patented technology called the H.264 video codec. From a report: It took years for companies to put this complex, global set of legal and business agreements in place, so H.264 web video works everywhere. Now, as the industry shifts to using more efficient video codecs, those businesses are picking and choosing which next-generation technologies they will support. The fragmentation in the market is raising concerns about whether our favorite web past-time, watching videos, will continue to be accessible and affordable to all.
Over the last decade, several companies started building viable alternatives to patented video codecs. Mozilla worked on the Daala Project, Google released VP9, and Cisco created Thor for low-complexity videoconferencing. All these efforts had the same goal: to create a next-generation video compression technology that would make sharing high-quality video over the internet faster, more reliable, and less expensive. In 2015, Mozilla, Google, Cisco, and others joined with Amazon and Netflix and hardware vendors AMD, ARM, Intel, and NVIDIA to form AOMedia. As AOMedia grew, efforts to create an open video format coalesced around a new codec: AV1. AV1 is based largely on Google's VP9 code and incorporates tools and technologies from Daala, Thor, and VP10.
Mozilla loves AV1 for two reasons: AV1 is royalty-free, so anyone can use it free of charge. Software companies can use it to build video streaming into their applications. Web developers can build their own video players for their sites. The second reason we love AV1 is that it delivers better compression technology than even high-efficiency codecs -- about 30% better, according to a Moscow State University study.
Over the last decade, several companies started building viable alternatives to patented video codecs. Mozilla worked on the Daala Project, Google released VP9, and Cisco created Thor for low-complexity videoconferencing. All these efforts had the same goal: to create a next-generation video compression technology that would make sharing high-quality video over the internet faster, more reliable, and less expensive. In 2015, Mozilla, Google, Cisco, and others joined with Amazon and Netflix and hardware vendors AMD, ARM, Intel, and NVIDIA to form AOMedia. As AOMedia grew, efforts to create an open video format coalesced around a new codec: AV1. AV1 is based largely on Google's VP9 code and incorporates tools and technologies from Daala, Thor, and VP10.
Mozilla loves AV1 for two reasons: AV1 is royalty-free, so anyone can use it free of charge. Software companies can use it to build video streaming into their applications. Web developers can build their own video players for their sites. The second reason we love AV1 is that it delivers better compression technology than even high-efficiency codecs -- about 30% better, according to a Moscow State University study.
At the outset, I just want to say how happy I am that AV1 has taken off, and how seriously it is viewed by so many technology companies as a way around H.264 and (even worse) HEVC. Particularly with respect to HEVC, there are three separate patent pools with different participants. HEVC is, in many ways, already set up to fail due to a large number of participants that participate in either none or one of the pools (see https://streaminglearningcente... for how chaotic it is). There are some other proprietary technologies such as Perseus that are out there that claim better performance than HEVC from a PSNR/SSIM perspective, but they will likely remain fringe.
What is of more concern to me is how carefully AV1 has been constructed in terms of its coding tools to avoid patent trolling and patent submarining (e.g. Rambus at JEDEC with DDR). This is a very serious and very technically complex issue, as any company could easily assert patents on AV1 if they feel there is infringement on their claims as pertains to any of the coding tools. There are increasingly limited ways of dealing with spatiotemporal entropy in non-infringing ways that do not involve exponential increases in gates or CPU cycles.
A recent and simple example of this is the MPEG-LA claiming they license patents related to the MPEG-DASH streaming framework. MPEG-DASH is, essentially, an XML schema for a streaming manifest combined with either MPEG-4 Part 12 (the MP4 container originally specified by Apple as the MOV format), or MPEG-2 Transport Streams encapsulating H.264 video. Nobody on the DASH Industry Forum really thought that MPEG-DASH would be subject to this type of activity, yet magically MPEG-LA began waiving it agreement around about two years ago.
As a result, many in the industry have held onto the virtually universally-supported HTTP Live Streaming, which is an M3U playlist with tag extensions and MPEG-2 Transport Stream container for the codecs. Even that standard developed by Apple has never become a fully ratified within the IETF, and nobody knows if the same thing will happen there either.
Incidentally, any time Google has presented VP8 or VP9 at previous conferences and is asked about patents, they avoid answering questions and the audience usually laughs. I've seen it personally, and I think it's the industry's cynicism for the various patent holders and some of their past actions. Where it becomes critical is for silicon suppliers, whose front-loaded costs are now in the neighborhood of nine figures to launch some SoCs, and for content distributors, who invest a tremendous amount of time and money encoding all of the required profiles for streaming to new codecs. Commitment to efficient hardware acceleration by them for the codec is risky, as they could easily be legally enjoined from selling their products if they didn't get their patent licenses in order, and this would also leave content holders scrambling to fall back to already-established codecs.
I will admit I'm cynical here too. While I'd love to see a patent-free open standard, I'm not optimistic that someone will not come out of the woodwork claiming infringement on a key coding tool. I wish Google and the rest of the AV1 participants luck. They'll need it.