Slashdot Mirror


Next-Next Generation Video: Introducing Daala

An anonymous reader sends this excerpt from a post by Xiph.org's Monty Montgomery: "Xiph.Org has been working on Daala, a new video codec for some time now, though Opus work had overshadowed it until just recently. With Opus finalized and much of the mop-up work well in hand, Daala development has taken center stage. I've started work on 'demo' pages for Daala, just like I've done demos of other Xiph development projects. Daala aims to be rather different from other video codecs (and it's the first from-scratch design attempt in a while), so the first few demo pages are going to be mostly concerned with what's new and different in Daala. I've finished the first 'demo' page (about Daala's lapped transforms), so if you're interested in video coding technology, go have a look!"

19 of 86 comments (clear)

  1. There really aren't any marketing people in OSS by Anonymous Coward · · Score: 2, Insightful

    Daala? That will play well in test/focus groups - I'm certain of it.

    1. Re:There really aren't any marketing people in OSS by Anonymous Coward · · Score: 2, Insightful

      Compared to... x.264?

    2. Re:There really aren't any marketing people in OSS by serviscope_minor · · Score: 5, Informative

      Daala? That will play well in test/focus groups - I'm certain of it.

      Hmm, yeah, because the input of "focus" groups are well known for their role in chosing video codecs. So, let's look at the real commercial codecs out there:

      ISO/IEC 11172 aka MPEG-1
      Dirac Pro aka SMPTE 2042-1-2009 VC-2
      H.222 aka H.262 aka MPEG-2 Part 2 aka ISO/IEC 13818-2
      MPEG-4 er, which was that, Part 2 aka H.263 aka ISO/IEC 14496-2 or Part 10 AVC aka h.264 aka ISO/IEC 14496-10
      MS MPEG-4v3 aka (unofficially) Divx ;-)

      etc.

      So yeah, Daala is terrible by those standards

      So, basically you're randomly grousing about OSS is nonsensical and in complete contrast to reality and apparently this has caused your post to be upmodded.

      I expect I'll be downmodded by whoever modded you up, but I have karma to burn so what the hell.

      --
      SJW n. One who posts facts.
    3. Re:There really aren't any marketing people in OSS by LourensV · · Score: 2

      Also, in the Star Wars universe, Daala is the protégé of Grand Moff Tarkin, who gave his name to Xiph.org's earlier experimental wavelet-based video codec effort, so the name makes perfect sense from a historical perspective as well.

    4. Re:There really aren't any marketing people in OSS by bzipitidoo · · Score: 2

      Star Wars is hardly original. Lucas evidently liked to double up vowels to give clones a slightly different name, ie. Luke Skywalker's clone is Luuke Skywalker. Daala might come from Dala, which according to a quick search is a board game from Sudan.

      --
      Intellectual Property is a monopolistic, selfish, and defective concept. It is "tyranny over the mind of man"
    5. Re:There really aren't any marketing people in OSS by tlhIngan · · Score: 2

      MPEG-4 er, which was that, Part 2 aka H.263 aka ISO/IEC 14496-2 or Part 10 AVC aka h.264 aka ISO/IEC 14496-10
      MS MPEG-4v3 aka (unofficially) Divx ;-)

      Actually, MS MPEG4 is a form of h.263. In fact, MPEG4 Part 2 is also known as DivX/Xvid/etc, and to be more correct still, they fall under MPEG4 Part 2 Advanced Simple Profile (ASP).

      MPEG4 Part 10 (h.264) is Advanced Video Codec (AVC).

      Of course, one needs to realize that MPEG4 is a comprehensive standard and nothing "implements" MPEG4 - it's just a related group of standards, including the use of a subset of the QuickTime MOV format (Part 14) for the containers, various video codecs (including h.263-based ones like DivX and the like, h.264), audio codecs (preferred AAC) and many other standards, including stuff like DRM, bitstream formats, subtitles, integration testing, reference software, hardware implementations, etc.

      Effectively, MPEG4 is a "meta standard" of standards - it really is just an accumulation of standards others have written.

  2. Re:Patent Tolls heading your way by bill_mcgonigle · · Score: 5, Interesting

    It's the submarine patents that are the bigger worry. Since this codec is based on work that's come from some academic research papers, one can imagine a sufficiently wealthy litigation-mad megalocorp paying developers to stay on top of research and file patent applications citing obvious implementation details and then keeping them under the surface until it's sufficiently advantageous to allow them to surface. This process is 100% the opposite of the intention of the Constitutional provision for IP.

    BTW, the posted whitepaper is really nicely done - good job Xiph team. In a free world everybody would rejoice and be happy for your efforts.

    --
    My God, it's Full of Source!
    OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  3. Essential Features.. alpha support, lossless. by thesupraman · · Score: 5, Insightful

    If you want to have a good solid niche for this (always helps) then.
    1 - support a reasonably efficient lossless mode (several others have this, but it is alsways useful)
    2 - support ALPHA channels (32bit, RGBA, YUVA) - this is not trivial but very worthwhile, basically ZERO of the modern codec support this.

    alpha channels are a requirement for many composition/editing/video production workflows, and yet the supporting codecs are old, clunky,
    and painful to use.

    good alpha channel support is not trivial, but is usually not major also.

    For extra points support lossless on alpha, and lossy on the other channels, that is a VERY good option for many workflows.

    Gives alpha and lossless, you are pretty much guaranteed professional users.

    1. Re:Essential Features.. alpha support, lossless. by qbwiz · · Score: 4, Informative

      Wikipedia claims that VP9 does/will support alpha channel

      --
      Ewige Blumenkraft.
    2. Re:Essential Features.. alpha support, lossless. by thesupraman · · Score: 5, Informative

      Yes, as far as I can tell thats a pretty solid 'will', closer to a 'may', but alpha there as far as I can tell is just another
      channel with no special consideration, unfortunately.

      But of course, the more the merrier, its just a very under supported requirement, and one that is heavily used in content creation circles.

      Most end up having to use png or target image sequences, or quicktime/animation codec for support, which are somewhat poor workflows.

    3. Re:Essential Features.. alpha support, lossless. by mikechant · · Score: 2

      Apple Prores 4444 is YUVA 0 it's not lossless but its close enough for most applications.

      The whole point of lossless formats is that you can move back and forth between them *as many times as you like* without ever losing *any* information. No non-lossless codec is 'close enough' to lossless for this purpose. It's like the difference between 'not pregnant' and 'only a bit pregnant'.

  4. Why still use blocks? by pipatron · · Score: 3, Interesting

    I'm a bit worried about the big focus on "blocks" for such a video codec. Originally, the blocks used in the JPEG image coder was put there to make sure that you could stream-encode images using reasonably cheap silicon back in the eighties. No one really wants the blocks, they were a necessary limitation. Using the same algorithm as JPEG but removing the blocks gives a serious quality boost.

    This codec will never run on hardware that can't handle more than 16 x 16 pixels at once. The lowest specs that will encode these frames will be hand-held cameras, which will have more than enough ram to buffer at least two full frames, and use a small FPGA for encoding/decoding. Everything else will be decoded by a GPU directly to the framebuffer, and likely encoded by the same GPU. Even server farms have these for processing media.

    There's also no issue with streaming as far as I know. Both DCT and Wavelet based coders can packetize the important bits in a frame first, and the less important bit later, so that a slow connection can still decode a degraded image even if not all bits are received. This without splitting the image into blocks.

    --
    c++; /* this makes c bigger but returns the old value */
    1. Re:Why still use blocks? by Overzeetop · · Score: 2

      I thought the blocks were used to provide a reasonable limit on the number of terms necessary to effectively describe the function in the "time" space using a DCT. The large the block, the more terms are necessary. I don't know enough about DCT to know how the number of terms scales with individual accuracy required and block size, but I would think a move away from that kind of a transform would be necessary to encode entire frames in one go with any hope at a highly efficient compression ratio.

      --
      Is it just my observation, or are there way too many stupid people in the world?
    2. Re:Why still use blocks? by cbcbcb · · Score: 4, Informative

      You still need to subdivide the image for motion estimation, as some parts of a moving image move at different rates to others. Subdividing into blocks gives you O(N) complexity in the size of the input image, whereas doing a DCT transformation has O(N log N) complexity.

    3. Re:Why still use blocks? by TeknoHog · · Score: 2

      It would be great if the blocks were independent, so you could parallelize the work by a huge factor. I guess block boundary effects are the problem, but they don't exactly constitute all the work. A great deal of scientific computing (e.g. fluid dynamics) is done by splitting the problem space into blocks which can be run on different nodes of a cluster, and while a lot of communication is needed between nodes, there are certainly benefits from such parallelism.

      Another interesting idea (which I've probably mentioned before, and isn't exactly new anyway) is to treat the video as a 3D voxel blob, with time as the third dimension. Split this into cubical blocks (if only for streaming) and use a 3D Fourier (wavelet, cosine etc.) transform and keep the most significant elements. This way you'll get a natural tradeoff between spatial and temporal accuracy, depending on the block content.

      --
      Escher was the first MC and Giger invented the HR department.
  5. Re:I wonder who their focus group is by Junta · · Score: 2

    I'm holding out hope that this codec supports subtitles as filters, so we can bake them in without actually baking them in, if you catch my drift. One of the biggest challenges I've faced when working with video encoding is that container formats are notoriously unsupported across the entire spectrum of video players.

    The issue being that whatever inconsistencies you are hitting won't be addressed by shuffling the subtitle track awkwardly into the video stream somehow. A player not being able to interpret an SSA/ASS stream won't suddenly be able to render them because it comes in the video part of the container. This just moves the problem form one part to another, and does so in a way without technical benefit.

    If the meat of your gripe is that a lot of things do not understand matroska and you are really thinking that such a trick will make .mp4 servicable, that would be incorrect too. Already you can put SSA or vobsub into mp4, but because it isn't part of the mp4 spec, many .mp4 players ignore the track. The same exact thing would happen if Daala somehowe hypothetically had subtitles in it, Daala in .mp4 would be something that those players would just fail to play.

    --
    XML is like violence. If it doesn't solve the problem, use more.
  6. Re:Another day, another codec. by LourensV · · Score: 5, Interesting

    Those existing codecs are all very similar technically, and riddled with patents. If Monty can make something new (and he can, see CELT) and work around those patents (and he can, see Vorbis, Theora), then it's definitely a welcome addition. And a codec doesn't have to dominate to be useful; Vorbis is widely used (Wikipedia, all sorts of software that plays sound and music including a lot of if not most video games) and supported on a lot of platforms (including hardware players and set-top boxes) even if it never did completely replace MP3 and AAC. If nothing else, having a free and unencumbered option will keep the licensors of the proprietary codecs at least somewhat honest.

    Incidentally, isn't it about time for Monty to get an EFF Pioneer award? He's been very successfully working on freely usable audio and video codecs for well over a decade now, starting at a time when many people didn't believe that a non-encumbered audio or video codec was even possible. Someone with his skills could probably make a very good living in proprietary codec development, but he chose to start Xiph.org and fight the good fight (and now works for Red Hat). He belongs in that list IMHO.

  7. Re:Another day, another codec. by evilviper · · Score: 2

    He's been very successfully working on freely usable audio and video codecs for well over a decade now, starting at a time when many people didn't believe that a non-encumbered audio or video codec was even possible.

    Monty and Xiph have done pretty well with audio codecs, but they need to stay the HELL away from video codecs.

    Theora/VP3 was a nightmarish debacle that we're still suffering from. Thank god Google took the reins by releasing VP8, and developing it themselves, rather than handing it over to Xiph to die on the vine, and make a depressing reappearance a decade later...

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  8. Re:Another day, another codec. by xiphmont8352 · · Score: 5, Informative

    We abandoned Tarkin, but not because of VP3. It was an interesting crazy idea that simply wasn't workable.

    Tarkin was an attempt to replace motion compensation with a directional 3D wavelets that could predict across time. It had... disturbing... encoding artifacts.