Slashdot Mirror


YouTube Has 1 Billion Videos With Closed-Captioning (But Not All of Them Are Accurate) (variety.com)

Over a billion videos on YouTube are accessible to viewers with difficulties in hearing, thanks to the video giant's automated captions, it said Thursday. From a report on Variety: That certainly sounds impressive -- except when you realize that many of the site's automatically generated captions aren't completely right. The Google-owned video giant first launched captions back in 2006, and three years later introduced automatic speech recognition to add closed-captioning to YouTube content. Today, YouTube users watch video with auto-generated captions more than 15 million times per day. But the system is prone to errors. For example, the trailer for Amazon Studio's Oscar-nominated "Manchester by the Sea" (at this link) includes numerous inaccuracies in the auto-transcribed captions, sometimes to hilarious -- not to mention frustrating -- effect.

33 of 52 comments (clear)

  1. So we're talking Auto Generated Bad Lip Reading? by DickBreath · · Score: 2

    > thanks to the video giant's automated captions, > That certainly sounds impressive -- except when you realize that many of the site's automatically generated captions aren't completely right.

    I know robots are taking over jobs. But put those two statements together and this sounds like auto-generated bad lip reading.

    Now if someone could only implement all possible bad lip readings, and then auto-rate them for hilarity, we would be onto something.

    --

    I'll see your senator, and I'll raise you two judges.
  2. Re:So we're talking Auto Generated Bad Lip Reading by s1d3track3D · · Score: 2

    Thank you for submitting your video RickDeath, we will get right on close captioning it.

  3. I've noticed a lot of errors in 'Downfall' by Hussman32 · · Score: 1, Insightful

    Something is clearly wrong with the translations of the Downfall videos. Sometimes it's about SAP, sometimes it's about the World Cup, but my limited German tells me it's about the fall of the Third Reich.

    --
    "Who are you?" "No one of consequence." "I must know." "Get used to disappointment."
    1. Re:I've noticed a lot of errors in 'Downfall' by XxtraLarGe · · Score: 1

      Something is clearly wrong with the translations of the Downfall videos. Sometimes it's about SAP, sometimes it's about the World Cup, but my limited German tells me it's about the fall of the Third Reich.

      We're living in the postmodern era. The interpretation is left up to the viewer!

      --
      Taking guns away from the 99% gives the 1% 100% of the power.
  4. Goes for comments too... by s1d3track3D · · Score: 1

    site's automatically generated captions aren't completely right

    Maybe they are generating the comments as well

    1. Re:Goes for comments too... by DickBreath · · Score: 2

      Maybe Slashdot is what was used for beta testing auto generated comments.

      --

      I'll see your senator, and I'll raise you two judges.
  5. Re:So we're talking Auto Generated Bad Lip Reading by cayenne8 · · Score: 3, Informative
    Yeah, the auto generated CC is pretty bad.

    But I did read, that it *IS* very much worth your while to put accurate CC on your videos, as that it supposedly highly figures into your Google rankings.

    I found that after I transcribed my videos, my rankings did shoot up higher on plain old Google searches and I think also on YouTube suggestions, etc.....so, looks to be worthwhile to do if you want max hits.

    --
    Light travels faster than sound. This is why some people appear bright until you hear them speak.........
  6. Re:Lip Reading? by hackwrench · · Score: 1

    Why bad lip reading? Why not your basic garden variety bad speech recognition?

  7. Good enough for me. by laserhead · · Score: 1

    And I don't need to pay extra money for that.

  8. about as good as google search. by fish_in_the_c · · Score: 2

    I recently watched a video with closed captioning on.
    'stan fortuna school of the eucharist'
    lets just say google search doesn't think eucharist is a common term and has an especially hard time with it when it is a quickly spoken rap song with a Hispanic accent.

    It was pretty funny what they translated it too.

    It did leave me wondering if there should be a mechanism to tell them the words are wrong and really wrong.

    --
    âoeTolerance applies only to persons, but never to truth. Intolerance applies only to truth, but never to persons.
  9. Well you would choose Manchester By the Sea by hey! · · Score: 1

    I grew up in Boston, and when I go back to the old neighborhood it makes me wonder how people understand me at all. Speech recognition programs never work for me.

    --
    Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    1. Re:Well you would choose Manchester By the Sea by farble1670 · · Score: 1

      Don't worry. You are a special, unique snowflake. If people can't understand you, they are racist.

    2. Re:Well you would choose Manchester By the Sea by hey! · · Score: 1

      Don't worry. You are a special, unique snowflake. If people can't understand you, they are racist.

      Actually people who don't understand me are usually idiots. Some of them are blockheads. Funnily enough racists understand me fine, they just don't like what I have to say.

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
  10. Re:But not all of them are accurate by Tablizer · · Score: 1

    Aw, Yorb aces Arbee long "2S".

  11. Re:Lip Reading? by swillden · · Score: 2

    Why bad lip reading? Why not your basic garden variety bad speech recognition?

    https://www.youtube.com/user/BadLipReading

    --
    Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  12. couple of things by Quirkz · · Score: 1

    I've lost the link, but someone recently mentioned an intentionally humorous duo who:
    1) write a skit, perform it, and upload to Google
    2) let captioning take its best stab
    3) use the captioning as a new script, and re-record the scene
    4) upload and re-caption
    5) record a third time, with even weirder dialogue

    Then they splice it all together, and you get to watch the degeneration of language as iterative captioning makes everything nonsensical.

    My wife and I also tend to watch a lot of TV when the other wants quiet, so closed captioning is almost always on for all shows. The quality and consistency can vary wildly, and sometimes the mistakes are hilariously bad. (One particularly bad one I recall is "Atlas Shrugged" coming out as "At Last Shrub" and some other cases where a British show has about half of the dialogue listed as [indecipherable] even if it seemed clear enough to us). Occasionally, though, we'll get captioning that either relays something we thought was indecipherable, or even calls out something ("distant cry for help" or "creepy creak") that we couldn't hear/notice on our own.

    1. Re:couple of things by aevan · · Score: 1

      Sounds like a variation on those 'bad translation' humor videos: use online translators to translate through several languages, before ending back in English and using that. Like Bohemian Rapsody.

  13. I wonder... by XSportSeeker · · Score: 1

    From my personal experience, I can't help but wonder if there is even ONE out of all the auto-generated captions that is accurate at all. :P
    Have you guys ever seen one? I mean, a few mistakes are ok... but so far I haven't seen any video that had auto-generated captions that was even understandable at all. More like a mish mash of guesses.
    Which is great for comedic effect I guess, but not so much for viewers with difficulties in hearing.

    1. Re:I wonder... by Harlequin80 · · Score: 1

      I dunno I use the auto CC all the time. Some of the guesses are terrible but normally seems to have the biggest issues around random nouns. So assuming that I know what the topic of the video is I can use the CC, I just have to substitute the proper noun at the right time.

      I tend to use it when watching a technical video with a single talker and I don't actually want the sound on for what ever reason.

  14. Accuracy? by twmcneil · · Score: 1

    Accuracy is not that important to me. We all are pretty used to inaccuracies while texting now. What is important to me is the synchronization. If the captions follow the actual speech by more than just a bit, it makes it hard for me to follow as I lip read in addition to reading the captions. Lip reading is often ok by itself, but with movies and TV, the speakers face is not always pointed to the camera or there might be something covering the speakers lips. That's when cc comes in handy.

    --
    "The ferrets, they're every where I tell you!"
    1. Re:Accuracy? by fish_in_the_c · · Score: 1

      I really don't think this thing is using lip reading ( I could be wrong does anyone have a source). I'd guess it is using the same servers as google voice search and the type of errors it has seems consistent with that.

      --
      âoeTolerance applies only to persons, but never to truth. Intolerance applies only to truth, but never to persons.
  15. Not accurate by TWX · · Score: 1

    Yeah, the Swedish Chef video was quite wrong...

    --
    Do not look into laser with remaining eye.
  16. Make perfect the enemy of good. by 140Mandak262Jamuna · · Score: 2
    It is true in software engineering, and it is true everywhere.

    Perfection is the goal. But doing better than current version is the shipping criterion.

    Auto captioning is better than no captioning for hearing impaired.

    And human captioning is not perfect. I remember watching Lion King with closed captioning turned on and they had missed a crucial "o" in some dialog that had the word "count".

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Make perfect the enemy of good. by 140Mandak262Jamuna · · Score: 1

      Yes, that was the line. Zazu's lines in the song "I'm going to be king". So you noticed that too? You think it was an intentional mistake?

      --
      sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
  17. Re:about 25% uses them by aevan · · Score: 2

    I use CC even in languages I understand. Sometimes it's the background noise or multiple people talking (in the movie, or kids running around irl), sometimes the speaker mumbles, sometimes I have the volume low/muted, and sometimes they just speak so damn slow and I'm feeling impatient. I also like when the CC is in another language so as to give to alternate impressions of the dialogue (though if the 'sub' is bad it drives me nuts 'that isn't what they said!' :P ).

    So by all means, continue doing CC. Some need it, and some just enjoy it.

  18. Not all are accurate by argStyopa · · Score: 1

    Not always a bad thing.

    Bad Lip Reading are far more entertaining than the actual text of the presidential debates, for example.

    --
    -Styopa
  19. Re:Need a Hitler parody video... by PPH · · Score: 1
    --
    Have gnu, will travel.
  20. Re:Not All of Them Are Accurate by gnick · · Score: 1

    Bad closed captions allowed me to grasp the idea of most scenes of a picture I wanted to watch.

    When I'm watching something in a language I don't understand, inaccurate CCs are nearly as good as accurate ones, as I have no idea that they don't match up. I prefer the original language with subtitles because the dubbed audio lacks the actors' inflections plus I don't like it when lips and words don't match. I often watch movies or TV programs with subtitles on, but avoid automated CC - When my eyes and ears don't match the conflict distracts me.

    ...the movie keeps playing and you get to understand who is the hero and who's the villain, at least.

    I can typically do that with the film muted, can't you?

    --
    He's getting rather old, but he's a good mouse.
  21. yeah by Feneric · · Score: 1

    Anyone who's actually watched YouTube with captions will know almost immediately what I'm getting at in the subject. Saying the system "is prone to errors" as above is being very kind. Amongst its many errors are frequent phantom occurrences of the word "yeah". While the phantom "yeah" instances are more funny than anything else, many of the other errors are much worse. Amongst other problems they have been known to convert a family-friendly video into something that no longer fits that description.

  22. But Not All of Them Are Accurate by nitehawk214 · · Score: 1

    Are any of them accurate? Can you manually enter captions?

    Undertake this discourse, actually types.

    --
    I'm a good cook. I'm a fantastic eater. - Steven Brust
  23. Youtube has a potty mouth by ukoda · · Score: 3, Informative

    For some reason Youtube thinks that people speaking with a New Zealand accent swear a lot. I was testing the Youtube product tutorial on an Android product which, unlike PC browsers, has the closed captioning on by default. A lot of the technical terms, spoken with a Kiwi accent, were being captioned with obscene words. When I recovered from laughing at just how rude it was being I warned our marketing team that made the video. They were mortified and suddenly had a large task of checking and removing the computer generated captions. It turns out all of our SFW videos had NSFW captions.

  24. Can't transcribe a Boston accent by RogueWarrior65 · · Score: 1

    Clearly, these algorithms don't know what a bubblah or a blinkah or a clickah is.

  25. Crowd sourced option by Immerial · · Score: 1

    There is a mechanism to tell them the words are wrong. YouTube has a crowd sourced option for subtitles called "Community contributions" that you can turn on as a content owner. You can make it available for anyone to sub your videos http://www.youtube.com/timedtext_cs_panel.