Slashdot Mirror


YouTube Has 1 Billion Videos With Closed-Captioning (But Not All of Them Are Accurate) (variety.com)

Over a billion videos on YouTube are accessible to viewers with difficulties in hearing, thanks to the video giant's automated captions, it said Thursday. From a report on Variety: That certainly sounds impressive -- except when you realize that many of the site's automatically generated captions aren't completely right. The Google-owned video giant first launched captions back in 2006, and three years later introduced automatic speech recognition to add closed-captioning to YouTube content. Today, YouTube users watch video with auto-generated captions more than 15 million times per day. But the system is prone to errors. For example, the trailer for Amazon Studio's Oscar-nominated "Manchester by the Sea" (at this link) includes numerous inaccuracies in the auto-transcribed captions, sometimes to hilarious -- not to mention frustrating -- effect.

52 comments

  1. So we're talking Auto Generated Bad Lip Reading? by DickBreath · · Score: 2

    > thanks to the video giant's automated captions, > That certainly sounds impressive -- except when you realize that many of the site's automatically generated captions aren't completely right.

    I know robots are taking over jobs. But put those two statements together and this sounds like auto-generated bad lip reading.

    Now if someone could only implement all possible bad lip readings, and then auto-rate them for hilarity, we would be onto something.

    --

    I'll see your senator, and I'll raise you two judges.
  2. The next step is facial recognition by Anonymous Coward · · Score: 0

    ...if they're not doing it behind the scenes already. You should try putting your own face through the test tool AWS has, it's outright scary.

    I wonder if google could marry this with their tensorflow framework, and maybe figure out why Casey Affleck always looks like he's having trouble holding in a fart.

  3. Re:So we're talking Auto Generated Bad Lip Reading by s1d3track3D · · Score: 2

    Thank you for submitting your video RickDeath, we will get right on close captioning it.

  4. I've noticed a lot of errors in 'Downfall' by Hussman32 · · Score: 1, Insightful

    Something is clearly wrong with the translations of the Downfall videos. Sometimes it's about SAP, sometimes it's about the World Cup, but my limited German tells me it's about the fall of the Third Reich.

    --
    "Who are you?" "No one of consequence." "I must know." "Get used to disappointment."
    1. Re:I've noticed a lot of errors in 'Downfall' by XxtraLarGe · · Score: 1

      Something is clearly wrong with the translations of the Downfall videos. Sometimes it's about SAP, sometimes it's about the World Cup, but my limited German tells me it's about the fall of the Third Reich.

      We're living in the postmodern era. The interpretation is left up to the viewer!

      --
      Taking guns away from the 99% gives the 1% 100% of the power.
    2. Re:I've noticed a lot of errors in 'Downfall' by Anonymous Coward · · Score: 0

      +1 Funny, and also -1, Fail to everyone who missed the joke.

  5. Not All of Them Are Accurate by Anonymous Coward · · Score: 0

    How about "Not Any of Them Are Accurate" ?
    Seriously, has anyone seen one that actually is correct?

    1. Re:Not All of Them Are Accurate by Anonymous Coward · · Score: 0

      > How about "Not Any of Them Are Accurate" ? Seriously, has anyone seen one that actually is correct?

      Do you speak Tagalog? No? Well, neither me.

      Bad closed captions allowed me to grasp the idea of most scenes of a picture I wanted to watch. Sometimes it's correct, sometimes it's wrong but you can infer from context, sometimes you can infer from the scene itself, sometimes you can't -- but the movie keeps playing and you get to understand who is the hero and who's the villain, at least.

      I'd say it's nice to have it. And it's improving all the time.

      Maybe not for me, or you, but for future generations, perhaps.

    2. Re:Not All of Them Are Accurate by gnick · · Score: 1

      Bad closed captions allowed me to grasp the idea of most scenes of a picture I wanted to watch.

      When I'm watching something in a language I don't understand, inaccurate CCs are nearly as good as accurate ones, as I have no idea that they don't match up. I prefer the original language with subtitles because the dubbed audio lacks the actors' inflections plus I don't like it when lips and words don't match. I often watch movies or TV programs with subtitles on, but avoid automated CC - When my eyes and ears don't match the conflict distracts me.

      ...the movie keeps playing and you get to understand who is the hero and who's the villain, at least.

      I can typically do that with the film muted, can't you?

      --
      He's getting rather old, but he's a good mouse.
    3. Re:Not All of Them Are Accurate by Anonymous Coward · · Score: 0

      > When my eyes and ears don't match the conflict distracts me.

      Interesting. I don't work that way -- but that may be because we are highly trained on that, since most films are spoken in English (originally, that is) and everyone here watches them dubbed in our language (a Latin derivative). I guess we learned to disregard lip movements.

      >> ...the movie keeps playing and you get to understand who is the hero and who's the villain, at least.

      > I can typically do that with the film muted, can't you?

      Not always, and it's specially harder in movies with great plots.

  6. Need a Hitler parody video... by Anonymous Coward · · Score: 0

    ...in which Hitler finds out Youtube has 1 billion videos with subtitles, but not all of them are accurate.

    1. Re:Need a Hitler parody video... by PPH · · Score: 1
      --
      Have gnu, will travel.
  7. Goes for comments too... by s1d3track3D · · Score: 1

    site's automatically generated captions aren't completely right

    Maybe they are generating the comments as well

    1. Re:Goes for comments too... by DickBreath · · Score: 2

      Maybe Slashdot is what was used for beta testing auto generated comments.

      --

      I'll see your senator, and I'll raise you two judges.
  8. Re:So we're talking Auto Generated Bad Lip Reading by cayenne8 · · Score: 3, Informative
    Yeah, the auto generated CC is pretty bad.

    But I did read, that it *IS* very much worth your while to put accurate CC on your videos, as that it supposedly highly figures into your Google rankings.

    I found that after I transcribed my videos, my rankings did shoot up higher on plain old Google searches and I think also on YouTube suggestions, etc.....so, looks to be worthwhile to do if you want max hits.

    --
    Light travels faster than sound. This is why some people appear bright until you hear them speak.........
  9. about 25% uses them by Anonymous Coward · · Score: 0

    I write closed-captioning on my videos ( https://www.youtube.com/ruddk/ ) in two languages(those with speech) and about 25% of them are viewed with CC on.
    Now that might be because of my mumbling and my less than ideal English skills. :) But I was surprised to see that many people using closed-captioning.
    But is just a hobby for me because I like tinkering with it.

    1. Re:about 25% uses them by aevan · · Score: 2

      I use CC even in languages I understand. Sometimes it's the background noise or multiple people talking (in the movie, or kids running around irl), sometimes the speaker mumbles, sometimes I have the volume low/muted, and sometimes they just speak so damn slow and I'm feeling impatient. I also like when the CC is in another language so as to give to alternate impressions of the dialogue (though if the 'sub' is bad it drives me nuts 'that isn't what they said!' :P ).

      So by all means, continue doing CC. Some need it, and some just enjoy it.

  10. Re:Lip Reading? by hackwrench · · Score: 1

    Why bad lip reading? Why not your basic garden variety bad speech recognition?

  11. Good enough for me. by laserhead · · Score: 1

    And I don't need to pay extra money for that.

  12. But not all of them are accurate by DontBeAMoran · · Score: 0

    Task one undergarment.

    --
    #DeleteFacebook
    1. Re:But not all of them are accurate by Tablizer · · Score: 1

      Aw, Yorb aces Arbee long "2S".

  13. about as good as google search. by fish_in_the_c · · Score: 2

    I recently watched a video with closed captioning on.
    'stan fortuna school of the eucharist'
    lets just say google search doesn't think eucharist is a common term and has an especially hard time with it when it is a quickly spoken rap song with a Hispanic accent.

    It was pretty funny what they translated it too.

    It did leave me wondering if there should be a mechanism to tell them the words are wrong and really wrong.

    --
    âoeTolerance applies only to persons, but never to truth. Intolerance applies only to truth, but never to persons.
    1. Re:about as good as google search. by Anonymous Coward · · Score: 0

      What is a "Hispanic accent"? I know Spanish accents, Mexican accents, etc. Not so much Hispanic.

  14. Well you would choose Manchester By the Sea by hey! · · Score: 1

    I grew up in Boston, and when I go back to the old neighborhood it makes me wonder how people understand me at all. Speech recognition programs never work for me.

    --
    Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    1. Re:Well you would choose Manchester By the Sea by farble1670 · · Score: 1

      Don't worry. You are a special, unique snowflake. If people can't understand you, they are racist.

    2. Re:Well you would choose Manchester By the Sea by hey! · · Score: 1

      Don't worry. You are a special, unique snowflake. If people can't understand you, they are racist.

      Actually people who don't understand me are usually idiots. Some of them are blockheads. Funnily enough racists understand me fine, they just don't like what I have to say.

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
  15. News for Nerds? by Anonymous Coward · · Score: 0

    Or Stuff That Matters?

    I still don't see it.

  16. Re:Lip Reading? by swillden · · Score: 2

    Why bad lip reading? Why not your basic garden variety bad speech recognition?

    https://www.youtube.com/user/BadLipReading

    --
    Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  17. couple of things by Quirkz · · Score: 1

    I've lost the link, but someone recently mentioned an intentionally humorous duo who:
    1) write a skit, perform it, and upload to Google
    2) let captioning take its best stab
    3) use the captioning as a new script, and re-record the scene
    4) upload and re-caption
    5) record a third time, with even weirder dialogue

    Then they splice it all together, and you get to watch the degeneration of language as iterative captioning makes everything nonsensical.

    My wife and I also tend to watch a lot of TV when the other wants quiet, so closed captioning is almost always on for all shows. The quality and consistency can vary wildly, and sometimes the mistakes are hilariously bad. (One particularly bad one I recall is "Atlas Shrugged" coming out as "At Last Shrub" and some other cases where a British show has about half of the dialogue listed as [indecipherable] even if it seemed clear enough to us). Occasionally, though, we'll get captioning that either relays something we thought was indecipherable, or even calls out something ("distant cry for help" or "creepy creak") that we couldn't hear/notice on our own.

    1. Re:couple of things by aevan · · Score: 1

      Sounds like a variation on those 'bad translation' humor videos: use online translators to translate through several languages, before ending back in English and using that. Like Bohemian Rapsody.

    2. Re: couple of things by Anonymous Coward · · Score: 0

      Sounds like Rhett & Link:
      https://m.youtube.com/playlist?list=PLA220BA20D4D3DE46

  18. I wonder... by XSportSeeker · · Score: 1

    From my personal experience, I can't help but wonder if there is even ONE out of all the auto-generated captions that is accurate at all. :P
    Have you guys ever seen one? I mean, a few mistakes are ok... but so far I haven't seen any video that had auto-generated captions that was even understandable at all. More like a mish mash of guesses.
    Which is great for comedic effect I guess, but not so much for viewers with difficulties in hearing.

    1. Re:I wonder... by Harlequin80 · · Score: 1

      I dunno I use the auto CC all the time. Some of the guesses are terrible but normally seems to have the biggest issues around random nouns. So assuming that I know what the topic of the video is I can use the CC, I just have to substitute the proper noun at the right time.

      I tend to use it when watching a technical video with a single talker and I don't actually want the sound on for what ever reason.

  19. Accuracy? by twmcneil · · Score: 1

    Accuracy is not that important to me. We all are pretty used to inaccuracies while texting now. What is important to me is the synchronization. If the captions follow the actual speech by more than just a bit, it makes it hard for me to follow as I lip read in addition to reading the captions. Lip reading is often ok by itself, but with movies and TV, the speakers face is not always pointed to the camera or there might be something covering the speakers lips. That's when cc comes in handy.

    --
    "The ferrets, they're every where I tell you!"
    1. Re:Accuracy? by fish_in_the_c · · Score: 1

      I really don't think this thing is using lip reading ( I could be wrong does anyone have a source). I'd guess it is using the same servers as google voice search and the type of errors it has seems consistent with that.

      --
      âoeTolerance applies only to persons, but never to truth. Intolerance applies only to truth, but never to persons.
  20. Not accurate by TWX · · Score: 1

    Yeah, the Swedish Chef video was quite wrong...

    --
    Do not look into laser with remaining eye.
  21. Make perfect the enemy of good. by 140Mandak262Jamuna · · Score: 2
    It is true in software engineering, and it is true everywhere.

    Perfection is the goal. But doing better than current version is the shipping criterion.

    Auto captioning is better than no captioning for hearing impaired.

    And human captioning is not perfect. I remember watching Lion King with closed captioning turned on and they had missed a crucial "o" in some dialog that had the word "count".

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Make perfect the enemy of good. by Anonymous Coward · · Score: 0

      If this is where closed captioning is headed, cunt me out!

    2. Re:Make perfect the enemy of good. by 140Mandak262Jamuna · · Score: 1

      Yes, that was the line. Zazu's lines in the song "I'm going to be king". So you noticed that too? You think it was an intentional mistake?

      --
      sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
  22. Need to ID language before speech recognition by Anonymous Coward · · Score: 0

    There is a video of a promotional event for the new Ghost in the Shell movie including an interview with Takeshi Kitano. The interviewer is asking the questions in English and Kitano is answering in Japanese but the google captions are performing the speech recognition as if it was English. I found it pretty funny.

    The interview starts at around 15 min.
    Tokyo Event

  23. on the plus side, Google can say whatever ...` by Anonymous Coward · · Score: 0

    on the plus side, Google can say whatever they want in videos and blame it on robots...

  24. More fair evaluation.... by Anonymous Coward · · Score: 0

    Google's automatic captioning sucks balls. It is horrid. I am embarrassed to have it on any video I've made. They need to shut the system down, fire all the staff working on it, and take the equivalent money and pay for reliable transcriptionists to closed-caption the videos the old fashioned way.
    Now, do you want to know what I really think? :|>

  25. Not all are accurate by argStyopa · · Score: 1

    Not always a bad thing.

    Bad Lip Reading are far more entertaining than the actual text of the presidential debates, for example.

    --
    -Styopa
  26. My hovercraft is full of eels by Anonymous Coward · · Score: 0

    My hovercraft is full of eels.
    Look it up.

  27. John Oliver fail by Anonymous Coward · · Score: 0

    I watched last Sunday's Last Week Tonight. The captions were mostly okay but the goofs were funny in their own right.

  28. yeah by Feneric · · Score: 1

    Anyone who's actually watched YouTube with captions will know almost immediately what I'm getting at in the subject. Saying the system "is prone to errors" as above is being very kind. Amongst its many errors are frequent phantom occurrences of the word "yeah". While the phantom "yeah" instances are more funny than anything else, many of the other errors are much worse. Amongst other problems they have been known to convert a family-friendly video into something that no longer fits that description.

  29. But Not All of Them Are Accurate by nitehawk214 · · Score: 1

    Are any of them accurate? Can you manually enter captions?

    Undertake this discourse, actually types.

    --
    I'm a good cook. I'm a fantastic eater. - Steven Brust
  30. average at best by Anonymous Coward · · Score: 0

    The transcripts for the videos are average at best - but thats alphabet ( Alpha Beta) all over - all there code is either alpha or beta but never polished.

  31. Youtube has a potty mouth by ukoda · · Score: 3, Informative

    For some reason Youtube thinks that people speaking with a New Zealand accent swear a lot. I was testing the Youtube product tutorial on an Android product which, unlike PC browsers, has the closed captioning on by default. A lot of the technical terms, spoken with a Kiwi accent, were being captioned with obscene words. When I recovered from laughing at just how rude it was being I warned our marketing team that made the video. They were mortified and suddenly had a large task of checking and removing the computer generated captions. It turns out all of our SFW videos had NSFW captions.

  32. Can't transcribe a Boston accent by RogueWarrior65 · · Score: 1

    Clearly, these algorithms don't know what a bubblah or a blinkah or a clickah is.

  33. Tried it many times .... by Anonymous Coward · · Score: 0

    .... and every single times it was WRONG at least for 90% of the audio.

    The only time CC has worked, is when the uploader included the subtitles with the video.

  34. Crowd sourced option by Immerial · · Score: 1

    There is a mechanism to tell them the words are wrong. YouTube has a crowd sourced option for subtitles called "Community contributions" that you can turn on as a content owner. You can make it available for anyone to sub your videos http://www.youtube.com/timedtext_cs_panel.