Slashdot Mirror


YouTube Video-Fingerprinting Due in September

Tech.Luver writes "The Register is reporting on Google's statement to a presiding judge that video-fingerprinting of YouTube material will be ready in September. The development is required to head off a three-headed suit against the company, currently being debated in a New York City courthouse. The system will, according to Google, 'be as sophisticated as fingerprinting technology used by the Federal Bureau of Investigation.' From the article: 'As Google told El Reg in an earlier conversation, the company already has two systems in place for policing infringing content - but neither are ideal. One system allows copyright holders to notify Google when they spot their videos on the company's sites. When notified, the company removes the offending videos, in compliance with the American Digital Millennium Copyright Act. A second system uses "hash" technology to automatically block repeated uploads of infringing material.'"

25 of 115 comments (clear)

  1. Hard AI ftw by UbuntuDupe · · Score: 4, Interesting

    Earlier I had joked about Google's claim to be nearing a system that lets them check for copyrighted works. I said that they're basically claiming to have solved a hard AI problem.

    Others pointed out that, no, it's not a hard AI problem to just compare some kind of checksum of the video against a set of banned checksums. That's true. But what about once people know they're using this system? They can just trivially re-encode. Perhaps add a scene break here or there, and totally mess up the fingerprint. To prevent that, it seems, you would need to solve a hard-AI problem: that is, be able to determine if an arbitrarily-encoded video appears to a human to match some copyrighted work. It would have to be robust against minor scene shortenings and lengthenings, scene breakups, color gradients laid over the video, etc.

    Anyone know how difficult this program is to circumvent? (Just hypothetically -- not advocating criminal activity here.)

    1. Re:Hard AI ftw by Udderdude · · Score: 2, Interesting

      They could potentially scan the entire movie for a few keyframes that they know will be in there reguardless of silly scene breaks, etc.

      Nobody would know what the keyframes are, so it would be hard/impossible to black out that specific frame.

    2. Re:Hard AI ftw by MyLongNickName · · Score: 4, Insightful

      I think you miss the point.
      1) Is there a way around the system? Yes.
      2) Does that matter? No.
      3) Why is that? This solution shows that Google is making reasonable efforts to comply with the legal issues.

      The majority of folks aren't going to take the effort to circumvent these controls. Rates will drop significantly. Google can honestly say they are making every effort to comply with copyright protection. Lawsuits will go away.

      --
      See my journal for slashdot ID's by year. Mine created in 2005. http://slashdot.org/journal/289875/slashdot-ids-by-year
    3. Re:Hard AI ftw by Nasarius · · Score: 3, Insightful

      There are already several decent systems for fingerprinting audio; it's not particularly surprising that Google researchers would be able to do something similar for video.

      --
      LOAD "SIG",8,1
    4. Re:Hard AI ftw by Hatta · · Score: 3, Insightful

      Mostly it's an assumption. Since only humans so far have been able to recognize video, then it must take something like a human to do it. Viewing clip A and clip B and abstracting the parts that make them the same is not the kind of problem computers are good at, and if you've done it you've probably solved the really hard problem in AI, how abstraction works. Of course this rests on the assumption that strong AI is fundamentally different from weak AI, and the difference is not just one of degrees. Personally I think that's a specious distinction much like that between microevolution and macroevolution.

      --
      Give me Classic Slashdot or give me death!
    5. Re:Hard AI ftw by sholden · · Score: 4, Interesting

      So I guess it doesn't matter if it then blocks a video which has taken a couple of seconds of video from a TV show in a "Review of Episode X" video post, and just happens to grab one of those keyframes?

      Of course a little bit of coding and you have a program that takes that 10 minute video, splits it into 10 1 minute videos and uploads them. The ones that got rejected it splits into 10 6 seconds videos and uploads them. Rinse and repeat until you have however small an set of rejections you asked it for. Then it cuts out just the necessary fragments of videos (replacing them with the last good frame or something?).

      Of course that can be worked at google's end by adding a delay to the report rejection step, and by banning those who get lots of rejects.

    6. Re:Hard AI ftw by DDLKermit007 · · Score: 4, Interesting

      Sorry, but fingerprinting the video is damn near impossible. Closest any image recogniton app can go is similar scene right now. Some can do logos, but were talking low resolution videos. Pretty much they'll do a file checksum, and that'll kill most reuploads. For the determined. All that'll be needed is a simple re-encode. That'll completely change any checksums involved. They won't seriously spend that much machine time on checking if they are allowed to make money off a video. They'll do just enough to say they did something, but not so much you can't find a way around it.

    7. Re:Hard AI ftw by utopianfiat · · Score: 5, Insightful

      It's not going to get shut down because of copyright infringement. :/ That's like saying we should bomb Hong Kong because they sell copyrighted works there- just because something has an illegitimate use doesn't make it illegitimate on face ffs.
      Note, this can also be applied to "kitchen knives can kill so we should ban kitchen knives." and "people can die in cars so we should ban motor vehicles"
      and uh... "People who have killed a lot of people have played video games, so we should ban video games." The states needs to get over the damn prohibitionist culture that's removing any sense of personal responsibility from our great nation.

      --
      +5, Truth
    8. Re:Hard AI ftw by Hatta · · Score: 2, Informative

      Sure it is. So is the heliocentric model of the solar system, Einstein's theory of special relativity, and quantum electrodynamics. It's just one of the best tested, best supported, and most theoretically fruitful theories we have.

      --
      Give me Classic Slashdot or give me death!
  2. separation of the web by 4solarisinfo · · Score: 4, Insightful

    As soon as Google stops indexing/posting material people want (legal or not) people will stop using Google. I believe they know what a fine line they're walking between 'do no evil' and survival here, I wonder which will pervail?

    1. Re:separation of the web by drrck · · Score: 2, Informative

      I've posted numerous video clips to Google Video and YouTube. I have recently received 3 e-mails from Google telling me that I have been flagged by the copyright holder. Subsequently I have already stopped using Google Video.

    2. Re:separation of the web by 4solarisinfo · · Score: 3, Insightful

      For the unoffocial record - are you the copyright holder of these clips, or could they legitimate requests?

    3. Re:separation of the web by 4solarisinfo · · Score: 2, Insightful

      Soo... the system works, and "do no evil" gets a point.

    4. Re:separation of the web by radish · · Score: 3, Interesting

      You think? Here's a history lesson. The generation currently in power were all pot-smoking hippies back in the 60's and 70's, but anti-drug laws just keep on getting stronger. Teenagers and 20-somethings have always had this wacky belief that they are heralding a new way of thinking, and eventually when all the old gas-bags die we'll have a utopia. Somehow it never seems to happen.

      --

      ---- Den ene knappen er powerknapp, den andre er Bender voice knapp "Bite My Shiny Metal Ass"

  3. Amazon Mechanical Turk by InvisblePinkUnicorn · · Score: 5, Funny

    That would be the ultimate mechanical turk - sitting around watching youtube videos all day and getting paid... in addition to what you are already being paid as you put off work to watch youtube videos all day.

  4. Obfuscation? by HitekHobo · · Score: 3, Funny

    Can I just filter the video to change the general shape and size of the content and scribble all over it until humans can't recognize it? Seems to work for websites that require a signup... I had one the other day that took 4 people and 5 attempts to actually sign up.

  5. Two-part Protection by RancidPickle · · Score: 3, Interesting

    One part is the same -- someone spots "their" video, they take it down immediately to avoid getting sued under the DCMA. Expect the takedown notices to continue, which will still kill parody videos, fair-use compliant videos, and videos that are legal, but someone sends bogus takedown notices, such as the ones that Viacom "accidently" included in their original request.

    The second part sounds more promising, but someone may be able to get around hashing the videos, such as inserting random one-frame images, as in the Fight Club movie, or adding in overlay text, or possibly adding in effects. If they try to hash a few selected time slices, someone will figure it out eventually. As with all digital protection, this just pushes off the inevitable. At least it will make Google look good in court, since they're attempting to comply with Viacom and the other copyright holder's requests for not posting their material.

    In the end, it won't count for much. It would make more sense to add in additional protections for false or malicious takedown notices, such as adding in a $50K fine for false claims. This would at least make the big companies scrutinize the videos that they're issuing a takedown notice for.

    --
    "First things first, but not necessarily in that order."
    - Doctor Who
  6. As "sophisticated" as FBI fingerprinting? by dpbsmith · · Score: 2, Insightful

    We don't want "sophistication," we want reliability.

    And since they are making the comparison... just how reliable
    are fingerprints, really?

    True, a character in Mark Twain's 1893 novel Pudd'n'head Wilson tells a court

    "Every human being carries with him from his cradle to his grave certain physical marks which do not change their character, and by which he can always be identified -- and that without shade of doubt or question. These marks are his signature, his physiological autograph, so to speak, and this autograph canImage available not be counterfeited, nor can he disguise it or hide it away, nor can it become illegible by the wear and mutations of time. This signature is not his face -- age can change that beyond recognition; it is not his hair, for that can fall out; it is not his height, for duplicates of that exist; it is not his form, for duplicates of that exist also, whereas this signature is each man's very own -- there is no duplicate of it among the swarming populations of the globe! This autograph consists of the delicate lines or corrugations with which Nature marks the insides of the hands and the soles of the feet."

    and ever since Mark Twain said so everyone has believed it, but that doesn't necessarily make it true.

    1. Re:As "sophisticated" as FBI fingerprinting? by westlake · · Score: 3, Informative
      And since they are making the comparison... just how reliable are fingerprints, really?

      The Newman link is from 2001.

      The judge who decided the original Llera-Plaza motion, which is discussed and critiqued in the following article, reversed himself on March 13, 2002, holding that expert evidence of a "match" was admissible. Judge Pollak had granted the Government's motion for a reconsideration that is mentioned above, and he also reopened the record to hear additional testimony for the prosecution as well as for the defense. In reversing himself in a 60-page opinion, Judge Pollak stated, in part, "In short, I have changed my mind.' The Reliability of Fingerprint Evidence: A Case Report

      You'll find links here to many articles on Identification Evidence. For example: Phenotype vs Genotype: Why Identical Twins Have Different Fingerprints

  7. Dumb. Really dumb. by Anonymous Coward · · Score: 4, Insightful

    The supposedly clever media moguls are missing a wealth-building opportunity. Lots of these "infringing videos" are short clips from longer presentations. If they had any smarts at all, they'd ask Google to set up a link on those pages where people could buy the programs/music on disk, or direct download them for a fee. Instead, the moguls want to get rid of what amounts to "free advertising" because they fear the new paradigm.

  8. Just the start by sjonke · · Score: 4, Funny

    YouTube Video Strip-Searching is due in January '08

    --
    --- What?
  9. Re:Why isn't Google fighting this out in court? by eclectro · · Score: 3, Insightful

    Well, it's the cost of fighting a copyright battle, and also the dark possibility that the judge would side with the copyright holders which they almost invariably always do these days.

    Take that and the fact that Google is actually a big fat cash cow with a bulls-eye on the side of it and it becomes obvious that the best strategy is one of accomadation. Rather than a long drown out battle that would also hurt googles stock price because of the uncertainity it creates.

    So anyway you cut it, this looks like the best route for them to take. Maybe google could throw some lobbyists on congress to address the copyright abuse that copyright holders are getting away with.

    --
    Take the cheese to sickbay, the doctor should see it as soon as possible - B'Elanna Torres, "Learning Curve"
  10. 'infringement' by SCHecklerX · · Score: 2, Interesting

    Same as with music. If people are going to buy it, they will. Just charge a fair price. Use youtube as advertisement for commercial interests (daily show, colbert report, robot chicken, anyone?)

    But youtube is a little different in that many of the things people go there for are unique or one-time things that the only way you'll ever get a chance to see them again is if you recorded it yourself, or somebody else does and you are lucky enough to find it online.

    The biggest issue I have is stuff that you'll NEVER BE ABLE TO ACTUALLY BUY OR SEE AGAIN being taken down. My favorite example is prince performing at half time for the superbowl. Now, not only are the videos gone from youtube, but also all of the comments (which IMHO are equally as valuable to the community) about the videos.

    Taking things like this down erodes our culture and destroys valuable records of what has gone on in our lives.

  11. Wide open for abuse by AmiMoJo · · Score: 2, Insightful

    How is this going to work? Will Google process all copyrighted videos themselves and produce the necessary data to block them? If so, what is the backlog going to be when big media submits 90 years of video?

    If Google are not going to check it, what is to stop me downloading a Quicktime trailer of a movie, generating the data and submitting it to Google for blocking? It will quickly become impossible for even sanctioned videos to appear. Cultists/Scientologists will be screwed too.

    As usual, media companies are being idiots. They paniced about the VCR, they paniced about P2P, they are panicing about DVRs and YouTube. In the end, new technology tends to do them good in the long run and besides which, you can't fight it.

    --
    const int one = 65536; (Silvermoon, Texture.cs)
    SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  12. fingerprinting video is trivial by heinzkunz · · Score: 2, Informative

    Philips has a video fingerprinting system. From the site:

    The system is robust against severe degradations like low bit rate video compression, scaling, rotation, cropping, noise addition, median filter and noise removal. [...]
    A 5 second video fingerprint on any segment of video content is sufficient to uniquely identify that segment.


    You obviously need more than a simple re-encode to get around that and I'm sure Googles system won't be fooled by simple tricks either.