YouTube Video-Fingerprinting Due in September
Tech.Luver writes "The Register is reporting on Google's statement to a presiding judge that video-fingerprinting of YouTube material will be ready in September. The development is required to head off a three-headed suit against the company, currently being debated in a New York City courthouse. The system will, according to Google, 'be as sophisticated as fingerprinting technology used by the Federal Bureau of Investigation.' From the article: 'As Google told El Reg in an earlier conversation, the company already has two systems in place for policing infringing content - but neither are ideal. One system allows copyright holders to notify Google when they spot their videos on the company's sites. When notified, the company removes the offending videos, in compliance with the American Digital Millennium Copyright Act. A second system uses "hash" technology to automatically block repeated uploads of infringing material.'"
Earlier I had joked about Google's claim to be nearing a system that lets them check for copyrighted works. I said that they're basically claiming to have solved a hard AI problem.
Others pointed out that, no, it's not a hard AI problem to just compare some kind of checksum of the video against a set of banned checksums. That's true. But what about once people know they're using this system? They can just trivially re-encode. Perhaps add a scene break here or there, and totally mess up the fingerprint. To prevent that, it seems, you would need to solve a hard-AI problem: that is, be able to determine if an arbitrarily-encoded video appears to a human to match some copyrighted work. It would have to be robust against minor scene shortenings and lengthenings, scene breakups, color gradients laid over the video, etc.
Anyone know how difficult this program is to circumvent? (Just hypothetically -- not advocating criminal activity here.)
Apology to Ubuntu forum.
As soon as Google stops indexing/posting material people want (legal or not) people will stop using Google. I believe they know what a fine line they're walking between 'do no evil' and survival here, I wonder which will pervail?
Gootube fingerprints you!!
Oh wait.....
Shit.
That would be the ultimate mechanical turk - sitting around watching youtube videos all day and getting paid... in addition to what you are already being paid as you put off work to watch youtube videos all day.
Can I just filter the video to change the general shape and size of the content and scribble all over it until humans can't recognize it? Seems to work for websites that require a signup... I had one the other day that took 4 people and 5 attempts to actually sign up.
A couple of 30-somethings embark on the ultimate roadtrip
One part is the same -- someone spots "their" video, they take it down immediately to avoid getting sued under the DCMA. Expect the takedown notices to continue, which will still kill parody videos, fair-use compliant videos, and videos that are legal, but someone sends bogus takedown notices, such as the ones that Viacom "accidently" included in their original request.
The second part sounds more promising, but someone may be able to get around hashing the videos, such as inserting random one-frame images, as in the Fight Club movie, or adding in overlay text, or possibly adding in effects. If they try to hash a few selected time slices, someone will figure it out eventually. As with all digital protection, this just pushes off the inevitable. At least it will make Google look good in court, since they're attempting to comply with Viacom and the other copyright holder's requests for not posting their material.
In the end, it won't count for much. It would make more sense to add in additional protections for false or malicious takedown notices, such as adding in a $50K fine for false claims. This would at least make the big companies scrutinize the videos that they're issuing a takedown notice for.
"First things first, but not necessarily in that order."
- Doctor Who
We don't want "sophistication," we want reliability.
And since they are making the comparison... just how reliable are fingerprints, really?
True, a character in Mark Twain's 1893 novel Pudd'n'head Wilson tells a court
"Every human being carries with him from his cradle to his grave certain physical marks which do not change their character, and by which he can always be identified -- and that without shade of doubt or question. These marks are his signature, his physiological autograph, so to speak, and this autograph canImage available not be counterfeited, nor can he disguise it or hide it away, nor can it become illegible by the wear and mutations of time. This signature is not his face -- age can change that beyond recognition; it is not his hair, for that can fall out; it is not his height, for duplicates of that exist; it is not his form, for duplicates of that exist also, whereas this signature is each man's very own -- there is no duplicate of it among the swarming populations of the globe! This autograph consists of the delicate lines or corrugations with which Nature marks the insides of the hands and the soles of the feet."
and ever since Mark Twain said so everyone has believed it, but that doesn't necessarily make it true.
"How to Do Nothing," kids activities, back in print!
The supposedly clever media moguls are missing a wealth-building opportunity. Lots of these "infringing videos" are short clips from longer presentations. If they had any smarts at all, they'd ask Google to set up a link on those pages where people could buy the programs/music on disk, or direct download them for a fee. Instead, the moguls want to get rid of what amounts to "free advertising" because they fear the new paradigm.
...
The trouble with the first system is that neither Google nor the copyright holders can possibly keep up with the vast number of copyrighted videos uploaded each day. What exactly is the compelling legal argument that spawned three lawsuits?
That GooTube isn't complying with the DMCA?
That complaince with the DMCA isn't enough?
Depending on your POV, the 'right' thing to do is either to create new filters (business), or to try and win the lawsuits (users).
[Fuck Beta]
o0t!
YouTube Video Strip-Searching is due in January '08
--- What?
Alright kiddies,
Step1. Get out your laptop & your random MAC address generator toolkit
Step2. Drive down some random street until you find...
Step3. A neighbour with unprotected WIFI (or just crack their non-WPA2 secure connection)
Step4. Carry on & upload your Simpson's episodes to Youtube.
Step5. Cause profits (loss) for Simpson's authors
NEXT!
No trees were killed in the making of this post; however, many trillions of electrons were horribly inconvenienced.
Appearantly you never had a boss who would shoot down every idea that isn't his own. Usually, businesses with a boss like that go under in pretty short time.
In other words, I have my hopes up that we might get rid of them pretty soon.
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
Same as with music. If people are going to buy it, they will. Just charge a fair price. Use youtube as advertisement for commercial interests (daily show, colbert report, robot chicken, anyone?)
But youtube is a little different in that many of the things people go there for are unique or one-time things that the only way you'll ever get a chance to see them again is if you recorded it yourself, or somebody else does and you are lucky enough to find it online.
The biggest issue I have is stuff that you'll NEVER BE ABLE TO ACTUALLY BUY OR SEE AGAIN being taken down. My favorite example is prince performing at half time for the superbowl. Now, not only are the videos gone from youtube, but also all of the comments (which IMHO are equally as valuable to the community) about the videos.
Taking things like this down erodes our culture and destroys valuable records of what has gone on in our lives.
Bingo! I already posted, or I would mod you up.
Google is already complying with the letter of the DMCA. What right does any other organization have to compel Google to go beyond that and spend a fortune creating a video fingerprinting system? Google ought to be fighting this, not bending over and doing whatever large media companies demand.
Doing this is manifestly against the interest of the people who made Google what it is today. What happened to doing no evil?
Funny- I can't post a home video with copyrighted background music on MSN Soapbox - they refused to publish it. But it works great in GooTube. A previous writer hit the nail on the head - when Google pulls down all of the illegal content, GooTube will turn into a GooGhostTown...
Are they gonna sue Google because its search engine aids in the acquisition of copyrighted content (like when you search for torrents)?
Phone companies and ISP's claim that since they are "Common Carriers" just carrying content from other people without any knowledge/filtering of the content, that they can't be sued if naughty/illegal stuff is carried on their wires. If Google has pro-active measures in place to filter submissions, then won't they lose that protection and can then be sued for anything that slips through?
Hulk SMASH Celiac Disease
People will find another outlet. If the content that people are looking for is not on youtube then they will abandon the site for somewhere else.
The content industry should take a lesson learned from the past. Right now they have a large concentration of people looking at grainy, low resolution video in one place. Remove that and the sites will go underground, and maybe with even better quality video which would be a real threat to their model. They should take the opportunity to promote their product - here is the low resolution preview, click here for a clean high res version for a small fee. Make a tie-in to portable device downloads. There are lots of possible marketing opportunities. However, once they shut down one of the primary reasons for visiting youtube and the content becomes decentralized then that becomes a lot harder to do.
eg. in a gimp plug-in.
But yes, it will reduce it. By how much?
I don't know. Maybe a lot.
It's not as hard a problem as you think. There are algorithms that could recognize a short segment of video even if re-encoded.. it's like iris recognition. They can recognize your iris even though the new image may be rotated and scaled. It's all about transforming the image so that these kinds of trivial transformations are made to disappear, and then comparing the result to a bank of pre-stored images.
The
Another popular media site imeem has been using audio fingerprinting on user content for a while now, they have deals with the usual clutch of indie labels and more interestingly they have a deal with Warner Brothers (who up until a month ago were suing imeem for copyright violations). The main draw of the site is 'youtube for music', and I guess imeem has some process to detect what is licensed and what isn't because some uploads are turned into 30 second previews if they're not covered by one of imeem's deals.
If you're the kind of person who finds themselves wasting all their time surfing youtube you had better stay away from imeem because it will suck you in forever.
Thing is, the content owners will have to provide the "fingerprint," or at least the video from which the "fingerprint" will be made. Now, while I can see the bigger corporations taking the time, effort and money to do this for current and recent popular TV shows and movies, a lot of the more obscure stuff is still going to be around. Part of the fun and appeal of sites like YouTube is getting a chance to see or re-see more esoteric (i.e., not very commercially viable) clips -- old TV shows with a cult following, but no chance of being released on DVD; old commercials for no longer existent products; continuity clips (promos, station IDs, etc.); old local newscasts, etc. The rights to all of these things are still held by someone, somewhere, but much of it will still be up on YT -- just the stuff that no one (except for the small base of fanatics that appreciate it) cares about. I mean, is someone going to "fingerprint" every episode of "Hello, Larry" or some equally stinkeroo old show just in case someone uploads a few episodes that they recorded on their Betamax way back when?
"Every great cause begins as a movement, becomes a business, and eventually degenerates into a racket." -- Eric Hoffer
Of course, such a fingerprinting scheme ignores fair use. If a match is found, the video will automatically be flagged as infringing without concern for context and prevented from being uploaded without any oversight.
It's like being found guilty for murder because your fingerprints were found at the scene... on some groceries you bagged for the victim at the supermarket a week earlier, but that is of no concern to the justice droid.
Oh, say does that Star-Spangled Banner entwine / The myrtle of Venus with Bacchus's vine?
Someone will just create another video site and host it from an oil derrick somewhere in international waters.
It's time for NEW-Tube!! (tm)
you could easily beat any hash of a video with a watermark hue shift of say, 1 value in the red. The video would be unintelligibly different to the naked eye, but each and every frame would be significantly different than an original based on the exact video hash. Is this not how it would work?
stuff |
How is this going to work? Will Google process all copyrighted videos themselves and produce the necessary data to block them? If so, what is the backlog going to be when big media submits 90 years of video?
If Google are not going to check it, what is to stop me downloading a Quicktime trailer of a movie, generating the data and submitting it to Google for blocking? It will quickly become impossible for even sanctioned videos to appear. Cultists/Scientologists will be screwed too.
As usual, media companies are being idiots. They paniced about the VCR, they paniced about P2P, they are panicing about DVRs and YouTube. In the end, new technology tends to do them good in the long run and besides which, you can't fight it.
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
Wonder if it's got anything to do with Kadokawa Group's deal with Google. I found it amusing that they were helping develop this kind of system for Google, since they're probably one of the few groups that have really gotten some sales/publicity boost from user-generated content on Youtube and NiconicoVideo.
As will its userbase.
---- Booth was a patriot ----
It WILL bite them in the ass for the very reason laid out in the DMCA itself:
You'll need to read the rest of the cited sections to get a complete understanding of the process, but here's how it goes:
So when YouTube gets a takedown notice and the person counter-claims, 'Tube has a choice: either restore the video or be liable for a shiny new lawsuit.
Yeah, right.
That could kill fair use. Any keyframes that would be in every reasonable edit of a film or TV show would be critical to many short non-infringing excerpts from that film or TV show.
Pity the MPAA doesn't believe in fair use....
There is a fine line between recklessness and courage... -- Paul McCartney
yes, you're right. Also, it's surely compressed, and random noise will reduce the compressed size.
However, I think only a little noise would be necessary. Eg. +1 or -1 brightness, on one pixel per screen.
Hmmm... or maybe just different compression parameters? This changes the artifacts introduced into the video, and so the uncompressed video would look slightly different.
Philips has a video fingerprinting system. From the site:
The system is robust against severe degradations like low bit rate video compression, scaling, rotation, cropping, noise addition, median filter and noise removal. [...]
A 5 second video fingerprint on any segment of video content is sufficient to uniquely identify that segment.
You obviously need more than a simple re-encode to get around that and I'm sure Googles system won't be fooled by simple tricks either.
That tweaks your video a bit before uploading it. Probably the simplest way around a fingerprinting, even if it actually works as claimed (which is extremely doubtful) is to add a 5-second lead segment (titles, etc.) which makes its 5-second sampling invalid.
Take 1 frame every 2 minute from a source material (of a scene you want banned), choosing ones where there is no motion for (say) 3 frames. The two minute skip that means that you get a 3000 to one compression on the number of frames you're looking for, and the three frames of stillness means
(1) that the frame will compress well (making it a better match), and
(2) that you only have to look at 1 out of three frames in uploaded material - and even that you can ignore high motion scenes entirely..
Combined these methods and you can probably increase your search speed by 15,000 (!) times.
Once you have the original video, it is a simple case of pattern matching to find how close the two videos are. Google could say that any video that matches a copyrighted video close to 80% or more is not allowed on youtube.
I have since abandoned Youtube when I found this:
http://stage6.divx.com/
The image quality is amazingly good, totally opposite to Youtube and flash video in general.
Also, and very important to me (because I multitask), it consumes about the same CPU in both cases. Probably flash video consumes a little more.
We are Turing O-Machines. The Oracle is out there.