Hulu Munging HTML With JS To Protect Content
N!NJA writes "Hulu has started encoding the html that they send to people's browsers, and then decoding it using javascript before rendering it. [...] They then run the character stream through a series of javascript functions to convert it back into plain text before pushing it into your browser using DHTML. That's quite a lot of effort just for fun, so I assume that is to stop screen scrapers from parsing content." I really can't understand all this effort. Boxee displayed the Hulu advertising perfectly. I suspect Alec Baldwin is to blame.
they're aliens. that's how they roll.
It sounds like there's something ROT-13 in the state of Hawaii.
...ended at midday yesterday. Though I have to admit that this is far funnier than the "stories" that Slashdot ran at the time.
The XBMC guys already made a plugin after the last hulu change. It'll take a few hours and a new one will be made.
Especially if you SEND the user all the info they need, how hard is it to decode functions? There are crackers out there that take decoded assembly to figure out how to bypass DRM, what makes Hulu think their implementation will be any more difficult?
Couldn't an enterprising screen-scraper also just run it through the same Javascript code? Hulu is forgetting what I like to call the Fundamental Law of DRM: if you make data possible for users to see /hear, it will be possible for a reasonably enterprising user to copy it.
I am officially gone from
TunerFreeMCE couldn't scrape the data. Mission accomplished. Oh, wait... Tada:
"Update- version 2.6.7 is now available to download to work round this new tactic."
And now, I supposed, there will be a DMCA attack as phase two.
They *want* you to go back to watching regular TV, where the ad revenue is greatest.
If you can't see the value in jet powered ants you should turn in your nerd card. - Dunbal (464142)
As long as Hulu continues to work with a Linux-based browser, I'm happy. This is unlike ABC, whose system doesn't support Linux at all.
Their loss (or perhaps I should say "They're Lost").
"My country, right or wrong; if right, to be kept right; and if wrong, to be set right." --Senator Carl Schurz (1872)
> if they wanted aggregates to link to their content I would think hulu would have provided an API to allow it.
They did. It's called the hypertext transfer protocol.
25% Funny, 25% Insightful, 25% Informative, 25% Troll
Make the viewer fill it in every ~2 minutes to keep watching.
Entomologically speaking, the spider is not a bug, it's a feature.
They *want* you to go back to watching regular TV, where the ad revenue is greatest.
As you probably know, that cat's not going back into the bag. I wonder whether the inability to admit this and work with it is a special trait of media companies or if it's just true of large organizations in general?
It is a miracle that curiosity survives formal education. - Einstein
Hulu is a joint venture of NBC Universal and Fox Entertainment Group. The Hulu management might not precisely be content providers, but the folks holding the purse are.
Nerd rage is the funniest rage.
These guys do understand that nothing prevents me from plugging my laptop into a TV and running a browser on it? And nothing prevents me from plugging a tuner card into my computer and showing TV on the monitor? So regardless of what they do, they can't make something show on a computer but not on a TV?
Wait a minute, my assistant is handing me an envelope he says will explain everything.
(envelope opening noises)
The note inside says "They're total idiots".
Yep, that does explain everything.
The particular situation here deals with compressed/encoded HTML in an effort to prevent screen-scraping. This leaves two options for screen scrapers:
Option 1
1) Figure out how the decoder works
2) Replicate the decoder functionality in the screen scraper
3) Parse the decoded HTML
4) Make changes as the encoding scheme changes
5) ???
6) Profit!
Option 2
1) Link a Javascript engine like SpiderMonkey, Rhino, V8, or SquirrelFish into the screen scraper
2) Run the Javascript to decode the HTML
3) Parse the decoded HTML
4) ???
5) Profit!
Javascript + Nintendo DSi = DSiCade
Hulu is a BRAND. It wants to live in its own world and be exclusive.
So their attitude is "Frak Boxie", as boxie is trying to DESTROY the brand of all the video sites to be replaced by the Boxee brand.
Why should Hulu play nice?
Test your net with Netalyzr
Absolutely.
It's been cute these last 10 years watching companies try to put things on the Internet and monopolize the information they put up. If you don't require user authentication, it's public.
If you want to piggy back in a web browser, with a public protocol like HTTP, expect people to interact with your server in unintended ways.
The only way to prevent this is to invent your own propietary protocol, and your own client. And even this doesn't prevent reverse-engineering of the protocol to gain access.
If you do decrypt it without authorization, they can claim you're in violation. It's not about the technical merits of their solution, it's about the legal aspect.
And to anyone complaining about having to dance through proxies to watch Hulu internationally, it's for the same reasons. What benefit does Charmin see from advertising toilet paper to people in the Netherlands?
This is where the MBA and Marketing guys are falling down on the job. They should be selling regional ads for international viewers... instead of Charmin, they could sell Nokia ads for Dutch viewers, Weetabix in the UK, and Nutella in Italy, etc...
This is not actually the worst web DRM. I once found a site where the top of the code had a comment that said "Source code not available" followed by a bunch of blank lines. In order to get the source, one just had to scroll down some.
Which, of course, would make the scroll bar an anti-circumvention device.
This is probably to stop Lynx browsers from properly displaying content. I'm betting this move was backed by bribe money. Clearly this is aimed at reducing compatibility with Lynx. MS is just trying to steal away market share.