Hulu Munging HTML With JS To Protect Content
N!NJA writes "Hulu has started encoding the HTML that they send to people's browsers, and then decoding it using JavaScript before rendering it. [...] They then run the character stream through a series of JavaScript functions to convert it back into plain text before pushing it into your browser using DHTML. That's quite a lot of effort just for fun, so I assume that is to stop screen scrapers from parsing content." I really can't understand all this effort. Boxee displayed the Hulu advertising perfectly. I suspect Alec Baldwin is to blame.
The XBMC guys already made a plugin after the last Hulu change. Give it a few hours and a new one will be made.
Especially if you SEND the user all the info they need, how hard can it be to reverse the decoding functions? There are crackers out there who read disassembled machine code to figure out how to bypass DRM. What makes Hulu think their implementation will be any more difficult?
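To make that point concrete, here is a rough sketch in Python. The scheme is made up for illustration (a single-byte XOR plus hex encoding, with a hypothetical KEY); it is NOT Hulu's actual encoding, which the summary does not spell out. The idea is only that whatever decoder ships to the browser can be copied by a scraper.

```python
# Hypothetical scheme for illustration only; this is NOT Hulu's real encoding.
KEY = 0x5A  # assumed single-byte key that would ship inside the page's script


def encode(html: str) -> str:
    """What a server might do: XOR each UTF-8 byte with KEY and emit hex."""
    return bytes(b ^ KEY for b in html.encode("utf-8")).hex()


def decode(blob: str) -> str:
    """What the page's script would do, and what any scraper can copy."""
    return bytes(b ^ KEY for b in bytes.fromhex(blob)).decode("utf-8")


if __name__ == "__main__":
    page = "<a href='/watch/123'>Episode 1</a>"
    blob = encode(page)
    # The markup is no longer visible to a naive HTML parser...
    print("href" in blob)
    # ...but anyone holding the decoder gets it straight back.
    print(decode(blob))
```

Since the browser has to be handed both the blob and the decoder, a scraper only needs to port (or simply run) the same function. It raises the cost of scraping a little, but it hides nothing.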
TunerFreeMCE couldn't scrape the data. Mission accomplished. Oh, wait... Tada:
"Update- version 2.6.7 is now available to download to work round this new tactic."
And now, I suppose, there will be a DMCA attack as phase two.
Yes, in fact, HtmlUnit is my preferred browser simulation library in Java for this very reason: it lets you write Java code that is very easy to understand, and it uses Rhino as its JavaScript interpreter. Completely brilliant, and yet few people know about it.