Websites Complaining About Screen-Scraping
wilko11 writes "There have been two cases recently where websites have requested the removal of modules from CPAN. These modules could be used to access the websites (EuroTV and Streetmap) from a PERL program. The question being asked on the mailinglists (threads about EuroTV and about Streetmap) is 'can companies dictate what software you can use to access web content from their server?'"
Everyone's assuming the appropriate rules here are from copyright law, which allow you to protect the expression of an idea but not the idea itself. That's probably right. It's not the way some big organizations want to play.
In the United States, most major sports leagues (NFL, NBA, NHL, MLB, etc.) believe that they own the rights to real time scores, and can permit or restrict any desired use. I ran into this at a previous job: we could "broadcast" football, basketball, and hockey scores at the end of every "period," and baseball scores at the end of every half inning, but we couldn't send updated broadcasts for every new score. That information needed (so said the leagues) to be licensed, and most of it had been exclusively licensed for the medium (Internet) we were interested in.
Do they have a legal leg to stand on? No. (IANAL.) Are they leaning on a great, big, huge stick with nails driven through it? Apparently.
Stupid job ads, weird spam, occasional insight at
Here is some more info
There's no law stating that we have to look at ads.
What about 17 USC 106, which states that barring fair use, etc., the copyright owner has the right to prevent others from creating derivative works of a web page?
Will I retire or break 10K?
The problem with the examples you gave is that the first 2 fail the human comprehension test as well.
The first one could be (orca, oyca, oycd, orcd).
The second asked: "What are these pictures of?" and gave 6 pictures. I assume that they want an answer that applies to all 6 pictures, but damned if I could come up with a common theme for all 6.
Now, if I went to a site that used this technology for something, I would get frustrated and leave. Kinda defeats the purpose of using it in the first place don't you think?
Go Here for discussion last summer over at Perlmonks.
UNIX/Linux Consulting
Comment removed based on user account deletion
If its purely internal then they should use a VPN and/or intranet and keep their stuff OFF the web.
The web is about as private as yelling at the top of your lungs at a karaoke competition. Anybody who thinks they can tell you to listen with one ear or the other is dumb.
MSBPodcast.com The opinions expressed here are my own. If you don't like 'em... Think up your own stuff.
If they dont want people to use the information the way they do, why the hell are they publishing it on servers connected to a network not controlled by them...
I mean seriously, are they now telling us what packets and requests we are allowed to send over the internet?
By hosing an internet server they are accepting people can connect to it and send the data they like. If they dont like it, they should try and outsmart people with clever protecting software, or host it on their own private lans.