Canadian Court Finds Website Scraping Infringes Copyright

← Back to Stories (view on slashdot.org)

Canadian Court Finds Website Scraping Infringes Copyright

Posted by timothy on Sunday October 2, 2011 @03:19AM from the didja-see-the-eula-punk dept.

First time accepted submitter wrecked writes "A trial judgment from British Columbia, Canada, found that Zoocasa, a real estate search site operated by Rogers Communications, breached copyright by scraping real estate listings and photos from Century 21 Canada. The decision thoroughly reviews the issues of website scraping, Terms of Use, 'Shrink Wrap' and 'Click Wrap' Agreements, robots.txt files, and copyright implications of hyperlinking. For American readers used to multi-million dollar damages, the court here awarded $1,000 (one thousand dollars) for breach of the Century 21 website's Terms of Use, and statutory copyright damages totalling $32,000 ($250 per infringing real estate photo). More analysis at Michael Geist's blog, and the Globe & Mail."

147 comments

Min score:

Reason:

Sort:

Curious question, by pecosdave · 2011-10-02 03:26 · Score: 4, Interesting

those crappy robot sites that like to take my comments on other web sites and message boards and repost them willie-nilly all over the place in hopes of attracting ad revenue- are those affected by this ruling?
Not like I'm going to file in a Canadian court, but I do find it annoying to have comments showing up all over Google on garbage sites that only exist for a short time and that I've never heard of.

--
The preceding post was not a Slashvertisement.
1. Re:Curious question, by Kjella · 2011-10-02 04:48 · Score: 1
  
  Well that depends on the Terms of Service of the board where you posted them - both what rights you signed away and what rights others get - but I would imagine for most of these fly-by-night scraping sites yes. It'll probably be just as hard bringing those to court as spammers though, the chance you'll ever recover any time or money you spend pursuing them is very slim and the award likely the statutory minimum.
  
  --
  Live today, because you never know what tomorrow brings
2. Re:Curious question, by Anonymous Coward · 2011-10-02 05:26 · Score: 0
  
  Boo hoo.
  How do you think Google started? They downloaded ("scraped", like we need ANOTHER word for it) the WHOLE internet... recursively.
  You don't HAVE to look at things you don't like if they don't affect you materially, or rather, you only find them astheticly offensive. (It is a mistake to think someone's reuse of something you made necessarily affects you. Copyright has nothing to do with it.)
3. Re:Curious question, by pecosdave · 2011-10-02 05:46 · Score: 2
  
  You're an idiot and you missed the point.
  Google, and other search engines for that matter, index my content (yes by downloading or scraping), keep it internally and point people towards what I've written. Spamdexers, which is what I'm actually complaining about, take my words, put them in a completely unrelated context - sort of like most news commentary shows - and replaster it somewhere else, usually out of context and incomplete in a manner that prevents the users from replying to the original post after a search if thats what they wish to do. Even worse it forces them to sign up to reply to me on the scrapers web site in a manner I'll probably never see and certainly won't respond to if they wish to ask me a question of flame me much as you have and they'll hand over personal information in the process to do it. Practices I would rather not be associated with.
  I really don't see what your point in the last paragraph. Where they to attribute the original post in their repost there would be no questioning their reuse of my statements, however that is very rarely done, usually an effort is made to hide the origin, except for the use of my handle. I generally could care less about copyright on things I post on the web, I care more about how I'm viewed based on what/where I post, and those places that fail to attribute the original to make it appear as though I personally contributed to their site, THAT is an offense of a different kind. It's akin to using someone elses endorsement or logo on your product without permission or compensation.
  
  --
  The preceding post was not a Slashvertisement.
4. Re:Curious question, by Orion+Blastar · 2011-10-02 07:04 · Score: 1
  
  Sure only if you can afford those same high priced lawyers that Century 21 Canada can afford.
  
  --
  Remember, Slashdot does not have a -1 disagree moderation, and no, troll, flamebait, and overrated are not substitutes.
5. Re:Curious question, by pecosdave · 2011-10-02 11:37 · Score: 1
  
  So, you are pushing for socialistic sharing, like Mussolini, but calling me a fascist because I I don't want a spam bot using my name to promote itself.
  You make less sense than Highlander the Source.
  
  --
  The preceding post was not a Slashvertisement.
6. Re:Curious question, by rastoboy29 · 2011-10-02 12:16 · Score: 1
  
  Here you go: http://xkcd.com/810/
  
  --
  expandfairuse.org
7. Re:Curious question, by pecosdave · 2011-10-03 00:24 · Score: 1
  
  Do you spam for a living with a slight guilt thread running through you you're trying to smack down with over-compensation?
  
  --
  The preceding post was not a Slashvertisement.
8. Re:Curious question, by pecosdave · 2011-10-03 00:37 · Score: 1
  
  I'm an open source advocate, I love FOSS and utilize the hell out of it.
  I believe in and defend reasonable copyright protection. Really, I think life +20 seems to be reasonable, it takes care of the author and the authors kids assuming the author croaks with a pregnant wife. I'm open to a bit longer even though I don't really like it. The Mickey Mouse Protection Act was obscene.
  I don't care if people quote me all over the place, the only attribution I want was "I found this quote on Slashdot", or a link to the original, or even just quotation marks, that's good. To embed my words into a web page that makes it look like I personally typed the information into the web page is forgery, and I don't like that. Your stupid ass citations above are well beyond anything I ever wanted or think is reasonable.
  You obviously don't care because you're an AC.
  I chose pecosdave as my handle in 1996 because I couldn't think of why anyone else would want it, and up until a few years ago I was the ONLY pecosdave on the web. I'm annoyed at that other one or two pussies who aren't even from Pecos who've since adopted it the first one about four or five years ago. Rule of thumb, pecosdave on a tech site is me, pecosdave on a porn, personals, or gambling site is not. I still see it as my signature.
  
  --
  The preceding post was not a Slashvertisement.
9. Re:Curious question, by Anonymous Coward · 2011-10-06 11:07 · Score: 0
  
  I never thought I could make a living at it! Sweet.
And Google rejoices by Anonymous Coward · 2011-10-02 03:27 · Score: 0

And hoping that results like this come to America. Web scrapers are ruining the quality of their search engine results.
1. Re:And Google rejoices by hsmyers · 2011-10-02 03:35 · Score: 1
  
  Seems to me that would depend entirely on what they do with the results. If the answer is 'nothing that the web sees again' then no harm, no foul. On the other-hand things quickly gray up if the info is re-posted in an altered form. The guy who scraped Cragslist and made it easier to use comes to mind. Not sure if this qualifies as 'ruining the quality' or not, but the Craigslist lawyers were quick to react. If the guy doesn't (or hasn't caved) it will be interesting to see what happens.
2. Re:And Google rejoices by Calos · 2011-10-02 05:37 · Score: 1
  
  Then again, with Google being accused of scraping Yelp! reviews etc., they might have a lot to lose.
  
  --
  I vote based on politicians' actions, unless contrary to my preconceptions. Often wrong, never uncertain. #iamthe99%
Huh? by Anonymous Coward · 2011-10-02 03:29 · Score: 0

This is relevant to mint.com and others in the financial space that depend on screen scraping as a key part of their solution.
1. Re:Huh? by hedwards · 2011-10-02 03:59 · Score: 1
  
  No, it isn't. You can't copyright data, especially data that's derived from actual events. Theoretically they could copyright the presentation, but Mint and services like that are there to display the data in a different way than just displaying all the other sites.
2. Re:Huh? by gomiam · 2011-10-02 04:06 · Score: 1
  
  You can't copyright data, but the presentation, as you call it, includes the way the text is written. In other words, as long as you tell it your own way without copy-pasting the source you used you are in the clear. I doubt these scraping sites do any meaningful rewriting of their sources, though. And how do you rewrite a picture, by the way, without the data it comes from?
3. Re:Huh? by erroneus · 2011-10-02 04:43 · Score: 1
  
  Zaktly. I was going to point at that same obvious fact. Don't twist law to suit your need for a solution to a problem. That spells abuse of law for you and everyone. Create new law if needed as new law can be more easily ruled against or striken down as needed.
4. Re:Huh? by mysidia · 2011-10-02 04:49 · Score: 1
  
  You can definitely copyright a human-written description of something accompanied by a certain picture. In fact, the picture itself and a human-written description itself can be copyrighted.
  As long as there is some creative element in the description. There are always some creative elements in regards to a photographer's choice of how to take a certain picture of a building/property.
5. Re:Huh? by mysidia · 2011-10-02 04:51 · Score: 1
  
  Lawyers and Judges live to "twist"/ apply existing laws to new types of situations as they arise.
  To suggest there will be a new law governing every possible new technology/practice is unrealistic.
first comment! by Almost-Retired · 2011-10-02 03:32 · Score: 2

I wonder what they were smoking, thinking that was legal to do in the first place?

I know if I was a R.E. agent & somebody scraped all the pictures and other work I had gone to in an attempt to sell a property that I was asked and contracted to sell, I would be yelling copyright violations in court. And if the property was sold, I would normally have a contract that said I got my commission if it was sold within the duration of my contact.

They walk among us, and breed too!

--
Cheers, Gene.
1. Re:first comment! by No,+I+am+Spratacus! · 2011-10-02 03:41 · Score: 1
  
  "First" comment, in excess of five minutes late? I wonder what browser you're using..
2. Re:first comment! by Yvan256 · 2011-10-02 03:52 · Score: 5, Funny
  
  He's probably using Firefox and it had to update to a new major version between each keystroke.
3. Re:first comment! by Anonymous Coward · 2011-10-02 03:57 · Score: 2, Funny
  
  He's probably using Firefox and it had to update to a new major version between each keystroke.
  I'm torn between moderating this as Funny or as Insightful.
4. Re:first comment! by Anonymous Coward · 2011-10-02 04:02 · Score: 0
  
  "R.E. agent & somebody scraped all the pictures and other work"
  Web pages are publicly viewable by anyone. If you do not want people to download the "pictures and other work", there is a simple solution: put them behind a password.
  If you do not do this, then by the very nature of the internet, anyone can come along and download them.
5. Re:first comment! by siride · 2011-10-02 04:11 · Score: 1
  
  Ugh. Just because they are publicly viewable does not mean you can take their content and put it up on your site for your own purposes. If you leave your house door unlocked, it is NOT okay for someone to enter your house and take what they want.
6. Re:first comment! by Almost-Retired · 2011-10-02 04:22 · Score: 1
  
  No, but when I started that comment, it was first, but I often go back and change the wording a bit, so someone else could well have beat me to the actual first post. No biggie IMO, I was just surprised that I might be first.
  
  --
  Cheers, Gene
7. Re:first comment! by Anonymous Coward · 2011-10-02 04:24 · Score: 1
  
  /.'s pregnant pause waiting for server to respond/page to refresh?
  
  You would think this being a techy site it would be a technically 'advanced' experience. Now we know better. I'll put this here while I go grab a power nap...
8. Re:first comment! by Almost-Retired · 2011-10-02 04:26 · Score: 3, Informative
  
  Yeah, well, I waste a lot more time being forced to do a preview before I can submit the post. It takes at least 5 seconds, and often 10+ seconds to get the damned preview back from /. For me, that is a PITA. When I am ready to post my drivel, even if complete with miss-spellings, I am ready.
  
  Oh, and yes, firefox-7.01
9. Re:first comment! by Almost-Retired · 2011-10-02 04:29 · Score: 1
  
  Precisely my point. One could even get lead poisoned doing that.
  
  --
  Cheers, Gene.
10. Re:first comment! by Anonymous Coward · 2011-10-02 04:34 · Score: 0
  
  Sure it does. Claiming that this situation is analogous to leaving a private home unlocked is RETARDED.
  Websites are published works. They are works published freely. They are far more like stuff placed in the rubbish bins by the curb. No "breaking and entering" is required to access them. They are given away freely.
11. Re:first comment! by mini+me · 2011-10-02 04:50 · Score: 1
  
  If you leave your house door unlocked, it is NOT okay for someone to enter your house and take what they want.
  This is more like leaving your possessions on your front lawn with a "Take Me" sign attached.
  The web was designed to be used by any user agent. This is something you accept by developing using web technologies. There are many other protocols in existence that are designed for proprietary systems. If you want control over your content, use the right technology for presenting your content, not the one that comes with a "Take Me" sign attached.
12. Re:first comment! by siride · 2011-10-02 05:13 · Score: 1
  
  Where is this "Take Me" sign? I don't see anywhere in the technology of the web the virtue that you should take other people's stuff and use it as your own. Yeah, linking to it is fine, and that's easy to do and built right in.
  This is to be contrasted with your reasonable right to take any content from the internet that you can access and then use privately as you please. In much the same way, you are free to buy a book, tear pages out, scan and make copies of it and plaster your walls with those copies, write notes in the pages, etc., so long as you don't try to pretend the content in the book is yours and sell it or distribute it as such. If you want to scrape a site and keep it on your harddrive so you can view it when you don't have wifi, I see no problem with that.
13. Re:first comment! by Anonymous Coward · 2011-10-02 05:33 · Score: 0
  
  Websites are published works..
  Quite true. And published works have copyright terms.
  
  They are works published freely.
  Here you are just plain wrong. They are published under whatever terms the publisher says they are published under. They may be free to re-use. They may be free to view, but not to re-use. There are many possible licenses.
  
  They are far more like stuff placed in the rubbish bins by the curb. No "breaking and entering" is required to access them. They are given away freely.
  Now who is getting retarded. No, the scenario is much more like the free as in beer advertisement newspapers you see in some stores - things like "Autotrader" or real estate sales books. If someone copies the pictures and descriptions from those and places them in a competing book (or web site) you can be sure that those companies would sue for copyright infringement.
14. Re:first comment! by realityimpaired · 2011-10-02 05:33 · Score: 2
  
  And in today's lesson, we learn that publishing something and putting it on display for everybody does not give away your copyright or distribution rights. It doesn't matter if I'm handing out pamphlets for free... if you take my pamphlet, scrub my contact information, and then try to resell it as your own work, then you're violating my copyright. If I'm trying to convert you to veganism, that's not really a problem (though it would be annoying), but if that work is in a business context, and is part of my advertising for my services, then it poses a very serious problem. In a real-estate context, where I'm paid commission from both the buyer and the seller, that directly impacts my bottom line, and is a major no-no.
15. Re:first comment! by maxwell+demon · 2011-10-02 07:05 · Score: 1
  
  This is more like leaving your possessions on your front lawn with a "Take Me" sign attached.
  No. Making it publicly readable, even by automated agents, doesn't imply that you can use what you've just read in any way you like, especially not redisplay it in different context.
  If I'm on a public place, I have to accept that I might appear in photographs taken at that place. That doesn't mean that I can't object if such a photograph is then publicly put on display.
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
16. Re:first comment! by Nikker · 2011-10-02 07:18 · Score: 1
  
  Would you be also mad if I scraped your site and it resulted in the sale of your property? /devils advocate
  
  --
  A loop, by its nature, continues. If that didn't make sense, start reading this sentence again.
17. Re:first comment! by Almost-Retired · 2011-10-02 08:09 · Score: 1
  
  The point is, I do not have a sales contract with you, so I am in no way obligated to pay you any commission, only the person/agency I have a contract with (and you don't do anything in R.E. w/o a contract) then the only commission to be paid is to that agency that I do have a contract with.
  
  Further point, why would you even want to expose yourself to a copyright violation when there is not a single farthing in it for you?
  
  Boggles my mind.
  
  So answer me this? What was in it for Rogers Comm. that made it appetizing for them to do this? Some detail in CA law that gives them a payday?
  
  --
  Cheers, Gene
18. Re:first comment! by mini+me · 2011-10-02 08:23 · Score: 1
  
  Making it publicly readable, even by automated agents, doesn't imply that you can use what you've just read in any way you like
  I have visited sites whose Terms of Service required you use Internet Explorer or Netscape. Under the precedence set here, I could just as easily be sued for using Chrome, Safari, Firefox, or even Lynx. None of those user agents preset the website as the owner intended either.
  If you don't allow any user agent to use your content, the whole purpose of the web is lost. Use proprietary non-standard protocols if you want protection from use beyond your control; simple as that. Standard protocols come with the assumption that anyone can implement their own implementation of the standard, just like Rogers did here.
19. Re:first comment! by mini+me · 2011-10-02 08:34 · Score: 1
  
  Where is this "Take Me" sign?
  HTTP, HTML, etc. are written as open standards so anyone can write their own implementation. Users might visit using Chrome, Firefox, Lynx, or a custom web browser that makes browsing real estate listings easier. You understand and accept that by using an open standard to deliver your content.
  Nobody wants a web that is legally limited to just Internet Explorer, we want people to use open standards as open standards. Developers choose HTML and HTTP because it means anyone can use their content using their software of choice, otherwise developers would just develop native software where they have much more control of use restrictions.
  When you put the content out there using defined open standards, you are saying "anyone who supports this protocol may access and use this content." It, again, is the reason why someone chooses an open standard over a proprietary solution. If you want to protect your content, put it somewhere that is not designed to be open for use by anyone.
20. Re:first comment! by maxwell+demon · 2011-10-02 08:59 · Score: 1
  
  If you don't see the difference between using another browser to view the content and using a program to republish the content, I can't help you.
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
21. Re:first comment! by siride · 2011-10-02 09:09 · Score: 1
  
  You are conflating the ease of the protocol for transmitting information to rules about how the content should be used. Because the protocol makes it really easy, therefore, it's okay to just take other people's content. I think my house analogy holds. An unlocked door makes it really easy to get into the house. In fact, it's even designed that way (for easy access to the house). I might even want other people to come to my house, including people I don't know, under the right circumstances. I don't want them to come in at any time and take all my stuff for their own use. By your logic, I would have to make it really hard for them to do that, and if I didn't, then I am basically granting them access to my house.
22. Re:first comment! by rtb61 · 2011-10-02 09:41 · Score: 1
  
  They have access to the buyer. In some real estate sales market, if another real estate agent can come up with a buyer, the real estate agent with the contract will share the sales commission with the referring real estate agent. So the scraping was likely re-directing the potential purchases to a real estate agency other than contract real estate agency, resulting in many shared sales commissions.
  However in principle there is no real cheating going on, Century 21 was just being idiotically greedy. They simply could have refused to accept the purchaser referral from the agency and the commission would not have been shared. Of course they are then taking a chance the property is not sold before the contract expires, allowing the other agent to approach the seller directly and gaining the whole sales commission.
  From the seller point of view, Century 21 is now pushing the bounds, basically disadvantaging the seller by significantly reducing advertising potential, driven by nothing but excessive greed. Based upon normal supply and demand rules will on average result in reduced prices for Century 21 sellers, simply by the significant reduction in number of potential. Lesson here, absolutely do not use Century 21 in Canada, their stupid myopic greed for a few hundred dollars in shared sales commission could cost you ten of thousands of dollars in reduced sales price.
  
  --
  Chaos - everything, everywhere, everywhen
23. Re:first comment! by Altrag · 2011-10-02 10:10 · Score: 1
  
  We've already got an amalgamated real estate listing service up here in Canada, which is sanctioned and used by basically all of the real estate agents.
  The only people Zoocasa would be benefiting are buyers who use that specific search site and don't bother going through an agent (who would be making recommendations from MLS).
  Whether this battle was worth C21's time and lawyer fees is up for debate, but they aren't standing to lose much from the lack of an unsanctioned rip of their data. Who knows, maybe they were foreseeing some future problem with the situation getting out of control and wanted to stem it off?
  Remember, Rogers is a big greedy corporation too. You don't take on someone like Rogers without a damned good reason.
24. Re:first comment! by Anonymous Coward · 2011-10-02 10:40 · Score: 0
  
  Is this what copyright law was originally intended to achieve?
25. Re:first comment! by mrchaotica · 2011-10-02 13:06 · Score: 1
  
  If you were a real estate agent with a contract to sell a property, then having somebody scrape and repost your ad is just more free advertising for you.
  It's the photographer who should be mad, in the (unlikely) event he had plans to sell copies of those photos for their artistic value or something.
  
  --
  "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
26. Re:first comment! by mini+me · 2011-10-02 13:23 · Score: 1
  
  How then does one interpret the intent of a website owner with respect to user agents? Last time I checked there is no user agent verification service.
  If the owner made their site for Internet Explorer, am I opening myself up for a lawsuit if I visit using any other browser? My browser of choice will not display the website in the same way the owner intended me to see it. Logic would say that I am quite vulnerable as my intent is to circumvent the original software requirements held by the author.
  If you want to use the door analogy, am I breaking the law by visiting Slashdot? I didn't get explicit permission from this website's owners to access their site. I walked in on my own accord and I suspect you did as well.
27. Re:first comment! by mini+me · 2011-10-02 13:26 · Score: 1
  
  A browser is a browser is a browser. The browser in this case was built on the platform of the web. If it were a native software package would you hold a different view?
28. Re:first comment! by maxwell+demon · 2011-10-02 20:02 · Score: 1
  
  A browser is a browser is a browser. The browser in this case was built on the platform of the web. If it were a native software package would you hold a different view?
  No. It doesn't matter what type of software is used. It matters who has access to the result. Put the very same web server on your own computer, with access only to you, and there's no problem at all. (Well, except for the fact that he violated the robots.txt standard; that's something even private people shouldn't do).
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
29. Re:first comment! by mini+me · 2011-10-03 05:04 · Score: 1
  
  How then does Amazon Silk play into this then? It downloads, renders, and caches remote pages in Amazon's infrastructure, sending only the modified and optimized results to the client. just like the service in question. Will web browsing on the Kindle Fire be illegal in Canada?
30. Re:first comment! by rtb61 · 2011-10-03 06:57 · Score: 1
  
  You only list on the amalgamated service when you are ready to give up a piece of your commission and that service also costs money. From the agents point of view, especially the larger ones, they have already scammed the client into accepting a somewhat lower price than the property is worth making it an easier sale and they have a purchaser lined up. Sales agreement signed and the property sold that week, no advertising costs no shared commission.
  Now they have the internet, for them very cheap internal advertising compared to prior costs including shared commission. So they simply got greedy and were expecting their page to sell all of their listed properties directly (web developer likely spun them a grand yarn to get maximum development capital and long running support contract). Rude shock, basically they gained nothing much from the web site, they were basically just keeping up with the competition.
  
  --
  Chaos - everything, everywhere, everywhen
Century 21? by M.+Baranczak · 2011-10-02 03:35 · Score: 4, Insightful

They're a real estate broker. They make money when the property is sold; displaying ads is only a means to selling the property. Why do they object when people are copying and re-publishing their ads?
1. Re:Century 21? by Anonymous Coward · 2011-10-02 03:44 · Score: 0
  
  As a company who has ads out, do you want people to look at your ads only or the ads for you and your competitors? Also, with real estate the commissions on a property sold are split between the real estate agent of the buyer and the seller. Would you want another company to get part of the commission because someone scraped your web sire?
  As the person searching (not just for real estate), I'd rather get results from each company that matters and not hundreds of websites that scraped information, some of which have malware on them. I'd have fewer more unique search results to go through rather than the bloat of sites with identical information that doesn't have what I'm looking for half the time anyway.
2. Re:Century 21? by M.+Baranczak · 2011-10-02 04:04 · Score: 3, Insightful
  
  As a company who has ads out, do you want people to look at your ads only or the ads for you and your competitors?
  And now, people visiting this site will only see ads from C-21's competitors. Well done, guys.
  
  Also, with real estate the commissions on a property sold are split between the real estate agent of the buyer and the seller.
  Completely irrelevant. Just because someone publishes an ad doesn't make them an agent.
3. Re:Century 21? by amorsen · 2011-10-02 07:06 · Score: 1
  
  There is a similar situation in Denmark with the website Boliga. They attempt to list all properties which are for sale along with information about price changes and what the properties were sold for in the past (if that information is public). Real estate brokers and sellers are not particularly interested in buyers getting access to such information and so they are trying various things to block Boliga from providing that service. So far the competition authorities have forced the brokers to not block access to Boliga's scrapers, but this does not extend to photos.
  Of course that means that the independent brokers who are not part of the collusion called Dansk Ejendomsmæglerforening are getting excellent exposure on Boliga, simply because their properties show up with photos...
  
  --
  Finally! A year of moderation! Ready for 2019?
4. Re:Century 21? by jklovanc · 2011-10-02 07:22 · Score: 1
  
  Zoocasa is an aggregating site; they take listing information from a number of realtors and provide listings from all of them. The issue is that Zoocasa took too much information to be considered fair use.
  By linking directly to each listing users do not see the following;
  Century 21 home page and all the information/offers there.
  Other Century 21 listings that may be in a different order than what Zoocasa wants to display them.
  Zoocasa does not display all the information that Century 21 does which may cause a potential buyer to reject a listing he might have looked into further.
  Users will also see listings from other companies next to Century21 and may get the impression that they are from the same company. One of the selling point for real estate agents is brand recognition. By mashing all the companies together that brand recognition is diluted.
  Century21 is not getting important metrics on who is searching for what kind of property. If they can identify a certain kind of property as being popular Century21 can advise their agents to concentrate on getting listings for those properties.With the full listing available to the user on the Zoocasa site they may never touch the Century21 site thereby denying the information that someone was interested in that kind of property. So no search parameter logging and no listing hit count.
  .
  Century21 want people who are looking for or selling real estate to come through their web site from the front page and use their search engine. That will not happen with Zoocasa scraping all the important information.
5. Re:Century 21? by stephanruby · 2011-10-02 12:39 · Score: 1
  
  They make money when the property is sold; displaying ads is only a means to selling the property.
  You're assuming brokers have exclusivity over each property they list. In some jurisdictions, this is a given, but in others, it's not. And I really don't know about Canada? Do you live in Canada?
6. Re:Century 21? by Rysc · 2011-10-03 05:46 · Score: 1
  
  Too bad. I, as a buyer, don't care what Century21 wants. Services like Zoocasa provide value to me. If Century21 doesn't get metrics and I don't follow the process they prefer, tough cookies.
  
  --
  I want my Cowboyneal
7. Re:Century 21? by Anonymous Coward · 2011-10-03 06:05 · Score: 0
  
  That's interesting, and I don't like what Zoocasa is doing, but as a buyer, I don't give a crap what Century21 wants. If I like Zoocasa's listing method better, that's Century21's problem. Most of these companies that want to force users to do something specific, like use their search engine, just don't get it. Like I say, I don't like what Zoocasa's doing, and it may legitimately be copyright infringement, but Century 21 could make a better website and also require a login if they're that concerned about it.
8. Re:Century 21? by jklovanc · 2011-10-03 08:34 · Score: 1
  
  Zoocasa is still in business. They just can not pull the complete listing from Century21. Had they not been such idiots and come to an agreement with Century21 this issue would never have arisen.
  As a buyer you are interested in finding a listing in a certain area with certain criteria. Century21 is interested in creating those listings so you can buy them. If Century21 doesn't know you are looking they the don't know to try to get people to sell. You both lose.
  So you think it is fine that Zoocasa makes millions of dollars of the hard work of real estate companies who spend millions getting the listings, taking photos, writing descriptions and putting them on the net. As the judge stated:
  "[141] I find that Century 21’s Terms of Use constitute a binding contract between the parties, that Zoocasa had actual knowledge of the Terms of Use and in continuing its actions after notice of those Terms of Use, Zoocasa breached those terms."
  Information is not "free" once it is on the net if there are terms of use readily available and/or the company is made aware of those terms. You can not violate terms of use just because you wanted to.
9. Re:Century 21? by jklovanc · 2011-10-03 08:38 · Score: 1
  
  The judge ruled that the presence and subsequent sending of the Terms of service was sufficient to rule that Zoocassa broke a binding contract. Data is not free if it is on the internet; It still belongs to the entity that created it.
  Al Zoocasa had to do was come up with an agreement with Century 21 and this problem would go away. They didn't do that so they no longer can scrape the site. Be an idiot, get kicked out.
10. Re:Century 21? by Rysc · 2011-10-04 01:47 · Score: 1
  
  I simply don't care whether Zoocasa is operating legally or not, making money or not. I'm interested in results.
  Thought experiment: What if Zoocasa asked for permission to relist the listings and was denied. What if Century21 routinely denies all such requests? At that point I want someone to list them anyway, no matter the terms of use, because that's what's good for me. The free-market argument that by doing this eventually Century21 loses business is bogus because houses are not commodities. Exclusive contracts for listings are anti-consumer bullshit.
  More technically, if I write a screen-scraper which sucks up the listings from various places and aggregates them into a format convenient to me... and I use it for myself and never sell or share it, is that copyright infringement? The law would say it's just as illegal as this, but I disagree with that very stongly. Century21 can put the content out there but they have no right to control its presentation. And if I can alter the presentation for myself then another person can do it and provide me that aggeregation service, so therefore damn the legalities and full speed ahead.
  
  --
  I want my Cowboyneal
let me be the first to say... by Cyko_01 · 2011-10-02 03:36 · Score: 1

DUH!
1. Re:let me be the first to say... by Anonymous Coward · 2011-10-02 03:52 · Score: 0
  
  no.
Limiting damages -- Great idea by Anonymous Coward · 2011-10-02 03:45 · Score: 0

The US has four per cent of the world's population and eighty per cent of the world's lawyers. Needless to say, they're all pushing up the size of pay outs. Canada, you keep on with those damage limits. The legal profession in the US has become a guild taxing the general population for no valid reason.
1. Re:Limiting damages -- Great idea by Anonymous Coward · 2011-10-02 03:51 · Score: 0
  
  It's not like it's easy to avoid the legal system in the US.
2. Re:Limiting damages -- Great idea by Anonymous Coward · 2011-10-02 03:52 · Score: 0
  
  No. World population is ~ 7,000,000,000 and the U.S. population is ~ 300,000,000
  that is ~ 23%
3. Re:Limiting damages -- Great idea by schnikies79 · 2011-10-02 03:55 · Score: 1
  
  Umm, no. GP is correct, though the United States is the 3rd most populous county, behind India and China.
  
  --
  Gone!
4. Re:Limiting damages -- Great idea by Lunix+Nutcase · 2011-10-02 03:57 · Score: 2
  
  *facepalm. 300,000,000/7,000,000,000 = .0428 or about 4.3%. 23% of 7 billion is about 2.2 billion, idiot.
5. Re:Limiting damages -- Great idea by canadian_right · 2011-10-02 04:01 · Score: 1
  
  It isn't the lawyers, it is silly juries awarding the outrageous amounts, plus your law does not cap punitive awards. In Canada the punitive part of the award cannot exceed $300,000. The rest of the award is to "make you whole" that is, undo the damage done by the party who lost. So the most you can get is what you lost, future losses, and $300,000 in cases of extreme suffering.
  
  --
  Anarchists never rule
6. Re:Limiting damages -- Great idea by Anonymous Coward · 2011-10-02 04:02 · Score: 0
  
  Let me guess - from your facility with maths, you are an American, yes?
Not copyrightable in the US by kabloom · 2011-10-02 04:12 · Score: 0

IANAL, but in the United States these real estate ads would not be copyrightable because they amount to a collection of facts.
1. Re:Not copyrightable in the US by Lunix+Nutcase · 2011-10-02 04:20 · Score: 3, Informative
  
  Yes, you are definitely not a lawyer since you, as with most Slashtards, misunderstand what you are talking about. Yes, the facts themselves can not be copyrighted but the expression on those facts can be copyrighted. Which is why I can take the info from the phone book and publish them myself but I can't take someone else's phonebook, copy all the pages and then republish it as my own work because that expression of those facts are copyrighted.
2. Re:Not copyrightable in the US by Anonymous Coward · 2011-10-02 04:29 · Score: 0
  
  IANAL, but in the United States these real estate ads would not be copyrightable because they amount to a collection of facts.
  
  The photographs are copyrightable, for one.
3. Re:Not copyrightable in the US by isilrion · 2011-10-02 06:30 · Score: 1
  
  Which is why I can take the info from the phone book and publish them myself but I can't take someone else's phonebook, copy all the pages and then republish it as my own work because that expression of those facts are copyrighted.
  I don't understand the difference between those scenarios. The final result is the same - both contain the same information, in the same order. I just don't see where to draw the line between "copying the facts in the phone book" and "copying the phone book". At the font and page design, perhaps? That seem awfully narrow...
  If anything, in this case, I suspect that the "expression" of those facts is very different - the same set of facts (real state listings) in a different website design, probably without any inherent order and hopefully mixed with "facts" from other sources. I know, IANAL, I'm just trying to figure out why, if what can be copyrighted is the expression and not the facts, getting a final result almost exactly like the original work (phone book case) is more acceptable than aggregating several sources and presenting it in a new design (this case).
4. Re:Not copyrightable in the US by maxwell+demon · 2011-10-02 07:07 · Score: 1
  
  The images of the houses are definitely copyrightable.
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
5. Re:Not copyrightable in the US by jklovanc · 2011-10-02 07:28 · Score: 5, Informative
  
  The decision was quite specific on this. The issue was not about the facts such as address, number of bedrooms, floor space, etc. It stated quite clearly that those were not copyrightable. The issue was the description of the property which are impressions done on prose and the pictures. Both of these a copyrightable.
6. Re:Not copyrightable in the US by Anonymous Coward · 2011-10-02 07:38 · Score: 0
  
  I just don't see where to draw the line between "copying the facts in the phone book" and "copying the phone book".
  If, for some reason, you wanted to use a small subset of names and numbers from the phone book, such as for notable people that live in the same area, that wouldn't be a copyright violation (there'd probably be privacy laws to mind, though).
  If you started just took down every name/number in a phonebook and sold your own copy of that list, that'd be a copyright violation. You're supposed to get the phone numbers yourself, not rely on the other company doing the hard work and profiting of the effort.
  And if you actually made your own phone book my compiling the names/numbers yourself, there WOULD be a difference. Regularly there are fake entries put into the listings, to catch others who copy them. It's an awfully big coincidence if your competitor have the same exact people, who don't exist, in their phone book that your phone book does.
7. Re:Not copyrightable in the US by Anonymous Coward · 2011-10-02 07:57 · Score: 0
  
  The decision was quite specific on this. The issue was not about the facts such as address, number of bedrooms, floor space, etc. It stated quite clearly that those were not copyrightable. The issue was the description of the property which are impressions done on prose and the pictures. Both of these a copyrightable.
  Please someone, mod the parent up.
8. Re:Not copyrightable in the US by ortholattice · 2011-10-02 08:14 · Score: 1
  
  If you started just took down every name/number in a phonebook and sold your own copy of that list, that'd be a copyright violation. You're supposed to get the phone numbers yourself, not rely on the other company doing the hard work and profiting of the effort.
  
  As I understand Feist v. Rural (ianal) you could sell your own copy of the list, although reformatting it (so as not to violate copyright on a creative layout) might be a good idea. You couldn't sell a copy of the cover, prefaces, or end material.
  The "hard work" of collecting facts is irrelevant to copyright law. This is probably one of the most common misconceptions of copyright law, that it exists to protect "hard work". The only thing that is relevant is the extent to which creative expression is involved.
  On the other hand, this:
  
  Regularly there are fake entries put into the listings, to catch others who copy them.
  
  brings up an interesting point, since arguably fake entries are creative. On the other hand, they are also fraudulent if the phone company represents the list as factual. So maybe a suit/countersuit for copyright/fraud would cancel each other? :)
9. Re:Not copyrightable in the US by xelah · 2011-10-02 09:56 · Score: 1
  
  That's not true in many countries other than the US, such as in the EU, where there is a database right. Maybe not relevant for Canadian property adverts, unless they have a database right too, but perhaps for a whole lot else given that this is the Internet and you could be sued anywhere. Scraping a whole database full of property ads might well be illegal here even if you rewrite the text and get a five year old to draw artist's impressions of the houses. You could take a one or two, but not the whole thing.
10. Re:Not copyrightable in the US by Lunix+Nutcase · 2011-10-02 10:11 · Score: 1
  
  That's not true in many countries other than the US
  Yes, and we were talking about the US hence why the title of this thread is "Not copyrightable in the US" (emphasis added).
11. Re:Not copyrightable in the US by Lunix+Nutcase · 2011-10-02 10:13 · Score: 1
  
  I don't understand the difference between those scenarios.
  
  What is that hard to understand? Facts themselves are not copyrightable but the work that presents those facts is copyrightable. Hence why you can copy all the names and numbers from a phone book but you can't copy the pages exactly and pass that off as your own because that is an expression of the facts which can be copyrighted.
12. Re:Not copyrightable in the US by isilrion · 2011-10-02 10:35 · Score: 1
  
  Did you read the rest of my post? In the sentence right after that one I said why I didn't see a difference: because the end result would be the same, a phone book with exactly the same contents as the original. And then I went on to ask why would copying factual information from one website and presenting it in another with a completely new design was worse than copying the data from a phone book and publishing it. The web scraping scenario even has a creative component missing from the phonebook-entry-copying scenario. Btw, I was not being antagonistic.
How is this different than Google News? by Frosty+Piss · 2011-10-02 04:13 · Score: 1

How is this different than Google News? When sites bitch about Google News, I believe Slashdotter's call them greedy and spout "fair use"...

--
If you want news from today, you have to come back tomorrow.
1. Re:How is this different than Google News? by DarwinSurvivor · 2011-10-02 04:50 · Score: 1
  
  Google news add functionality by allowing you to search through articles, it then LINKS to every article it "scraped". Google news is closer to a search engine than a scraper.
2. Re:How is this different than Google News? by Frosty+Piss · 2011-10-02 05:24 · Score: 1
  
  I imagine that those "additions" make Google News more palatable to the "fair use" folks, but to the content owners it's exactly the same thing.
  
  --
  If you want news from today, you have to come back tomorrow.
3. Re:How is this different than Google News? by icebraining · 2011-10-02 06:47 · Score: 1
  
  The real difference is that unlike these shady scrappers, Google News has a very easy way for content owners to block it; its spiders respect the robots.txt directives. And you don't have to block Google Search, there's a specific User Agent just for Google News.
  Now ask yourself why the 'content owners' don't block it. Hint: it's because they, unlike the content owner in this story, don't actually want to block it.
  
  --
  Dilbert RSS feed
Century 21 by EdZ · 2011-10-02 04:14 · Score: 1

Century 21? What is Gerry Anderson doing selling properties in Australia?
1. Re:Century 21 by Anonymous Coward · 2011-10-02 05:42 · Score: 0
  
  Century 21? What is Gerry Anderson doing selling properties in Australia?
  I don't know, but I wouldn't want to piss off Thunderbirds, Stringray *and* the indestructible Captain Scarlet...
Cut off internet access for"Rogers Communications" by RichMan · 2011-10-02 04:25 · Score: 3, Informative

Why not treat them like and end user and shut off internet access for the "user".
I would like to see "Rogers Communications", the runner of the site lose access to the internet.
Some how I don't think the court would do that to one of the top ISP's in Canada.
Yet we are likely to see the courts cut off internet access for small end users. Why are the laws not equal? Corporation got people status, but it seems corporations are way above people status when it comes to some laws. That is not fair. And if it would be unfair to cut off access for a corporation then it should be unfair to cut off access for a person.
Bye bye www.google.ca by Anonymous Coward · 2011-10-02 04:30 · Score: 0

I guess with this precedent, Google has to close its service on Canada.
Not only linking, scraping, page previews, google images... just imagine the damages!
1. Re:Bye bye www.google.ca by bmo · 2011-10-02 04:53 · Score: 2
  
  I think that Google should just blackhole Canada if this ruling stands.
  See, back a while ago, the newspapers in Belgium sued Google for copyright infringement, and Google was told by the court to take down the content or face a big fine, per day.
  So they did took it down.
  Suddenly the Belgian newspapers were screaming bloody murder because they weren't getting hits.
  Go ahead, "content creators", kill indexing and searching. Bring it back to the old days of no search engines. I dare you.
  --
  BMO
2. Re:Bye bye www.google.ca by bmo · 2011-10-02 05:02 · Score: 1
  
  >third sentence.
  I cannot into English.
3. Re:Bye bye www.google.ca by realityimpaired · 2011-10-02 05:39 · Score: 1
  
  Google does not scrape the entirety of a site and put it up on their scraped search... they take a small blurb for you to read, and link to the original site. They do cache sites, but you have to look for the cache link, and most people aren't even aware that they exist. Very different animal.
  I suspect that Google isn't in any danger here, because rather than taking business away from the sites they index, they are actually driving more business *to* those sites.
4. Re:Bye bye www.google.ca by maxwell+demon · 2011-10-02 07:11 · Score: 1
  
  From what I understood from TFA, Zoocasa didn't comply with the robots.txt standard, and that was a point considered in the ruling. Since Google does comply with the robots.txt standard, I don't think this ruling can be directly transferred to Google.
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
Re:Cut off internet access for"Rogers Communicatio by Almost-Retired · 2011-10-02 04:34 · Score: 2

I agree with this sentiment. A weeks internet black hole for Rogers Comm. would send a message their greedy board members will never forget.

--
Cheers, Gene
Re:Cut off internet access for"Rogers Communicatio by JimCanuck · 2011-10-02 04:55 · Score: 1

Not really, only the subscribers to Rogers would suffer, as they'd use it as a excuse to charge a "Government blocking fee" and increase their profits for that month.
The scraper is the one that makes no sense by msobkow · 2011-10-02 05:05 · Score: 1

Century 21 usually has exclusive listing contracts, which means that no one else can sell the house for commission without getting sued out of their commission. I don't see the profit for the scraping company.

--
I do not fail; I succeed at finding out what does not work.
1. Re:The scraper is the one that makes no sense by Jane+Q.+Public · 2011-10-02 12:48 · Score: 1
  
  Century 21's (or anybody's) private contracts cannot obligate 3rd parties without their consent. If I wanted to sell a house that Century 21 has listed, for commission or not, there isn't a damned thing they can do about it... unless the contract is one of exclusivity, with the owner. In which case they could sue the OWNER, but not me. I still haven't done anything wrong or illegal.
Port scan by tepples · 2011-10-02 05:06 · Score: 1

and often 10+ seconds to get the damned preview back from /.
The ten-second delay is Slashdot port-scanning your IP address for common open proxies and waiting for connections to time out. You get it on a given IP only once every 24 hours. I've been told that if you set your firewall to refuse the connections instead of letting them timing out, the scan will run faster.
1. Re:Port scan by Anonymous Coward · 2011-10-02 06:52 · Score: 0
  
  Fascinating!
  "open proxy" : http://en.wikipedia.org/wiki/Open_proxy
  From that Wikipedia article, I learned that some online services attempt to determine if incoming connections are from "open proxies" using software like "proxycheck" (which I see has been around for Unix/Linux since 2004 or earlier, but which the original author vows to port to "winbloze"; stay classy, Michael Tokarev! LOL).
  OK, so let's call a client machine "C" (e.g., my home computer), and a proxy machine "P", and the machine running the service "S" (e.g., Slashdot's web server). If "P" were an *open* proxy, then any client "C" could connect to "P" and specify that service requests should be forwarded to an arbitrary service "S". But, if "P" were truly "open", then a request for the forwarding feature could come from anywhere, including "S". So, software like "proxycheck", running on "S", simply attempts to connect to "P" with the assumption that "P" is indeed an "open proxy", and confirms this assumption by using "P" to forward a message back to "S"! If "P" is not an *open* proxy, or if there is no "P" (because "C" connected *directly* to "S"), then the only indication might be a lack of a response after many seconds (hence the need to wait for connections to time out).
  OK, so I think I understand the concept. But I don't understand the motivation.
  Open proxies can unknowingly be created on any computer using viruses. An open proxy can then be used to obfuscate the true origin of a request (e.g., an HTTP request, or a SMTP request, etc).
  Is it because Slashdot is trying to reduce comment spam by throttling comments by IP addresses? I can see how using a series of open proxies might enable a person to post many more comments than if they were communicating directly with Slashdot from a single IP address.
  But both Tor and convention non-open bot nets would both be able to wage denial of service attacks or comment spam attacks without being stopped by a check for open proxy services from the originating connection IP address.
2. Re:Port scan by Almost-Retired · 2011-10-02 07:26 · Score: 1
  
  I wondered why, by the time I had posted 3 times, the response was much faster. thanks for that inofo.
  
  --
  Cheers, Gene
3. Re:Port scan by Richard_at_work · 2011-10-02 21:45 · Score: 1
  
  Hold on, so we are being punished by Slashdot for Slashdot updating to a shittier interface?
  Fucking ridiculous!
4. Re:Port scan by tepples · 2011-10-03 00:22 · Score: 1
  
  The allegedly "shittier interface" has nothing to do with this. The same delay for a port scan happens the first time you preview or post using the classic posting interface, which you can reach by middle-clicking "Reply to This" on Firefox.
Texas high school graduate by Anonymous Coward · 2011-10-02 05:10 · Score: 0

Wow, I bet you graduated from high school in Texas, with the new curriculum, with flying colors.
Only $250 per picture? by Rehnberg · 2011-10-02 05:13 · Score: 1

Seems WAY more reasonable than US copyright court.
1. Re:Only $250 per picture? by msobkow · 2011-10-02 05:45 · Score: 1
  
  One reason we don't have many (if any) MPAA/RIAA "copyright infringement" lawsuits up here is that our courts award sane damages based on real lost revenue and injuries. Which is to say, ONE lost sale.
  
  --
  I do not fail; I succeed at finding out what does not work.
2. Re:Only $250 per picture? by Sqr(twg) · 2011-10-02 06:16 · Score: 1
  
  The same is true where I live. From a photographer's perspective, it sucks. Some companies steal pictures all the time. When they get away with it, they pay nothing. When dragged before court, they only pay what they should have paid in the first place.
3. Re:Only $250 per picture? by maxwell+demon · 2011-10-02 07:31 · Score: 1
  
  The same is true where I live. From a photographer's perspective, it sucks. Some companies steal pictures all the time. When they get away with it, they pay nothing. When dragged before court, they only pay what they should have paid in the first place.
  Don't they additionally have to pay court costs?
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
4. Re:Only $250 per picture? by Sqr(twg) · 2011-10-02 17:16 · Score: 1
  
  No necessarily, no. Under Swedish law, the looser pays the winner's court costs, but if, for example, the company that stole the picture agreed to pay some ridiculously small amount from the start, then the court may decide that there is no winner or looser, and each part pays his own costs.
  The court may even decide that the photographer asked for too much, so he should pay the costs of the company that stole the picture, after they pay whatever the court thinks is reasonable for the photo.
  These laws are great if the MAFIAA sues you for a billion because you downloaded one movie, but as I said: It sucks if you're the copyright owner.
Good to know. by isilrion · 2011-10-02 06:18 · Score: 2

Last time I was looking for a place to rent, I felt tempted to do something like this. There are several rental websites out there, and you are lucky if their listing overlap... Comparing locations, prices, what was being offered, etc, was a pain. Some sites would at least let you right-click and open in a new tab, but others wouldn't. The maps, sometimes they were google's, sometimes they were bing, and they would never let you overlay the public transit lines on top... And instead of letting you chose a location and radius on the map, some would ask you for the postal code!
So, I toyed with the idea of scraping those sites and build my own database, and build a website from it for others like me (probably sticking some google ads to try to pay for the hosting). I guess it is a good thing I didn't. And it is a shame to know that I can't (and no one else can either). I don't like Rogers and part of me is smiling about this ruling... but if this means that there will be no "google news"-like service for rent hunting, this is another case of copyright preventing useful services from coming to life.
1. Re:Good to know. by jklovanc · 2011-10-02 07:39 · Score: 1
  
  You can do something like that. You just have to comply with the Robot Exclusion Standard so that a site that does not want you to index them can make that call. You also can not copy and save too much of each listing.
2. Re:Good to know. by witherstaff · 2011-10-02 10:25 · Score: 1
  
  How would that be any different than what Google or any other search provider does? They scrape a website and retain all the data and return it in a variety of ways. As long as robots.txt doesn't reject your crawler.
3. Re:Good to know. by isilrion · 2011-10-02 10:57 · Score: 1
  
  Because, for a site like I mentioned to be useful, it would have to display lots of information about the listings, not merely point to the original website with an excerpt of the content. Think a "comparison" website. I don't know about you, but I think that displaying the information side by side is much more useful than going back and forth between different tabs, specially if the formatting of those tabs is different. But that would fall completely in the "scrap and display the factual data with a new formatting" category.
  Re: the robots.txt, I didn't even think about that. Another reason to not go through the trouble of building the site: if a couple of the listings websites "opt out" of being crawled, I'm back to having to search in different websites and do the comparisons by hand. I agree with obeying robots.txt, so that means that I couldn't do it even if it was only for my private use and I didn't publish the website. It makes little sense to go through the trouble of building an incomplete database for comparison and then having to manually search the sites that opted out.
4. Re:Good to know. by Jane+Q.+Public · 2011-10-02 12:58 · Score: 1
  
  Actually, you probably don't have to follow the standard, or anything even close. At least in the US.
  
  First off, in US law it has (finally) become pretty clear that not obeying TOS is not, in itself, a crime. Ever-higher courts have ruled on that issue and Congress has a pending bill spelling that out in plain English.
  
  Second, again in the US, tabular data cannot be copyrighted. A recent case involving phone books pretty much nailed the lid on that one. Real estate listings are nothing if not "tabular data"... except for the pictures. Copying the layout of somebody's real estate page, or their pictures, would probably be a violation of copyright. But the data? Hell no.
  
  The upshot is: as long as you only copy the data, not the layout or pictures, you are perfectly withing the law, in the United States.
  
  I have to further question this Canadian decision because I did read the court decision, and they didn't even get the definition of "scraping" right. They concluded that it was a form of indexing, which is simply false. Scraping is simply the gathering of data, whether it is then used for indexing or any other purpose.
5. Re:Good to know. by Jane+Q.+Public · 2011-10-02 13:03 · Score: 1
  
  "... within the law..." Sheesh. Typos.
  
  But this also needs to be stated: contrary to the title of this thread, scraping per se was not found to be the illegal act here anyway: reproducing someone else's site was.
6. Re:Good to know. by wvmarle · 2011-10-02 19:10 · Score: 1
  
  Some sites would at least let you right-click and open in a new tab, but others wouldn't.
  Why right click when ctrl + left click does the job just fine? (that's in Firefox at least). And FF also allows you to switch off suppression of the right-click menu. Though it's of course irritating behaviour of those sites.
7. Re:Good to know. by wvmarle · 2011-10-02 19:18 · Score: 1
  
  Replying twice but totally different comment :)
  Interesting how markets differ. In Hong Kong I've never used web sites to find homes for rent: we just went to an area, walked around the shopping malls, and asked the agents directly on what's available. They always post numerous offers on the window giving you a good idea on what's available in the area and what prices you have to think of. And usually they can show you some flats right away, just walking around the area.
  My current home we found through a small flyer posted to a lamp post. That's how some agents advertise too, this was an agent that was doing village houses so more spread out, having a much larger action radius than agents handling high-rises. This agent then showed us around by car, and that included the place we live now.
  Larger agents have web sites listing properties, that we only used as rough indication and to look for general areas. They sometimes come with photos, though mostly just a floor plan.
8. Re:Good to know. by jklovanc · 2011-10-02 21:45 · Score: 3, Informative
  
  The copyright issue was not about the facts about the listing; address, floor space, bedrooms, etc. There were two components that were copyrightable; the pictures and the description. Here are some reveant clauses from the decision.
  [185] The property descriptions describe particular real properties. They are created to market the property to potential buyers. It is apparent they are written for each property in a manner to highlight the positive aspects of the properties. There is also the evidence of Bilash and Walton that there is some level of skill involved in writing an effective property description. I am satisfied that the property descriptions are the product of skill and judgment. As a result they meet the threshold for copyright protection.
  [204] The continued copying of the entire property description to the Zoocasa server is a violation of copyright. The truncated versions of the property description in my view do not infringe copyright as they do not meet the criteria for substantial copying sufficient for copyright infringement.
  [205] With respect to the photographs, Zoocasa was not merely copying a thumbnail image as in the case of Kelly v. Arriba Soft Corporation, 336 F. 3d 811 (USCA, 9th Circuit, 2003), but rather the entire photograph. This was a clear violation of copyright.
9. Re:Good to know. by jfine · 2011-10-03 01:11 · Score: 1
  
  You can do this, just contact the site owners, setup business relationships, then pay them. Some might give it to you for free (for exposure), some might want a split of revenue (50/50 is pretty typical on ads), and some might want a crazy high license fee. At the end of the day you might wind up with an actual business rather than just a little side project that helped you find your next place.
10. Re:Good to know. by Anonymous Coward · 2011-10-03 03:53 · Score: 0
  
  You are somewhat incorrect on how this was working.
  Rogers was presenting the results AS THERE OWN, they were stripping any ads from the century21 version of the site (potentially denying century21 of ad revenue, or at least directing searchers to other century 21 listing). If you built a site that MIRRORED information off of century21 including all ancillary information (perhaps by embedding the whole century21 page for that location in a frame), used your enhanced search methodology (provided 'how close to a major transit line, placed all locations on google maps, etc), but when the actual listing was 'clicked on' (for more information) it took you to the century21 site, I do not believe you would be found to have violated the copyright of century21 like rogers did for this setup.
11. Re:Good to know. by Jane+Q.+Public · 2011-10-03 04:56 · Score: 1
  
  That is true; descriptive text probably cannot be characterized as "tabular data". But the rest can: address, basic home style and information about it (square feet, bathrooms, etc.), price, and so on.
12. Re:Good to know. by jklovanc · 2011-10-03 08:43 · Score: 1
  
  First off, in US law it has (finally) become pretty clear that not obeying TOS is not, in itself, a crime. Ever-higher courts have ruled on that issue and Congress has a pending bill spelling that out in plain English..
  Sorry but you missed the point. The bill just means the someone can not be arrested and charged by the state with a crime that will go on their criminal record. They still can be sued in civil court for damages, breach of contract, etc. It becomes a civil matter rather than a criminal matter.
13. Re:Good to know. by jklovanc · 2011-10-03 08:45 · Score: 1
  
  As long as robots.txt doesn't reject your crawler.
  There is the crux of the situation. Zoocasa would not give Century21 the information that could put into their robots.txt file to stop the scraping.
14. Re:Good to know. by Jane+Q.+Public · 2011-10-04 12:39 · Score: 1
  
  No, YOU missed MY point. Just as I stated the first time, and I will repeat: in the U.S., a violation of TOS was already a civil matter, not a crime. And a pretty darned minor civil matter, at that. Generally the "penalties" are no more than the cost of the service over the disputed period.
  
  What was unclear to you about that?
  
  My second point was that if you copy only tabular data, not the verbatim description or pictures -- again, in the U.S. -- it isn't even a civil matter. It is perfectly legal. You can even re-publish the information for profit, with no legal repercussions.
  
  And that is why I brought up the Canadian court's mis-definition of "scraping". Because scraping is NOT "a form of indexing", as the court asserted. It is merely the gathering of data. Indexing or whatever else you do with that data is a separate operation from the gathering. I should know; I've been writing scraping programs professionally for years.
15. Re:Good to know. by jklovanc · 2011-10-04 19:24 · Score: 1
  
  You need to take a look at the Computer Fraud and Abuse Act (CFAA). A violation of this act is a criminal matter. There have been a number of criminal cases brought by the government for TOS violations. Take a look at this article. http://econsultancy.com/us/blog/6189-can-terms-of-service-turn-you-into-a-criminal. How about this article http://www.onthemedia.org/blogs/on-the-media/2011/sep/28/senate-advocates-terms-of-use-reform-computer-fraud/. Note this quote
  "Late last week, the Senate Judiciary Committee approved an amendment to the law that specifically decriminalizes terms of service violations." How do you "decriminalize" something that in not currently criminal? Here is the EFF article about the amendment. https://www.eff.org/deeplinks/2011/09/senate-committee-agrees-violating-terms-service-shouldnt. Here is the amendment. http://judiciary.senate.gov/legislation/upload/JEN11A19-Grassley-Franken.pdf
  Your second point is moot because the ruling clearly states that the tabular data was not copyrightable and focused on the descriptions and pictures which are.
  You are confusing the two aspects of the case; breach of contract and copyright infringement. The scraping was a breach of contract as it contravened the web site TOS which the court ruled is a binding contract. The display of copyrightable descriptions and pictures was ruled copyright infringement. These are separate issues in the case.
16. Re:Good to know. by Jane+Q.+Public · 2011-10-05 19:23 · Score: 1
  
  Obviously you aren't familiar with the many recent court cases that have consistently ruled that violations of TOS are not criminal.
  
  I have confused nothing. YOU have confused the act of taking someone to court, with the violation of criminal statutes.
  
  As I stated earlier, what the new statute does is make it explicit, rather than leave it up to the courts, that violation of TOS is not criminal, which will avoid many malicious prosecutions. But just as I stated previously: while people have been taken to court over the issue, US courts have consistently ruled that violation of TOS is not a criminal act. You only have to look the cases up if you don't believe me.
  
  And as for the two aspects of the case, I did not confuse those, either. Read this thread back up to my original post, and remember what it was about. It is you who have changed the subject, not me.
17. Re:Good to know. by Jane+Q.+Public · 2011-10-05 19:31 · Score: 1
  
  Further, my statements about TOS had absolutely nothing to do with the CFAA. That is a straw-man argument, and a pretty silly one at that.
  
  You have confused prosecutorial abuses of the law with what the courts have already decided the law actually is. As *I* stated before you did, the pending legislation merely makes what the courts have already decided explicit, which should avoid further such prosecutorial abuses..
18. Re:Good to know. by jklovanc · 2011-10-07 12:49 · Score: 1
  
  First off, in US law it has (finally) become pretty clear that not obeying TOS is not, in itself, a crime. Ever-higher courts have ruled on that issue and Congress has a pending bill spelling that out in plain English.
  I never said violation of TOS was a crime but breach of contract is an actionable item in civil court even in the US. If you break the TOS in the US you can be sued for breach of contract in the US. Here is the relevant citation concerning Brows Wrap Agreements.
  
  "[93] In Register.com, Inc. v. Verio, Inc., 126 F. Supp. 2d 238 (Dist. Court S.D.N.Y. 2000), aff’d 356 F.3d 393 (2d Cir. N.Y. 2004), Register.com’s website contained their Terms of Use which stated that if the user accessed the database then the user agreed to the terms. The defendant submitted that simply making a query of the database was insufficient to indicate their consent. Register.com involved commercial parties where the defendant Verio accessed the plaintiff’s computers daily and saw the Terms of Use each time they did so and admitted that they were aware of the terms. Verio conceded that its use of the data for solicitations by mail and telemarketing breached Register.com’s Terms of Use. Registrar.com notified Verio that they were in breach of Register.com’s Terms of Use. Verio argued that it was not bound because the notice was provided after the transaction occurred, not before. The United States Court of Appeals for the Second Circuit found that this argument would only succeed if Verio accessed Register.com’s computers infrequently. The court held that Verio had notice of the terms because of its numerous daily queries and the presence of the Terms of Use after each query."
  Notice that is an American case. Zoocasa was frequently accessing Century21 data and had been advised of the TOS. The court ruled there was a binding contract and Zoocasa breached that contract. A similar decision based on the same precedent would probably happen in the US.
hmm.. by Bengie · 2011-10-02 06:42 · Score: 1

" the court here awarded $1,000 (one thousand dollars) for breach of the Century 21 website's Terms of Use"
We need an "Internet Terms of Use". "Anything on the internet that was meant to be accessible by the public is automatically public domain.
1. Re:hmm.. by amorsen · 2011-10-02 07:08 · Score: 1
  
  We need an "Internet Terms of Use". "Anything on the internet that was meant to be accessible by the public is automatically public domain.
  Are you sure that is what you want? It would mean that you could not apply the GPL to downloadable software.
  I'm an advocate of abolishing copyright so I'm all in favour of course.
  
  --
  Finally! A year of moderation! Ready for 2019?
2. Re:hmm.. by maxwell+demon · 2011-10-02 07:21 · Score: 1
  
  That would break a lot of things. For a start, Linux (being GPLed, not public domain) would probably no longer be available on the internet (at least not legally).
  Note that public domain includes the possibility for others to take that work and put their name on it instead of yours. I think few people would like that to happen to their work.
  And given the craziness of the U.S copyright law, I'd not even be surprised if then that other person could gain copyright on that work and order you to take your work, which now is infringing on their copyright, off the net.
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
3. Re:hmm.. by Bengie · 2011-10-02 07:52 · Score: 1
  
  Nah, I didn't really think it through, but it sounds nice to me :P
  I bet it would be more good than bad.
4. Re:hmm.. by bk2204 · 2011-10-02 09:52 · Score: 1
  
  Anything on the internet that was meant to be accessible by the public is automatically public domain.
  Uh, I don't think that's a good idea. It means that all downloadable software would be in the public domain. It would effectively prevent anyone from putting any sort of creative work online (written, photographic, etc.). The same goes for documentation, news stories, comics, useful web apps, etc. It would probably result in a much more closed web where everybody had to sign up to every site in order to just access it.
My house is up for rent, I still live there by Anonymous Coward · 2011-10-02 06:52 · Score: 0

Dutch real-estate sites often copy their own content, and this goes in a circle.
I've moved into my rental apartment months ago, but on many sites it's still listed as being "IMMEDIATELY AVAILABLE".
I still get the odd person ringing my doorbell and asking if he could have a look at the apartment.
Effort != Creativity != (c) by Mathinker · 2011-10-02 06:59 · Score: 1

Your post shows just how badly misunderstood copyright law is nowadays. The mere fact that someone puts in a lot of effort to make a web page does not in and of itself mean that that web page is protected by copyright. Many things which require hard work do not qualify for protection under copyright law. Two examples which immediately come to mind are facts (like telephone book listings) or creative works which are considered utilitarian (like clothing/fashion designs, or individual cookbook recipes).
I haven't reviewed the facts of this case, so I don't know whether I would believe copyright was infringed (and what I believe makes no difference, what the court believed is what determines that). And I might well believe copyright wasn't infringed but the copying was immoral (rather than illegal). But I do know that the underlying message of your post: "if my work was copied I would be pissed off so it must be illegal", is just adding to the general confusion over an already complex and contorted branch of the law.
1. Re:Effort != Creativity != (c) by Almost-Retired · 2011-10-02 08:00 · Score: 2
  
  I like your sig. ;)
  
  Text, scraped and copied might be gotten away with, but if I took the photo, the copyright is mine and you must negotiate the use of a copy of that photo with me. I have been rather intimately involved with that aspect of photography since the late 40's, and have even won a suit for the theft of my work to the tune of ten grand back in about 1982. That part of copyright law is, at least here in the states, air tight. OTOH, if I post it, I believe that one can link to my picture, but to copy that link and store it on your machine is a clear cut violation.
  
  --
  Cheers, Gene
2. Re:Effort != Creativity != (c) by Anonymous Coward · 2011-10-02 15:11 · Score: 0
  
  >I like you sig. ;)
  What was it? I can't see it as it seems to have expired.
Re:Cut off internet access for"Rogers Communicatio by Almost-Retired · 2011-10-02 07:24 · Score: 1

And would not those subscribers/customers who are doing legit business over the web, and who might suffer many thousands in damages from the disconnect, have standing to sue Rogers for their loss of connectivity when it is not their fault, but Rogers actions that got them disconnected?

It seems to me that that the sauce used for the goose should work equally well for the gander. :)

--
Cheers, Gene.
I run small e-cigarette price comparison site... by Anonymous Coward · 2011-10-02 09:42 · Score: 0

...and this is greatly upsets me. Started it because It was too tiresome to check prices\search for items manually. VapeGrabber ( http://vapegrabber.com ) is based entirely on scraping at this point of time and I'm wondering if something like this could happen to me.
robots.txt paragraph by Anonymous Coward · 2011-10-02 11:55 · Score: 0

relevant robots.txt paragraph from the decision:

[244] Century 21’s Website has a robot.txt file and uses it to indicate to specific search engines that Century 21 does not want them to index the Century 21 Website. However, Century 21, through its website operator, has not been able to use its robot.txt file to block Zoocasa because it does not know the name of Zoocasa’s robot or spider. That information is required before a site can be blocked. Zoocasa has refused to provide the name of its robot. On this application Zoocasa indicated for the first time that it does not currently have a “branded” Internet robot. Zoocasa has not explained why it does not given how widespread the standard is in the industry.
Bullshit .. by roguegramma · 2011-10-03 00:32 · Score: 1

If anyone was copying your website 1:1, that is assuming you are competent enough to have one, and just replaced your name with his, you would be up screaming "he stole my website".
Copying a little from many makes a doctoral thesis or a fiction book, copying from one without attribution, is plagiarism.
Also, the robots.txt is an established means to indicate that you do not want your content to be copied completely.
Weakening the status of the robots.txt is stupid, exactly because it allows publishers who are too stupid to set up a robots.txt, to claim that someone violated their "Terms of use". If you took the time to scan the court decision, you would find that the defendant willingly violated the standards surrounding the robots.txt file, in particular, it failed to provide a robot name.

--
Hey don't blame me, IANAB