Wikipedia's Content Ripped Off More Egregiously Than Usual
Ultraexactzz writes "Wikipedia's content is licensed under the GFDL, which permits such content to be copied with attribution — and Wikipedia is used to its content being copied and mirrored. However, a new website at e-wikipedia.net appears to have taken this a step further by mirroring the entire English Wikipedia — articles, logos, disclaimers, userpages, and all. Compare Wikipedia's About page with e-wikipedia.net's. The site even adds to Wikipedia's normally ad-free interface by including text ads." Just try logging in or actually editing an article, though, and you'll get the message "The requested URL /w/index.php was not found on this server. Additionally, a 404 Not Found error was encountered while trying to use an ErrorDocument to handle the request." If there's credit here, I don't see it — sure looks like it's intentionally misleading readers.
This is perfect! Next time a teacher or other person in authority says I can't use Wikipedia because it is unreliable I just get the content from this site and I can say that it wasn't Wikipedia!
Taxation is legalized theft, no more, no less.
C'mon people - this story is a dupe. I just saw the exact same discussion on e-slashdot.org!
When you have nothing left to burn you must set yourself on fire
You mean... someone is taking information freely available on the internet and claiming it as their own for profit reasons? My word, what a shocking turn of events!
Yeah! Weren't those the days? There were an awful lot of "CmdrTaco wins Nobel Prizes in Medicine, Peace, and Chemistry" posts, though.
The sauce is under GFDL. E-Wikipedia is also under GFDL. I don't see the problem.
(rot13) rpbzbab@tznvy.pbz
Brought to you by the creators of Limbo of the Lost.
Spelling mistakes, grammatical errors, and stupid comments are intentional.
I noticed when I scrolled down to the bottom of the "e-wikipedia"'s clone of the About page, there was some junk words at the bottom which were not on the original.
The site is probably just a reverse proxy with a few filters to insert ads, maybe embed malicious content, insert some junk text, white on white, and the site owners probably hope that when people are looking for info using a search engine, that they will mistake the site for the real Wikipedia.
1. Create a Fake-e-pedia site
2. ????
3. Profit!!!
I wonder what their #2 is...
Just my 2cents.
No sig. Move along - nothing to see here.
1. That site will sooner or later be indexed by Google, misleading unwary googlers to the fake site.
2. More hits, more ad revenue.
3. Profit!!
Hopefully, Wikipedia's GFDL license will make possible to have this website banned.
Aww now you tell me. There goes all my personal banking information that I normally keep safe and sound on the REAL wikipedia site. :(
Interesting.
I was already logged into Wikipedia. I went to e-wiki, and did a search for itself. I decided I'd have some fun and create the article. I clicked to create it, and it brought me over to en.wikipedia.org to create it.
Very interesting. Not even -trying- for original content.
Those who believe the Internet is private,
find their privates are on the Internet.
There are many many many of these sites.
While I notice it hasn't in this case, google is normally pretty quick to remove them from its indexes as well, so if you use google, you'll mostly not be able to find them.
However, the basic meme of copy content, add ads and publish, particularly for content like wikipedia that is self-referential, is very widely used.
--Q
This does nothing to resolve the trademark problem that the 'mirror' creates, but it is instructive to look at the actual text of the license.
"2. Verbatim Copying [] You may copy and distribute the Document in any medium, either commercially or noncommercially, provided that this License, the copyright notices, and the license notice saying this License applies to the Document are reproduced in all copies, and that you add no other conditions whatsoever to those of this License."
The pages do appear to be verbatim copies of the Wikipedia pages, despite the lack of some images (note: verbatim - in precisely the same words used by a writer or speaker). You'll also note that the license does not require attribution (found in other words in Section 4), just a requirement for reproduction. Wikipedia is the one that must resolve its failure to include a copyright notice on the pages, not the mirror.
Archived WHOIS on e-wikipedia.net domain from 2008/04/27 (it's now using a privacy protect WHOIS service):
:-)
Registration Service Provided By: NameCheap.com
Contact: support@NameCheap.com
Visit: http://www.namecheap.com/
Domain name: e-wikipedia.net
Registrant Contact:
-
John Heys (allegro.share2@o2.pl)
+46.0851041152
Fax: +1.5555555555
Virkesvagen 5
Stockholm, n/a 12030
SE
Administrative Contact:
-
John Heys (allegro.share2@o2.pl)
+46.0851041152
Fax: +1.5555555555
Virkesvagen 5
Stockholm, n/a 12030
SE
Technical Contact:
-
John Heys (allegro.share2@o2.pl)
+46.0851041152
Fax: +1.5555555555
Virkesvagen 5
Stockholm, n/a 12030
SE
Status: Locked
Name Servers:
ns1.hostpower.pl
ns2.hostpower.pl
Creation date: 28 Feb 2008 20:23:45
Expiration date: 28 Feb 2009 20:23:45
---
Other domains hosted at that IP:
Strzelecki.info
E-teledyski.org
Giexx.com
Moderowany.net
Songstexts.info
Tibianews.info
Wartibia.com
Wikipedia2009.com
Axeee.com
I'll spare everyone the WHOIS data for all of those domains as well - look it up on your own.
I had a high school student turn in a long report that obviously wasn't her work. I googled it and she had cut and pasted about 10 pages of material right from Wikipedia into her report. I brought her in, told her that some of the writing didn't look like she wrote it:
Me: "Did you write this whole thing yourself?"
Her: "Yes, of course!"
Me: "Are you sure"
Her: "Yes, 100%"
Me: "Well, a huge chunk of your report is straight from Wikipedia."
Her: "Um, yeah, well, um I wrote that Wikipedia page."
If someone references e-wikipedia.net in an article on Wikipedia will the internet collapse in on itself?
Moreover, this is a stupid way to design it, since it's trivial for Wikipedia to detect what you're doing, and serve a custom error page, as they have done. In short, why did these people assume Wikipedia was going to let them continue infringing their trademark and taxing their servers?
When I did a search on it, it returned a "leech" message. Obviously,they didn't even bother to copy it, as far as I can tell, they are just returning wikipedia pages. In fact, the page it returned specifically warned me only to use pages from *.wikipedia.org and that this site was leeching off them. If your going to try something like this, you should at least not be a total idiot, to the point where your copy actually points out that it is fake.
Step 1) Duplicate highly successful web site, rip off all content, images, layouts, etc... /. and Digg about rampant abuse of IP
Step 2) Secure Advertising
Step 3) Submit story on
Step 4) Profit!
-Rick
"Most people in the U.S. wouldn't know they live in a tyrannical state if it walked up and grabbed their junk." - MyFirs
Your hair look like poop, Bob! - Wanker.
The site now redirects to the wiki article on "Leech (computing)" explaining why you can no longer see any other articles. That was quick.
- None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
Another fine product from... MAJESTIC STUDIOS!
Yes, I remember. Ahhh, the halcyon days when Slashdot headlines were 100 percent relevent to nerds, and every post was full of knowledge and wisdom. Yes, I remember those days well.
Steve's Computer Service, Hobbs, NM
I tried that, but my bank account details failed the notability criteria..
Repton.
They say that only an experienced wizard can do the tengu shuffle.
That is not the problem.
The problem is not using the information that Wikipedia provides--after all, that's why it is contributed under copyleft. The problem is that someone is essentially hosting a site that routes all the heavy computational, database, and programming work through Wikimedia's servers, usually with the intention of making a quick buck by spamming or selling ads.
And then the site got started :)
For the unaware: Majestic Studios.
I'm amazed at how many people are missing Wikipedia's built in protection against this.
Every page has a history. It's possible to cite a page at a certain time and guarantee that it will be displayed regardless of what changes are made to the article. This, in addition to a diff system (and discussion), makes it better in some ways than hard print, because it allows the reader to map changes over time and consensus/disagreements over contentious topics.
http://www.xkcd.com/354/
Which is a lot more work than just citing a primary source in the first place. You shouldn't be citing a source you don't understand anyway.
After all, I am strangely colored.
The information is made available under certain licensing rules which they are breaking.
If Google really cared they would fix Android Chrome to reflow text, instead of discriminating
Eh, I think the OED is the de facto dictionary for non-law research.
I think it depends on which edition, one of the paperback editions like the Essential or American editions or the full 20something volume edition. I got my spelling of time as "tyme" from the full edition.
FalconShould there be a Law?
And definitive use in the law, as well. Merriam-Webster is not a particularly accurate, thorough, or disciplined publication. It's fine as a casual reference, but so is Dictionary.com, American Heritage, and Webster's (the REAL Webster's).
The OED is the English language resource, at least in terms of the high water mark for scholarship. It is that disciplined scholarship that leads to its criticism, however. Precise word choice, where it is important, should not be blunted by an overly populist dictionary with demonstrably lower levels of academic scholarship and fidelity.
If the term has become such a point of contention that the precise dictionary definition is required, then OED is the ultimate arbiter. If you're not squabbling over technicalities and just want the basic gist, then any of the other reputable dictionaries, including M-W, are acceptable. Stopping at M-W, on the other hand, is like saying an encyclopedia is a sufficient technical resource. Encyclopedias and dictionaries are by their nature limited. The OED is unquestionably the most detailed English dictionary, and no other resource can make a contrary claim with any real credibility. That's what makes it valuable in academic, technical, and legal research.