Wikipedia Pages Now On Amazon — With Product Links
An anonymous reader writes "Last month, e-commerce marketplace Amazon.com launched a relatively unnoticed new feature that brings content from Wikipedia pages to its own servers in a shadowy new project that appears to be called 'Shopping Enabled Wikipedia Pages.' Hosted on the Amazon.com domain, they replicate Wikipedia's content but have added links to where a book can be purchased on Amazon. Amazon representative Anya Waring told CNET when asked via e-mail, 'As of November, we have rolled out in the books category, however [it] will be expanding to new categories in 2011.' If Average Joe scrapes Wikipedia and adds affiliate links to it, Google will remove and punish the domains with duplicate pages."
I guess there's nothing that doesn't end up being commercialized. Wikipedia has certain problems — when I look up topics in which I'm an expert, I always find the articles full of mistakes — but it was nice to see something that was relatively free of commercial spin. No more, it seems.
Google punishes wikipedia clones with adverts? Are you sure, because one of the things that made me stop using Google was the large number of results that were either mailing list archives with ads (the same list post on the top 10 hits, just different ads), or Wikipedia copies with ads. In fact, the 'Google will remove and punish' link refers to domains that contain the same content on different pages, rather than domains that duplicate the content of other domains, so is completely inapplicable to pages hosting Wikipedia content plus adverts.
I am TheRaven on Soylent News
From the very page linked
I don't think Amazon is doing this to boost their pagerank.
From Google:
"Duplicate content on a site is note grounds for action on that site unless it appears that the intent of the duplicate content is to be deceptive and manipulate search engine results"
ie: the 'Average Joe' can scrape wikipedia all he wants and Google will not punish him unless his intent is to deceive. But thanks for the conspiracy theory attempt just the same.
A.
...bringing you cynical quips since 1998
Google will not punish and remove.
Google will discount the PageRank (Page, as in Larry) to nothing for prior published content. That is the one and only "penalty."
Amazon, whatever the value of this, has enough related value content for this not to matter much-- there's (probably) a PR+ value to presenting the relevant Wikipedia content next to similar information.
Yes, it's darn annoying and another reason to boycott those **** at Amazon. But it's not the things the OP summary says. //karma-whoring
"I said we should host Wikipedia, you idiot!"
How can I believe you when you tell me what I don't want to hear?
I don't think Amazon is doing this to boost their pagerank.
Sure, but they still "manipulate search engine results".
If you want news from today, you have to come back tomorrow.
Sarah, is that you?
Wikileaks was going to publish the fact that Amazon is a pediaphile.
I am TheRaven on Soylent News
I am however against the commercialization aspect of Wikipedia (especially since, like others have said, I doubt Wikipedia makes any money off of this due to its open nature).
Why not just create an add-on that does this across all web pages, similar to how skype lets you call any phone number on a web page, or g-mail identifies emails that are about shipments and posts a link to the tracking (though these are not themselves add-ons). If you could even further specify what types of things you are interested in getting info on (i.e. only books) it would be much less intrusive and subversive and less likely to turn away people.
Though maybe enough people wont care and this will be really profitable for amazon, even though they are shoving it down people's throats
www.RacquetUp.org - Helping Detroit Youth
Let's play this game. Assume Wikipedia was using a more draconian licence that restricted monetary gain. Then it would become a much less valuable as source material. If I was working on a research grant, I couldn't touch wikipedia, not even to check their sources, out of fear of getting sued for copyright violations. Do we really want more of that?
If our elected representatives no longer represent us, do we still live in a Democracy?
As a sometimes-wikipedia editor (aren't we all) I have to say "MEH".
I contribute to wikipeida because I want a useful reference. If Amazon is willing to mirror it (with a couple of ads) what is the problem?
but aside from that I am not sure what I am reading here? is wikipedia turning into some kind of fancy amazon catalog? FUCK ME! I am outta the intertubes, enough of this commercial bullshit!
When I read the summary, I thought it was referring to the delightful(ly stupid) practice some people have got into, of packaging Wikipedia pages and selling them on Amazon while printing the things through services like Lulu. That was a clear example of how badly their internal search can be gamed. This is just unbelievably crass. On the other hand, who on Earth is going to go straight to an Amazon mirror of Wikipedia?
It appears that you have to find a way to click yourself out of shopping-enabled Wikipedia into regular Wikipedia in order to be able to search Wikipedia for anything that's not already on the main page.
Also, the shopping-enabled main page is under the impression that today is October 23. When you live near a Marine Corps base, stuff like
1983 – Lebanese Civil War: Suicide bombers destroyed two barracks in Beirut, Lebanon, killing 241 U.S. servicemen and 58 French paratroopers of the international peacekeeping force.
tends to catch your eye.
I see even classic Slashdot is now pretty much unusable on dial up anymore.
Your example is silly, a non sequitur: nothing in such a license would prohibit READING Wikipedia... which is all you'd be doing if you were "checking sources". If you COPIED the article into your own research-for-profit, though, you'd be begging for a smackdown.
Wikipedia will be the first encyclopedia to have a version which actually directly pushes readers to more authoritative sources (specialized books, etc.) How many other encyclopedias will be able to say that they have such integration?
Are they redirecting people from wikipedia? Are they stomping on search result pages? Nothing is being "shoved" here.
This is an incredibly useful feature. I use wikipedia all the time for research papaers, but most research papers do not allow online sources or allow only a limited number. Citations to actual books are needed, and to draw quotes from those books we need access to at least a bit of the content. Amazon provides this, meaning now I may be able to just click a citation and be directed to the proper page at amazon where I can access a few sample pages from the book - ba-bing, now I have a citation for my paper. What's amazing is not how amazon was crass enough to do this, but that jimmy wales was so shortsighted as to not offer to do this from the beginning. That's potentially a lot of revenue they'll never claim now.
...1, May the wiki-fiddling begin....
Laters Sol "Have you found the secrets of the universe? Asked Zebade "I'm sure I left them here somewhere"
Well, there are basically two problems that I see:
1) If the data is a copy, how do you keep the copy synced with the original.
2) If the data is a hot-link, who pays for the extra bandwidth?
Those are both minor, and only one will apply. But to me it seems that there should probably be an update cycle. The main question is "how fast?". If it's a slow update cycle, then there should be little on-going expense, and it should facilitate Wikipedia doing it's job.
Ideally, Amazon should host Wikipedia in the cloud, and Wikipedia should do periodic hot-updates to it's local database. This would decrease the cost to Wikipedia and facilitate Amazon doing hot-links. But there's the matter of control of the original sources, domain name, etc., and I'm afraid that I wouldn't trust Amazon enough for that to be an acceptable alternative. We don't live in an ideal world.
I think we've pushed this "anyone can grow up to be president" thing too far.
Let's play this game. Assume Wikipedia was using a more draconian licence that restricted monetary gain. Then it would become a much less valuable as source material. If I was working on a research grant, I couldn't touch wikipedia, not even to check their sources, out of fear of getting sued for copyright violations. Do we really want more of that?
If I was working on a research grant, I couldn't touch wikipedia *anyway*. It *might* be an OK source for grade / high-school and *some* undergrad papers / projects, but NOT for research grants.
Wikipedia is a great resource, but not for anything more than a very preliminary starting point for things above a certain level.
If you want news from today, you have to come back tomorrow.
They give away their software and copies of their database so anyone an do the same thing. Amazon should throw some cash their way but they don't have to.
I hope they'll be kicking back some money to the Wikimedia Foundation. Though they don't have to, if they're getting some value out of it, they should make sure Wikimedia can keep its projects running. Bezos can certainly spare some change.
Use Wikipedia's xml export periodically.
Extra bandwith is handled by Amazon, no?
Change is certain; progress is not obligatory.
woohoo! GPL vs BSD flame on!
Do you even lift?
These aren't the 'roids you're looking for.
Assume Wikipedia was using a more draconian license that restricted monetary gain. Then it would become a much less valuable as source material.
That's nonsense, Wikipedia's license applies to redistribution, not use. You would have as much freedom with Wikipedia using non-commercial license as with any regular old book, you could use and quote it all you like, just not do plain verbatim copies of it. Or have you stopped using regular books to while working on your research grants too?
Do we really want more of that?
Depends, once up on a time there was some use for allowing commercial redistribution of freely licensed stuff, as otherwise you wouldn't have all the Linux distributions. But with the Internet being pretty much commonplace everywhere now there is much less need for physical redistribution, so using a non-commercial licenses could make a good bit more sense, as they would stop people from grabbing free stuff, slapping some ads on it and making money from that.
Huh? The equivalent for software is perfectly fine under either BSD or GPL.
I'd guess that this is mostly about enhancing things for Kindle users. Perhaps when reading a Kindle book there'll be an embedded link to the Amazon enhanced Wiki content. Same for Shelfari, Abebooks, etc. They may have no intention of making the Wiki content available to casual surfers, and may opt-out of indexing by search engines entirely.
Wikipedia has always had the ability to look up where to get books that are cited as references. People tend to cite online sources more often because it is easier, and because the admins prefer references that they can check without having to do much work; I've seen arguments where admin threatened to remove something because the reference was an (unclassified) military manual which was only available in large libraries.
If you click on an ISBN you'll get this unweildy page, which links to searches in more libraries, stores, and databases then you ever cared to look at. It would be nice if they used some sort of geolocation to show the most relevant at the top of the page.
You're using an encyclopedia for grant-level research?
Can you be Even More Awesome?!
If not, what is required? Ten more sites copying wikipedia content, so all ten results on the first page point to the same page?
If I was working on a research grant, I couldn't touch wikipedia *anyway*. It *might* be an OK source for grade / high-school and *some* undergrad papers / projects, but NOT for research grants.
Wikipedia shouldn't be cited as a source at any level. But it can help you to understand a topic, and hopefully point you to some better sources if you need to cite something. There's no arbitrary limit at which you can't use it like that. Even when you're an expert in some field, you're still going to want information on related fields quite often.
I might get bashed for this comment but I think that it is actually a good feature. As a researcher, I often use Wikipedia to get links to more more sources of authority that I can ask the laboratory to order on Amazon. As far as I understand, at the moment, Amazon just links ISBN and book titles back to Amazon so you can buy them. What I did before was copy and pasting the ISBN to Amazon or searching for the book title. The way they have implemented the shopping-enabled Wikipedia is close to the behaviour of customers looking for books on a specific subject and just spare some copy-paste. If I use wikipedia to get to know how I should spend my book budget, I think this is a very good approach.
This is all about Search Engine Optimization and result relevance. Amazon (in particular, this team) is trying to make sure that their pages are the top results for search hits on keywords related to authors, musicians, movies, TV series, etc. By providing additional context (such as a full Wikipedia article) they can get not just relevance but related concepts or keywords.
There are no community features, no editing the page, and no discussion. This is solely about redirecting search traffic to Amazon for shopping, and website sessions resulting in purchases that enter through this page is how the feature is going to be evaluated and continued or discontinued.
Amazon is all about making the sale and the pages are relentlessly optimized to maximize the likelihood that you'll make a purchase through them.
If you're like most researchers you simply mass copy the research links sourced from a few studies without reading any of them anyways, don't kid yourself, you use wikipedia as a source to find other sources.
No, it's a wikiphile. But it's not so funny.
Is it just me or have they removed the ability to search articles? So far, it appears to show me the main page by default and I can't freely choose an article. I don't see any adds and links to Amazon content only seem to work with ISBNs. As is, it's pretty useless.
I feel so sig.
And how about those stupid scrape-and-clone "business directory" sites? Why, when I search for a company by name, do 5 or 10 of these useless listings rank above the company's own website? Google (and/or its competitors) need to do something about that or the entire WWW will die. Hmm, that could be a good thing :)
The extra bandwidth that I was talking about was that bandwidth used by Amazon in making the periodic backup copy. Which happens repeatedly. The frequency with which it happens determines the amount of extra bandwidth.
I think we've pushed this "anyone can grow up to be president" thing too far.
After what Amazon did to Wikileaks, I cannot believe they are anything but greedy Corporate bloodsuckers, ready and willing to sacrifice YOUR freedom for a few dollars.
It is going to be a good long while before I will do business with them again.
I don't understand. How is it extra? It's the cost of running a public website when you allow everyone to do that. I'm certain Amazon isn't the worst abuser of Wikipedia's resources.
Change is certain; progress is not obligatory.
Yes, it's a part of the cost of running an open web-site. That doesn't mean it's not a cost. (Possibly not a major one, though.)
This is an identified on-going cost. That others do the same thing doesn't mean this isn't a cost. (And I *did* originally say that this was a minor problem.)
The actual point was that I couldn't find anything BUT minor problems. Perhaps I should have been more explicit.
(OTOH, I don't really trust Amazon. That incident with removing already purchased copies of 1984 still sticks in my craw.)
I think we've pushed this "anyone can grow up to be president" thing too far.
No one knows you're an old dog.
I may be "too young to understand" whatever it is you blathered on about, but at least I'm old enough to know not to make silly assumptions about anonymous voices in the ether.