Wikia Search Engine to be Launched on January 7th
cagnol writes "The Washington Post reports that Jimmy Wales, the founder of online encyclopedia Wikipedia, has announced the launch of a new open-source search engine, Wikia Search, on January 7th, 2008. The project will allow the community to help rank search results, in a model close to Wikipedia. However, the company is a for-profit organization. This new search is supposed to challenge Google and Yahoo."
So basically... they're asking for people to abuse the ranking system. To patrol something like this would require a company with resources like Google's, which is most likely the reason Google doesn't have such functionality. Just my two cents.
The idea is to challenge the established players by offering a search service that is more transparent to end users, meaning they can see how search results are arrived at. Wales has described Yahoo and Google as opaque services that don't explain how results are arrived at.
Personally, I don't care how search engines rank the websites they return as long as what is returned is proper, relevant and useful.
What? Why? It's called "competition" and it's healthy.
The non-profit is still going to have to make money. Crawling the web and returning results to queries is quite hardware and energy intensive.
It would have been nice to see them fix Wikipedia's own search engine, which IMO is absolute garbage. I have a better chance of being linked to what I'm looking for by using a general search engine.
This is simply his response to Google starting what amounts to competition for Wikipedia. I'm sure Google is having quite the laugh from it - one wonders how much of the donations for Wikipedia are being used towards this thing.
If you think Wikipedia gets vandalized, wait until there's money involved. Wikipedia, for all its trappings, doesn't directly influence spam. But a search engine... IF, and this is a big IF, this thing becomes mainstream, having the code public will make it very easy for the bot herders to control it. The idea is simply flawed. Google is currently dealing with bot herders attempting to manipulate its page ranks - while the idea of it being open source sounds great (well, ok it doesn't to me - I don't have the love affair with open source that most slashdotters do - I've never bought into the security myth that there are GOOD coders out there with so much free time on their hands that they are walking OTHER people's code. I don't like doing that when I'm PAID to do it. Not to mention there just aren't that many good coders out there....but I digress) it's simply going to play right into the hands of the malware crowd - especially now that it's more organized crime than it is vandalism.
EK
Shouldn't they work on getting Wikipedia's search to work halfway decently before they try to compete with Google?
Don't get me wrong, I like Wikipedia, but their search on the site is next to worthless.
The thought that Jimmy Wales, cofounder of Wikipedia, could have an open site without abuse is laughable. You operate under the sham of an open community, yet exclude those outside a very narrow political agenda. You're a fraud, using open source principles as a smokescreen that presents your personal world-view set as fact to the world. I don't buy what you're selling, and I'm calling your bluff. The sad thing is that this will probably make you a fair amount of money if more people don't start to see through you.
But then the wonderful thing about leading revisionist history is you can substitute your own revisions for reality....
Hey Jimmy: quit goofing around on Slashdot and get to straightening out the Wikipedia "administration" system. Check out your current fundraising campaign - that little green guy is moving REALLY slowly, and things like faked credentials for editors and the "notability purge" aren't helping.
Sincerely,
The Rest of the Internet
"As God is my witness, I thought turkeys could fly." A. Carlson
Wikia has been something of a dud. What Wikia really does is monetize fancruft. Their big wikis are for Star [Trek|Wars|Gate|Craft], Everquest, Marvel comics, Yu-Gi-Oh, and similar subjects. They're the resting place for fan articles thrown out of Wikipedia.
Wikia's search engine, based on the user demographic they have now, is going to have great coverage of furry fan fiction.
There's already a good manually-updated search engine. It's called Open Directory. It's quite useful as a data source for answering the question "what is this web site about"? It tends to run months behind changes to the web, since it's manually updated. While not many people query DMOZ manually, it's used by Yahoo, Google, etc. to get some basic information about a web site.
As an example of how great Wikia search is going to be, Wales suggested searching for "Tampa hotels". The major search engines return too many bottom-feeder reseller and directory sites for searches like that. As I point out occasionally, we've already solved that problem over at SiteTruth, which looks for business legitimacy. Type in "Tampa hotels" there and watch it push the marginal sites to the bottom of the search results. We have that one handled.
Wikipedia works because people are willing to do substantial work for free for a non-profit organization. That doesn't work for a commercial business. You can get people to write about themselves (Myspace, Facebook, etc.) but beyond that, "crowdsourcing" doesn't go very far.
Is content ever going to be totally free? It will be if people understand the inherent rewards of an open society. Information's negligible cost of duplication is the revolutionary thing that is shattering the old models (cf. http://homes.eff.org/~barlow/EconomyOfIdeas.html). Wikipedia is already doing that. As much as I'm a critic of Jimmy Wales, Citizendium, etc. (with their NPOV lunacy), the system he's helped build is saving people's lives and improving quality of life in ways the old world just doesn't understand yet.
Personally, I'm hopeful that as long as we still have the Right to Read (cf. http://www.gnu.org/philosophy/right-to-read.html), we're on the path to freedom and salvation. A corporation that makes up a new "model" to take advantage of content producers isn't going to take hold anymore. There's just no point anymore. The price of content is already quite low for common knowledge. Even if the arbiters of knowledge try to keep it from becoming common knowledge, we can paraphrase it. The greatest risk to real productive use of our knowledge still remains patents. Information may finally be free, but the freedom to tinker is not.
Funny to read this today, after I spent a couple of hours yesterday searching Google for something that doesn't exist -- a Plucker type app for the iPod Classic. What struck me was just how badly Google performed. Any search containing the word "iPod" seems to return pages upon pages of blog entries about the (long since released) iPhone. What one tends to find with a Google search are a lot of loud, content-light blog entries, popping with ads, with short dashed-off articles broken across several pages. "Relevance" in Google seems to have the most to do with activity -- posts per day per site, repeated introductory blurbs on every page, modestly trafficked forums devoid of meaningful discussion. Google does a pretty decent job with common searches, reasonably well with obscure searches, but very badly with the rest -- the middle of the long tail.
Google rose to prominence by being the best of a pretty weak set of players. It's still only the least bad solution, and there are a lot of things it does poorly. In classic AltaVista, you could type a few words of a song in quotes and find the title and lyrics. Type a long quoted string into Google, and you're likely to come up with nothing.
If Wikia manages to best Google in any type of search I'll applaud it. Search choices beyond Google and Trying to Be Google would be most welcome.
If we're talking about publicly-ranked search results, the results may expose more than we're comfortable with.
Wikipedia content is either right or wrong. It's not meant to be subjective, hence it can be patrolled and corrected. Now they want to apply it to subjective content; I don't see that making sense, at least at first glance. User A is a technocrat who loves Monty Python. Hardly an isolated case. User B is a 15-year-old who likes whatever he/she likes this week. There's no "patrolling" this, except to address systematic abuse.
The concept is fine for Slashdot, or any "closed" system, where the users generally share a common set of expectations. At /., I find all +5 content to be generally insightful, interesting, funny, etc. At least it seems so to me. Either I'm new here, or we've all seen Life of Brian. Whether that's utopia or not is another question altogether.
Expand this out to the general internet user, and the result will, of course, reflect the general focus of human society. That will be interesting, to say the least, though I'll bet $5 that anything entertainment- and religion-based will always be at the top of the results. Is that what people want? Ipso facto perhaps, but sure as hell not I.
Let's keep in mind that (no offence to anyone specific) ~80% of Americans believe in God, less than 50% subscribe to Darwin, ~30% believe in "UFOs, witches and astrology" (if you can believe this poll, that is). Of course, smart people believe weird things too.
Add to this that 81% of those who have seen two or more "Police Academy" movies believe that O.J. is innocent, and you have a recipe for disaster.
It's easier to find things on Wikipedia with Google than it is with the Wikipedia search... Good luck, Jimmy Wales. You're going to need it.