Wikia Search Engine to be Launched on January 7th
cagnol writes "The Washington Post reports that Jimmy Wales, the founder of online encyclopedia Wikipedia, has announced the launch of a new open-source search engine, Wikia Search, on January 7th, 2008. The project will allow the community to help rank search results, in a model close to Wikipedia. However the company is a for-profit organization. This new search is supposed to challenge Google and Yahoo."
I guess that's their response to Google's Knol (http://en.wikipedia.org/wiki/Knol) Pity to see things heat up between the 'good guys'.
So basically...they're asking for people to abuse the ranking system. To patrol something like this would require a company with resources like Google, and most likely the reason Google doesn't have such functionality. Just my two cents.
The Washington Post reports that Jimmy Wales, the founder of online encyclopedia Wikipedia, has announced the launch of a new open-source search engine, Wikia Search, on January 7th, 2008.
Not only that, Wikipedia is reporting that its marketshare has tripled in the last six months.
The theory of relativity doesn't work right in Arkansas.
The idea is to challenge the established players by offering a search service that is more transparent to end users, meaning they can see how search results are arrived at. Wales has described Yahoo and Google as opaque services that don't explain how results are arrived at.
Personally, I don't care how search engines rank the websites they return as long as what is returned is proper, relevant and useful.
Since this project would seem to depend on the participation and good-will of users in order to work, my guess is that a nonprofit version will follow shortly afterwards, paralleling the open-source model. I also predict that without the benefit of a massive Microsoft-esque head start, the for-profit version will be put of business in short order.
A-Bomb
It would have been nice to see them fix Wikipedia's own search engine, which IMO is absolute garbage. I have a better chance of being linked to what I'm looking for by using a general search engine.
This is simply his response to Google starting what amounts to competition for wikpedia. I'm sure google is having quite the laugh from it - one wonders how much of the donations for wikpedia are being used towards this thing.
If you think wikpedia gets vandalized, wait until there's money involved. Wikpedia for all it's trappings, doesn't directly influence spam. But a search engine... IF, and this is a big IF, this thing becomes mainstream, having the code public will make it very easy for the bot herders to control it. The idea is simply flawed. Google is currently dealing with bot herders attempting to manipulate it's page ranks - while the idea of it being open source sounds great (well, ok it doesn't to me - I don't have the love affair with open source that most slashdotters do - I've never bought into the security myth that there's GOOD coders out there with so much free time on their hands that they are walking OTHER peoples code. I don't like doing that when I'm PAID to do it. Not too mention there just aren't that many good coders out there....but I digress) it's simply going to work right into the hands of the malware crowd - especially now that it's more organized crime than it is vandalism.
EK
by our tags, that we have a few Wikipedian Protestors in our midsts.
Ask me about repetitive DNA
the top hundred results will be to porn sites.
What a crappy search engine. When I search for something, I want the top 100 results to be 100 different porn sites! I can find two porn sites without help.
paintball
Shouldn't they work on getting wikipedia's search to work half way decently before they try to compete with Google?
Don't get me wrong, I like wikipedia, but their search on the site is next to worthless.
The thought that Jimmy Wales, cofounder of Wikipedia could have an open site without abuse is laughable. You operate under the sham of an open community, yet exclude those outside a very narrow political agenda. Your a fraud, using open source principals as a smokescreen that presents your personal world-view set as fact to the world. I don't buy what your selling, and I'm calling your bluff. The sad thing is that this will probably make you a fair amount of money if more people don't start to see through you.
But then the wonderful thing about leading revisionist history is you can substitute your own revisions for reality....
There have only been two fundamental revenue models of content for 25 years now - EndUser and Advertiser. The ISP's went through the throes of the switch from PerHour to FlatRate in the 1990's, and the RIAA is struggling with it now.
I don't know anyone who would "pay to search" casual queries. There are some professional databases which do operate on this principle for high powered content.
From the RIAA threads we learn people don't want to pay as endusers for their content. The post above asks about the advertiser model.
The absolutely tough part about Free Open Source models is that it takes a MUCH longer cycle for the benefits to wind around the social benefit cycle. The monthly rent/mortgage whips around much sooner. The first person to absolutely nail this problem will be the mogul of the 2010 decade.
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
Wikia has been something of a dud. What Wikia really does is monetize fancruft. Their big wikis are for Star [Trek|Wars|Gate|Craft], Everquest, Marvel comics, Yu-Gi-Oh, and similar subjects. They're the resting place for fan articles thrown out of Wikipedia.
Wikia's search engine, based on the user demographic they have now, is going to have great coverage of furry fan fiction.
There's already a good manually-updated search engine. It's called Open Directory. It's quite useful as a data source for answering the question "what is this web site about"? It tends to run months behind changes to the web, since it's manually updated. While not many people query DMOZ manually, it's used by Yahoo, Google, etc. to get some basic information about a web site.
As an example of how great Wikia search is going to be, Wales suggested searching for "Tampa hotels". The major search engines return too many bottom-feeder reseller and directory sites for searches like that. As I point out occasionally, we've already solved that problem over at SiteTruth, which looks for business legitimacy. Type in "Tampa hotels" there and watch it push the marginal sites to the bottom of the search results. We have that one handled.
Wikipedia works because people are willing to do substantial work for free for a non-profit organization. That doesn't work for a commercial business. You can get people to write about themselves (Myspace, Facebook, etc.) but beyond that, "crowdsourcing" doesn't go very far.
As trollish as parent is perhaps, he is unfortunately speaking a trollish truth.
Speaking explicitly as a reader of slashdot, with all the group-think biases a site like this introduces, wikipedia is floundering in a mire of their own arrogance, and the dissatisfaction with this needs to be heard.
Wikipedia receives most of its traffic from its articles appearing in Google's search results, Wikipedia being relevant content, and Google being the top search engine.
How is Wikipedia to draw traffic to their search engine? Obviously not via Google, as search engines are content free on their own. Integrating it with Wikipedia? But again, Wikipedia is the end target, not a start point, so how could this work.
I don't think Wikipedia has the strategy or money for this to reach critical mass and show its potential, but it'll be interesting as an experiment.
And Furries.
Earn a % of cash back from Newegg, Tiger Direct, Walmart.com, and more: http://www.mrrebates.com?refid=458505
I foresee someone hacking this system to return goatse as the #1 result for every search made.
Is content going to ever be totally free? It will be if people understand the inherent rewards of an open society. Information's negligible cost of duplication is the revolutionary model is the thing that is shattering the old models (c.f. http://homes.eff.org/~barlow/EconomyOfIdeas.html). Wikipedia is already doing that. As much as I'm a critic of Jimmy Wales, citizendium, etc. (with their NPOV lunacy), the system he's helped build is saving people's lives and improving quality of life in ways the old world just doesn't understand yet.
Personally, I'm hopeful that as long as we still have the Right to Read (c.f. http://www.gnu.org/philosophy/right-to-read.html), we're on the path to freedom and salvation. A corporation who makes up a new "model" to take advantage of content producers isn't going to take hold anymore. There's just not a point anymore. The price of content is already quite low for common knowledge. Even if the arbiters of knowledge try to keep it from common knowledge, we can paraphrase it. The greatest risk to real productive use of our knowledge still remains Patents. Information may finally be free, but the freedom to tinker is not.
Completely different. :) For one thing, we are doing everything completely freely licensed. Mahalo is proprietary.
For another thing, Mahalo is "human edited" search results for the top queries, which is not a bad idea of course, but it is not intended to be a full search engine. Mahalo have indicated an interest in replacing their google search backup with our open source alternative, if we get to be good enough, which is obviously a far from foregone conclusion.
Wikia
Funny to read this today, after I spent a couple of hours yesterday searching Google for something that doesn't exist -- a Plucker type app for the iPod Classic. What struck me was just how badly Google performed. Any search containing the word "iPod" seems to return pages upon pages of blog entries about the (long since released) iPhone. What one tends to find with a Google search are a lot of loud, content-light blog entries, popping with ads, with short dashed-off articles broken across several pages. "Relevance" in Google seems to have the most to do with activity -- posts per day per site, repeated introductory blurbs on every page, modestly-trafficed forums devoid of meaningful discussion. Google does a pretty decent job with common searches, reasonably well with obscure searches, but very badly with the rest -- the middle of the long tail.
Google rose to prominence by being the best of a pretty weak set of players. It's still only the least bad solution, and there are a lot of things it does poorly. In classic AltaVista, you could type a few words of a song in quotes and find the title and lyrics. Type a long quoted string into Google, and you're likely to come up with nothing.
If Wikia manages to best Google in any type of search I'll applaud it. Search choices beyond Google and Trying to Be Google would be most welcome.
If we're talking about publicly-ranked search results, the results may expose more than we're comfortable with.
/., I find all +5 content to be generally insightful, interesting, funny, etc. At least it seems so to me. Either I'm new here, or we've all seen Life of Brian. Whether that's utopia or not is another question altogether.
Wikipedia content is either right or wrong. It's not meant to be subjective, hence it can be patrolled and corrected. Now they want to apply it to subjective content; I don't see that making sense, albeit at first glance. User A is a technocrat who loves Monty Python. Hardly an isolated case. Use B is a 15yr old who likes whatever he/she likes this week. There's no "patrolling" this, except to address systematic abuse.
The concept is fine for slashdot, or any "closed" system, where the users generally share a common set of expectations. At
Expand this out to the general internet user, and the result will, of course, reflect the general focus of human society. That will be interesting, to say the least, though I'll bet $5 that anything entertainment- and religion-based will always be at the top of the results. Is that what people want? Ipso facto perhaps, but sure as hell not I.
Let's keep in mind that (no offence to anyone specific) ~80% of Americans believe in God, less than 50% subscribe to Darwin, ~30% believe in "UFOs, witches and astrology" (if you can believe this poll that is). Of course, smart people believe weird things too.
Add to this, that 81% of those who have seen two or more "Police Academy" movies believe that O.J. is innocent, and you have a recipe for disaster.
It's easier to find things on Wikipedia with Google than it is with the Wikipedia search... Good luck, Jimmy Wales. You're going to need it.