Company Offers Customizable Web Spidering
TechReviewAl writes "A company called 80legs has come up with an interesting new web business model: customized, on-demand web spidering. The company sells access to its spidering system, charging $2 for every million pages crawled, plus a fee of three cents per hour of processing used. The idea is to offer Web startups a way to build their own web indexes without requiring huge server farms. 'Many startups struggle to find the funding needed to build large data centers, but that's not the approach 80legs took to construct its Web crawling infrastructure. The company instead runs its software on a distributed network of personal computers, much like the ones used for projects such as SETI@home. The distributed computing network is put together by Plura Processing, which rents it to 80legs. Plura gets computer users to supply unused processing power in exchange for access to games, donations to charities, and other rewards.'"
Lets assume that spidering a page costs 10 kB of data.
So thats $2 for 1M pages, or 10 GB of data download.
So thats at least $1 of data transfer that is being shifted onto the suckers, err "volunteers" who's home network is running this app.
Test your net with Netalyzr
Sounds like a legitimate front for identity thieves, spammer, or even worse... Marketers.
I suppose its easier to do than running your own bot net.
"I am the king of the Romans, and am superior to rules of grammar!"
-Sigismund, Holy Roman Emperor (1368-1437)
Seems like an awfully cheap way to spider millions of pages of porn. It would be worthwhile if Google didn't do it already for free.
The levels of indirection present to support this system -- distributed clients, incentives for being a distributed client, power supply vs demand, payment for custom spidering -- make the system many things at the same time and unnecessarily complex, because those things already exist for free and in less complex ways. Many needs are sufficed by the simpler mechanisms and always have been.
Plura gets computer users to supply unused processing power in exchange for access to games, donations to charities, and spyware.
What's a Sig?
Why the hell would the embed plura into an IM client anyway?
Unfortunately, it's all about money.