Startup Webaroo to put the 'Web on a Hard Drive'?
An anonymous reader writes "A new startup called Webaroo is launching Monday with an audacious proposition: You can search the Web without a net connection of any kind. Initial release consists of 'Web packs' on specific topics such as news, city guides or Wikipedia. Later this year they're promising a full-Web version that you can carry on a laptop -- provided you're willing to devote something in the neighborhood of 80 gig."
I'm sold. Does anyone have the .torrent for it?
------
beware he who would deny you access to information, for in his mind he dreams himself your master
A new startup called Webaroo is launching Monday with an audacious proposition: You can search the Web without a net connection of any kind.
If anyone doubted the next dotcom boom is upon us, this should put that doubt to rest.
After reading the article, it sounds like they are just selling their web cache. Nice idea, but unless they are selling it really cheap I just can't see it picking up, especially considering the difficulty of getting the data onto your drive. I mean, an 80G download!
Additionally, what if I decide to follow site links that leave the cache?
Yeah I can't really see this picking up.
GeekServ Unix Consulting Services (http://www.geekserv.com)
when someone asked if the internet will fit on a floppy?
The war with islam is a war on the beast
The war on terror is a war for peace
Wouldn't there be an issue here of selling another person's content? While everyone can view the content at will, copying that information to media and then reselling it, or even distributing it for free, would be an issue.
For example, where do we get the porn diffs?
Did you know my dad's dog died?
e.g. searching? Having Wikipedia on your hdd is all well and good, but if you can't easily search it, what's the point?
The problem with slashdot is that most of its users were bullied and stuffed into lockers as kids!
"The Internet Archive Wayback Machine contains approximately 1 petabyte of data and is currently growing at a rate of 20 terabytes per month. This eclipses the amount of text contained in the world's largest libraries, including the Library of Congress. If you tried to place the entire contents of the archive onto floppy disks (we don't recommend this!) and laid them end to end, it would stretch from New York, past Los Angeles, and halfway to Hawaii."
Internet Archive Frequently Asked Questions
Only if one of the webpacks is porn. Or better yet, if several are porn, cross referenced by type and participants.
Though, my vaguely disturbing ramblings do raise an interesting point, maybe - what's their stance on the indecent materials that make up a good deal of teh webernet? When they say the "whole internet," do they MEAN goatse too?
Would the downloadable content include porn?
Er, I'm asking this in order to, er, protect my girlfriend's sensibilities. Can't have her unwittingly downloading such naughty stuff you know. =)
Not enough so's you'd notice. What's the difference between one thimbleful of ocean and 100 thimblefuls of ocean? Besides trying to solve the wrong problem to begin with?
Concealed Handgun License Courses in Plano, Texas
How soon till the first lawsuit is filed?
US copyright law, 17 USC 512, excuses operators of automated caches that conform to established cache control protocols (meta elements, /robots.txt, etc.) from copyright infringement liability.
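For anyone wondering what "conforming to /robots.txt" looks like in practice, here is a minimal sketch using Python's standard `urllib.robotparser`. The bot name `ExampleCacheBot`, the sample rules, and the helper `may_cache` are all made up for illustration; nothing here is Webaroo's actual crawler, and passing this check says nothing about whether a given cache qualifies under 17 USC 512.

```python
from urllib.robotparser import RobotFileParser


def may_cache(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text allows user_agent to fetch url.

    Hypothetical helper: it only evaluates robots.txt rules and ignores
    meta elements and HTTP cache-control headers.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Made-up rules: everything is allowed except paths under /private/.
EXAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

print(may_cache(EXAMPLE_ROBOTS, "ExampleCacheBot", "http://example.com/index.html"))
print(may_cache(EXAMPLE_ROBOTS, "ExampleCacheBot", "http://example.com/private/notes.html"))
```

A real cache would fetch each site's own /robots.txt before copying anything and would re-check it periodically, since site owners can change the rules at any time.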
Technically, they make a copy and the ISP doesn't.
Isn't the ephemeral copy in the RAM of a router still a copy? And don't operators of automated caches have a fairly broad exemption under United States copyright law, 17 USC 512(b)?
How are they to justify selling other people's websites? What about the sites' lost ad revenues?
But not in the way they think. TFA mentions two points, but doesn't explore them in depth. The first is the algorithms they use; let's face it, Google is starting to fall to the SEOs. If they have a new algorithm that is able to actually follow your web browsing all the way, they'd be able to provide much better results. Google claims to do this, but they can't follow you past your first link. Second, they seem to have picked up that most people find all the information they need on the second or third link they visit.
Combine these together, and the program could offer you 80 gigs of data to just sit on your computer and be sifted through at your leisure. It would be able to follow you through and find exactly how you get through your data. When it needs to, it can spider into areas that it thinks you might want to go (Been looking at a lot of Wikipedia? Next time you connect, it goes and picks up some Wikibooks).
The best part is that all the "Big Brother" information is being stored on YOUR computer, not their servers. You want that info, Bush? You'll have to subpoena every user.
If they targeted this more towards a desktop-search type thing with better search algos than Google, this could just work.
Archives are good and this can be a useful service. Providing 80 select gigs on a hard drive to libraries and schools is useful until US networks get where they should be. Their software can keep those 80 GB up to snuff at night. When you leave the cache, you ... gasp ... get the new content. In the meantime, things are much faster when it matters. Mirrored content will always be a good idea. Look at the Debian distribution system, for example.
Good luck to the people at Webaroo. So long as they don't apply for stupid patents that give them an exclusive franchise to distribution systems, they are AOK.
The road warrior thing will flop, though. People are going to stay where there's a network or pay the $10. It's the one piece of live information that requires the hook up. The speed of the rest is gravy for those people.
Friends don't help friends install M$ junk.
Frankly, I could see a market for this *maybe* 10-12 years ago. It just doesn't make any sense now. The internet is not solely about static content. Also, the thimble of data provided in each pack will be underwhelming and perpetually out of date.
I mean, if I know I won't be online for a week, what stops me from just CURLing or WGETing whatever I plan on reading for the next couple of weeks? And that goes only for static content like books and articles. Everything else simply cannot be cached.
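For what it's worth, the do-it-yourself version is only a few lines. Below is a rough sketch in Python rather than actual curl or wget; the function names `http_fetch` and `save_for_offline`, the hashed file naming, and the 30-second timeout are my own assumptions. It grabs single pages only: no images, no stylesheets, no link following, and nothing dynamic.

```python
import hashlib
import pathlib
import urllib.request
from typing import Callable, Dict, Iterable


def http_fetch(url: str) -> bytes:
    """Download one URL and return the raw response body."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read()


def save_for_offline(
    urls: Iterable[str],
    dest_dir: str,
    fetch: Callable[[str], bytes] = http_fetch,
) -> Dict[str, pathlib.Path]:
    """Save each URL's body under dest_dir and return a URL-to-file mapping.

    File names are a short hash of the URL, so odd characters in URLs
    cannot produce invalid paths.
    """
    dest = pathlib.Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    saved: Dict[str, pathlib.Path] = {}
    for url in urls:
        name = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16] + ".html"
        path = dest / name
        path.write_bytes(fetch(url))
        saved[url] = path
    return saved


# Example call (needs a network connection, so it is left commented out):
# save_for_offline(["http://example.com/"], "offline_reading")
```

The `fetch` parameter is there only so the saving logic can be tried without a connection by passing in a stand-in function.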
For shorter flights within the UK and Europe, it's safe to say I can cope without internet access for two hours.
Backup not found: (A)bort (R)etry (P)anic