Slashdot Mirror


Startup Webaroo to put the 'Web on a Hard Drive'?

An anonymous reader writes "A new startup called Webaroo is launching Monday with an audacious proposition: You can search the Web without a net connection of any kind. Initial release consists of 'Web packs' on specific topics such as news, city guides or Wikipedia. Later this year they're promising a full-Web version that you can carry on a laptop -- provided you're willing to devote something in the neighborhood of 80 gig."

52 of 340 comments (clear)

  1. sounds great by sentientbeing · · Score: 5, Funny

    I'm sold. Does anyone have the .torrent for it?

    --

    ------
    beware he who would deny you access to information, for in his mind he dreams himself your master
    1. Re:sounds great by Red+Alastor · · Score: 5, Funny
      I'm sold. Does anyone have the .torrent for it?
      Me too ! Do you think we can subscribe to a service to get updates when the content change ?
      --
      Slashdot anagrams to "Sad Sloth"
    2. Re:sounds great by cgenman · · Score: 5, Funny

      I don't have a lot of hard drive space. It would be really convienient for me if they just put the packs online.

  2. Dotcom v3.0 by Saven+Marek · · Score: 5, Insightful

    A new startup called Webaroo is launching Monday with an audacious proposition: You can search the Web without a net connection of any kind.

    If anyone doubted the next dotcom boom is upon us, this should put that doubt to rest.

    1. Re:Dotcom v3.0 by caffeinemessiah · · Score: 5, Insightful
      I was JUST thinking that. This seems like the beginning of a whole slew of semi-ridiculous ideas that get funded because their proponents seem 'ahead of their time'. Did someone at a funding company not think of the following two points:

      1) the web is growing at a phenomenal rate. in a few years, the only thing that you'll be able to fit on even high-density media is very narrow, specific content. is there really such a huge market for that?

      2) wifi is nearly ubiquitous. why pay for a static snapshot of the web that will be obsolete in a few days when you can walk into a starbucks with you laptop and get the fresh stuff almost for free??

      I'm sure the guys who want to put the web on a disk have thought these points through, but me...I just really want to sigh. and buy some short-term stocks.

      --
      An old-timer with old-timey ideas.
    2. Re:Dotcom v3.0 by tepples · · Score: 2, Insightful

      in a few years, the only thing that you'll be able to fit on even high-density media is very narrow, specific content.

      The thing is that Wikipedia, with all its imperfections and gaps, is still a surprisingly good start.

    3. Re:Dotcom v3.0 by Bogtha · · Score: 5, Insightful

      wifi is nearly ubiquitous.

      I think you're way off on this one. On the other hand, I have a suitable substitute:

      2. What the hell are they going to do about the copyright issues?

      --
      Bogtha Bogtha Bogtha
    4. Re:Dotcom v3.0 by Philocke+Fox · · Score: 4, Interesting
      Robert X. Cringley had an article about this last year. http://www.pbs.org/cringely/pulpit/pulpit20050210. html

      Basically what he said was that venture capitalists raised a whole bunch of money that they didn't spend during the last boom. This money is raised from investors and is given to the VCs for a limited time. The VCs make money from the management fees they collect for dealing with this money, usually 1 or 2% of the total amount. But, if they don't invest, then the money AND the fees get sent back to the original investors.

      The time limit on investment is usually about 10 years. So if we say that the boom started around '96, then some of these limits have already expired, and the rest of them will expire within the next 4 years.

      Use it or lose it. And the VCs will definitely use it.

    5. Re:Dotcom v3.0 by Xeriar · · Score: 4, Insightful

      What the hell are they going to do about the copyright issues?

      Quoted for truth. I know I'm not the only one who thought "Hey, this would be cool... but the target websites are going to be pissed about losing their ad revenue."

      For sites like Wikipedia and others whose goal is the distribution of their content, this isn't as much of a big deal (unless, in the case of Wikipedia, they snapshot a vandalized site...), but a lot of content providers won't be happy about getting their ad revenue stolen.

    6. Re:Dotcom v3.0 by JourneyExpertApe · · Score: 2, Insightful

      Not to mention the copyright issues. I don't think many companies/individuals would want their websites being packaged and sold without their consent.

      --
      If you can read this sig, you're too close.
    7. Re:Dotcom v3.0 by flyingsquid · · Score: 3, Funny
      I was JUST thinking that. This seems like the beginning of a whole slew of semi-ridiculous ideas that get funded because their proponents seem 'ahead of their time'

      In related news, NewsCorp bought Myspace.com for 580 million.

    8. Re:Dotcom v3.0 by Mr+Z · · Score: 2, Informative

      Hmm... I'm guessing bandwidth better be cheaper than ads, otherwise they lose money with every page served. So, if the ads bring in more than the bandwidth costs, they still lose money here.

  3. Sounds like a cache to me by liliafan · · Score: 5, Informative

    After reading the article, it sounds like they are just selling their web cache, nice idea but really unless they are selling really cheap I just can't see it picking up, especially considering the difficulties of getting the data to your drive, I mean an 80G download!

    Additionally what if I decide to follow site links that leave the cache?

    Yeah I can't really see this picking up.

    --
    GeekServ Unix Consulting Services (http://www.geekserv.com)
    1. Re:Sounds like a cache to me by dogwelder99 · · Score: 2, Interesting
      From TFA...
      The company and service officially emerge from behind their stealth shield tomorrow armed with a flashy bundling agreement from laptop maker Acer.
      Most likely, the reason behind this awesomely silly "feature" is getting people to pay more for laptops with larger hard drives, with marketing promising "search the web without an internet connection!"

      And, of course, selling a subscription service that lets you download updates of your favorite internet content to your laptop... a technology formerly known as, well, "browsing the web". Using slick marketing to sell people stuff they already have, was a huge success for the bottled water industry... can't blame these guys for trying it on the internets.

  4. ah yes remember the day by minus_273 · · Score: 5, Funny

    when someone asked if the internet will fit on a floppy?

    --
    The war with islam is a war on the beast
    The war on terror is a war for peace
    1. Re:ah yes remember the day by jeroenb · · Score: 3, Funny

      I remember somewhere halfway through the 90s that a co-worker who did research into search technologies got the "idea" to just crawl the web looking for references to stuff you were interested in. It was pretty obvious, but he wanted to back everything up in case he wanted to recrawl it searching for something else.

      One day our internet connection was down and we went up to him asking: "the net connection is down, could we use your internet backup instead?" He was not amused, we were :)

      Come to think of it, I'm not sure what he's up to nowadays...

    2. Re:ah yes remember the day by fracex · · Score: 4, Funny

      I remember when I was convinced that the entire interent came on one of those AOL floppies.

      Mind you I think I was 7 years old at the time.

  5. Copyright infringment. by arthurpaliden · · Score: 2, Insightful

    How soon till the first lawsuit is filed.

    1. Re:Copyright infringment. by tepples · · Score: 5, Informative

      How soon till the first lawsuit is filed.

      US copyright law, 17 USC 512, excuses operators of automated caches that conform to established cache control protocols (meta elements, /robots.txt, etc.) from copyright infringement liability.

  6. Is this really the right time? by php_krisp · · Score: 2, Insightful

    Is this really the right to to try this? when wi-fi connections are popping up all over the place and the internet's bigger than it ever has been before?

    1. Re:Is this really the right time? by know1 · · Score: 2, Funny

      it would always be the right time, if only for the possibility that the bomb drops and we have to live a mad max style existance scavenging and fighting over laptop batteries and petrol in old stores throughout the land. If that happens, i wanna be able to read uncyclopedia at the end of the day.
      If it didn't happen i would be like the guy who loses his glasses in that old story and can't read even though he has eternity "but there was time now..." or whatever.

  7. ownership by xzvf · · Score: 3, Interesting

    Wouldn't there be an issue here of selling another person's content? While everyone can view the content at will, copying that information to media and then reselling it, or even distributing it for free, would be an issue.

  8. What about important updates? by KenDodd · · Score: 5, Funny

    For example, where do we get the porn diffs?

    --
    Did you know my dad's dog died?
  9. Downloading Wikipedia Is Nothing New by RobotRunAmok · · Score: 2, Funny

    Been around since the early 90's. Back then it was called "fan fiction."

  10. How are they going to handle dynamic things.. by Idimmu+Xul · · Score: 3, Insightful

    e.g. searching? Having Wikipedia on your hdd is all well and good, but if you can't easily search it, what's the point?

    --
    The problem with slashdot is that most of its users were bullied and stuffed into lockers as kids!
  11. Screw the web searching business by mtrisk · · Score: 2, Funny

    They should be selling their compression technology!

    --

    Without a proper flamewar, Anonymous was undecided on what shell to run.
  12. 80 gig web? by hlh_nospam · · Score: 2, Insightful

    That would cover about 0.0000000001% of the web, give or take a few dozen orders of magitude.

    1. Re:80 gig web? by hlh_nospam · · Score: 4, Insightful
      No images and compression on the text would probably change that quite a bit.

      Not enough so's you'd notice. What's the difference between one thimbleful of ocean and 100 thimblefuls of ocean? Besides trying to solve the wrong problem to begin with?

    2. Re:80 gig web? by TubeSteak · · Score: 2, Insightful

      You make a good point, but there are ways to squeeze the size down. They aren't archiving the entire ocean, just the top layer, which is what 99% of people look at anyways.

      Compression, for one. Force people to use some proprietary browser (or a FireFox extension) and compress html files > xx KB so that the browser opens the archives on the fly. Zip up executables, pdf's, word documents, etc etc etc. Webservers & browsers use gzip to save bandwidth, why can't this archive use it to save space?

      Convert bitmaps into jpegs, recompress/resize jpegs greater than xx KB or some arbitrary height x width. (and make people pay more for uncompressed/resized images).

      Write up an automated tool to strip the html of links to ads while deleting the ad images/files too.

      That's just off the top of my head, but if anyone had 40GB of web pages to sift through, I'm sure they can come up with some other intelligent ways to save space.

      --
      [Fuck Beta]
      o0t!
  13. All of the Web on a laptop? by omeg · · Score: 5, Informative

    "The Internet Archive Wayback Machine contains approximately 1 petabyte of data and is currently growing at a rate of 20 terabytes per month. This eclipses the amount of text contained in the world's largest libraries, including the Library of Congress. If you tried to place the entire contents of the archive onto floppy disks (we don't recommend this!) and laid them end to end, it would stretch from New York, past Los Angeles, and halfway to Hawaii."

    Internet Archive Frequently Asked Questions

  14. Not just access by David+Hume · · Score: 2, Interesting
    FTFA:
    Which isn't to say that ever more ubiquitous 'Net connections won't pose a challenge to the Webaroo business model.

    "Long-term their opportunity may have more to do with [search] performance" than the offline capability itself, Enderle says.

    Husick tells me that performance benefit was reinforced for the company by a rousing reception their service received from Japanese mobile operators who he says were salivating over Webaroo as a means to siphon search traffic away from their increasingly crowded wireless broadband networks.

    Webaroo will also be touting the potential cost savings and convenience of its service.

    "Every hotel I go to wants to charge me $10 to $15 a night for Internet. Every airport wants to charge me another $10 to get connected," Husick says. "If I've got five minutes before I have to board my flight, do I want to spend that five minutes connecting or do I want to spend five minutes getting my search answer?"
    I still think this is a business scheme destined to fail. It may be a business plan that is designed to survive only long enough to cash out.
     
    I've got news for Husick. I'm a lawyer who have sets of Statutes, Court Rules and Local Rules behind his desk. I still look them up online to make sure I have the most recent version. I can't afford not to.
     
    Search performance? Rarely, if ever a problem.
     
    Siphon traffic away from "increasingly crowded broadband networks?" They make money from that traffic. They can't, if necessary, charge per data download? Tier the service by download bandwidth? Charge more? Build a better network?
     
    The first cell phone or wireless device that expects me pre-download some portion of the net, that portion being determined by somebody else, is the first one I can cross off my list.
     
    Save $5 or %10 at the airport by not connecting? What if I want to send or receive e-mail? Get the latest news, business or stock information? I'm AT AN AIRPORT, which implies I have some money, and in his context that I'm on business. I'm going to foregoe a net connection for $5 or $10? If my employer is that tight, I'm looking for another job anyway -- one that doesn't use Webaroos' services.

    This reminds me of software solutions to cramped hard drive spaces awhile back. On the fly file compression and expansion when data size was outstriping hard drive size for a short period of time. (Remember the file corruption.) Even though there was a market for those products, barely, everyone and his brother knew that market was going to go away Real Soon Now.
  15. Hm.. by Anonymous Coward · · Score: 5, Interesting

    Only if one of the webpacks is porn. Or better yet, if several are porn, cross referenced by type and participants.

    Though, my vaguely disturbing ramblings do raise an interesting point, maybe - what's their stance on the indecent materials that make up a good deal of teh webernet? When they say the "whole internet," do they MEAN goatse too?

    1. Re:Hm.. by AndroidCat · · Score: 5, Funny

      I think goatse would be under "hole internet".

      --
      One line blog. I hear that they're called Twitters now.
    2. Re:Hm.. by edbulldog · · Score: 5, Insightful

      The only way one can store the "whole internet" in a 80GB drive is to drop off the pr0n. I mean... besides the pr0n, everything else should fit in a 80GB drive, right?

    3. Re:Hm.. by Alex+P+Keaton+in+da · · Score: 2, Insightful

      Are you trying to be funny? 80 GB of PDF's out there? Buddy, there are departments of companies, not the whole company, that have more than 80 GB of PDFs available to the public on servers.... (sometimes limited public, i.e. customers, for owners manulas, docs etc....)

      --
      And All I Ask is a Tall Ship And a Star to Steer Her By
  16. Pr0n? by Dante+Shamest · · Score: 4, Funny

    Would the downloadable content include porn?

    Er, I'm asking this in order to, er, protect my girlfriend's sensibilities. Can't have her unwittingly downloading such naughty stuff you know. =)

    1. Re:Pr0n? by RoloDMonkey · · Score: 2, Insightful

      if(posts_to_slashdot && has_girlfriend)
        if(girlfriend.has_sensibilities)
          chance_of_lying = VERY_HIGH;
        else
          chance of lying = HIGH;

      --
      Long live the Speaker Bracelet
      Rolo D. Monkey
  17. Re:Surfing is only part of the web... by caffeination · · Score: 2, Insightful
    So your personal computing habits are the yardstick by which all IT products are to be measured?

    When your argument is based exclusively on your opinions and personal experience, global absolutes like "this idea is bad" come off as arrogance. Phrases like "this is useless to me" are more accurate.

  18. Oh my good god (plus bonus Dr Who joke)... by tyroneking · · Score: 2, Funny

    From the website "Webaroo is a stealth-mode technology startup" which obviously means something very clever ... personally I use WinHTTrack on a small number of sites, now if someone offered pre-downloaded WinHTTrack sites ...maybe to order ...
    Anyway, more importantly - Dr Who is due back on UK TV soon I think (slightly disappointing end to last series - shame to to see Chris E leave) so here's a joke that Webaroo might like to to 'cache' ... "What do Daleks have for a snack? ...
    Dalek bread..." geddit? (thanks to a kids radio show for that one).

  19. Re:I won't be doing that one... by Jetekus · · Score: 2, Funny

    I've heard that a technology for seeing news offline already exists. It's very cheap, disposable (so don't worry if you leave it on the train) and can even keep you dry for a short period of time if it starts to rain. What's more - it's made from trees! How clever is that!!!

  20. html has never cared where data comes from by Glowing+Fish · · Score: 2, Informative

    This actually isn't by any means a new idea.

    If you've ever written or read html, you know that html doesn't care if links start file:// or if they start html://. HTML has always been quite neutral on whether it was linking to a local file system or getting something over the internet. Of course, most people don't use html extensively for local content. So in theory, this isn't a new idea at all.

    In practice, I don't see a lot of points for it. I can imagine that some people might want a map of a new city, with clickable pictures and informations about various services there. Most features of a city map are going to stay the same for at least six months, so this is the type of thing that could be done staticly. But even with this, internet access is so widespread, that it seems like a solution for a minor problem. Also, if you want a handy city guide, it would make more sense to me to write it from scratch rather than use a cludge of cached web pages.

    --
    Hopefully I didn't put any [] around my words.
  21. Cache exemption by tepples · · Score: 4, Informative

    Technically, they make a copy and the ISP doesn't.

    Isn't the ephemeral copy in the RAM of a router still a copy? And don't operators of automated caches have a fairly broad exemption under United States copyright law, 17 USC 512(b)?

    1. Re:Cache exemption by ammoQ · · Score: 2, Insightful

      It doesn't seem likely to me that those Webaroo guys will be able to fullfill the conditions of (2), especially (C) and (E). The cache exemption is obviously targeted towards _online_ caches. This makes sense, IMO.

  22. Is this even legal? by sunwolf · · Score: 3, Insightful

    How are they to justify selling other peoples' websites? What about the sites' lost ad revenues?

  23. Could just work... by ELProphet · · Score: 3, Interesting

    But not in the way they think. TFA mentions two points, but doesn't explore them in depth. The first is their algorithms they use; let's face it, Google is starting to fall to the SEOs. If they have a new algorithm that was able to actually follow your web browsing all the way, they'd be able to provide much better results. Google claims to do this, but they can't follow you more than your first link. Second, they seem to pick up that most people find their entire information on the second or think link they visit.

    Combine these together, and the program could offer you 80 gigs of data to just sit on your computer and be sifted through at yuor leisure. It would be able to follow you through, and find exactly how you get through your data. When it needs to, it can spider into areas that it might think you'd want to go (Been looking at a lok of Wikipedia? Next time you connect, it goes an picks up some wikibooks).

    The best part, is that all the "Big Brother" information is being stored on YOUR computer, not their servers. You want that info, Bush? You'll have to supoena every user.

    If they tergeted this more towards a desktop-search type thing with better search algos than Google, this could just work.

  24. a nice service is possible. by twitter · · Score: 4, Interesting
    The wayback machine's terrabytes of data is what this really takes. Keeping it up to date is another story.

    Archives are good and this can be a useful service. Providing 80 select gigs on a hard drive to libraries and schools is a useful until US networks get where they should be. Their software can keep those 80 GB up to snuff at night. When you leave the cache, you ... gasp ... get the new content. In the mean time, things are much faster when it matters. Mirrored content will always be a good idea. Look at the debian distribution system, for example.

    Good luck to the people at Webaroo. So long as they don't apply for stupid patents that give them an exclusive franchise to distribution systems, they are AOK.

    The road warrior thing will flop, though. People are going to stay where there's a network or pay the $10. It's the one piece of live information that requires the hook up. The speed of the rest is gravy for those people.

    --

    Friends don't help friends install M$ junk.

  25. The Net on a disk is not a net by gihan_ripper · · Score: 2, Insightful

    Even if this is doable and legal, it runs entirely counter to the spirit of the Internet. The Internet on a hard disk is no longer a network, it becomes a passive entity with no possibility of interaction.

    At the moment, we are seeing a return to the interactive origins of the Internet, prime examples being blogging, Wikipedia, and even Slashdot! If this projects takes off it will be harmful to interaction and will turn the Net into a glorified television.

    However, I find it unlikely that Webaroo will gain currency, precisely because we have become dependent on an interactive and living Internet. When I use the Net, I want to be able to read and respond to my emails, to check my bank balance, shop online, and read the latest news. Why on earth would I want to have a static Internet on my laptop?

    --
    Phoenix, Boston, Little Rock, see a pattern?
    1. Re:The Net on a disk is not a net by fishizzle · · Score: 2, Interesting

      Navigate to the page that interests you to have available offline, select "Favorites" -> "Add to Favorites..." -> Check "Make Available Offline" -> Click the "Customize..." button -> Choose how many pages deep you want to download.

      That's for IE, but I'm sure most competing browsers have the ability.

  26. Re:Transoceanic flights? by Theatetus · · Score: 2, Insightful
    So what should the transatlantic and transpacific frequent fliers use?

    wget (while they're waiting in the airport).

    --
    All's true that is mistrusted
  27. Re:skewed world by onebecoming · · Score: 2, Interesting

    Five? You must live in the sticks. 162 Starbucks within five miles of me right now, according to the store locator... not that there aren't other places I'd rather be.

  28. feh by andreyw · · Score: 3, Interesting

    Frankly, I could see a market for this *maybe* 10-12 years ago. It just doesn't make any sense now. The internet is not solely about static content. Also, the thimble of data provided in each pack will be underwhelming and perpetually out of date.

    I mean, if I know I won't be online for a week, what stops me from just CURLing or WGETing whatever I plan on reading for the next couple of weeks? And that goes only for static content like books and articles. Everything else is cannot be simply cached.

  29. Re:Transoceanic flights? by timmyf2371 · · Score: 3, Informative
    I always fly Lufthansa whenever travelling trans-atlantic, providing you're willing to pay the WiFi premium, you get WiFi internet access for the duration of the flight.

    For shorter flights within the UK and Europe, it's safe to say I can cope without internet access for two hours.

    --

    Backup not found: (A)bort (R)etry (P)anic