Slashdot Mirror


Online Search Engines Lift Cover Of Privacy

Rican writes "MSNBC has an interesting article about how 'Googledorks' are using the powerful search engine to do searches across the web for sensitive and/or private information. Some of this information includes 'Medical records, bank account numbers, students' grades, and the docking locations of 804 U.S. Navy ships, submarines and destroyers.'"

43 of 460 comments (clear)

  1. Um. by Anonymous Coward · · Score: 5, Insightful

    While googlestalking is scary and bad and I'm not condoning it, in this *specific* case, if the docking locations of U.S. naval ships is something that they do not want made public perhaps they should simply not make them public?

    1. Re:Um. by Anonymous Coward · · Score: 5, Insightful

      Maybe you should use some kind of security instead of just really -hoping- no one crawls/reads/caches your document.

    2. Re:Um. by ecalkin · · Score: 5, Insightful

      documents that should not be available to the general public should be a) behind firewalls where the general public is on the other side, b) stored on web servers that require authentication to read such pages (where the general public does not have username/password), or c) not be stored on a web server!

      i think that this is somewhat an issues of bad management and somewhat (maybe more) and issue of the weakness of web service security (compared to something like local novell services).

      eric

    3. Re:Um. by pla · · Score: 5, Insightful

      Let's say you put a page on your site
      <snip>
      And it is not linked to ever.

      Then you have still put it in a publically accessible place, and bear full blame for others finding it.

      For a physical-world analogy, let's say that you want to give a note to a friend (which, for some reason, requires a non-conventional mode of delivery). You could leave it at page 416 of "The complete minutes of the Town of Dullsville, 1853 to 1862", which no one had checked out in the past 30 years. Tell your friend where to find it, and 999 times out of 1000, you'd have no problems.

      If you one day used that same method of sending a note, only to discover someone checked out the book and removed the note, would you actually have the gall to blame anyone but yourself?


      Slashdotters, of all people, have heard this over and over and over... Security through obscurity may help in addition to some form of "real" security, but it almost never works by itself. The web counts as a very public place. If you place sensitive information on it with no security beyond a "hidden" URL, don't act surprised when the NYT has it as a headline the next week.

      And for reference, yeah, I too have stuck random files up on my site for a friend to grab. But never when it would have mattered if someone else randomly found those files.

    4. Re:Um. by Elwood+P+Dowd · · Score: 3, Insightful

      It's worth a damn if we're talking about Google or archive.org.

      No, it's not worth a damn if you're talking about actually sensitive data.

      --

      There are no trails. There are no trees out here.
    5. Re:Um. by AstroDrabb · · Score: 2, Insightful
      If the information is not meant to be public, then it should not be on a publicly addressable server.

      Where I work we have a few servers that are addressable from the internet in a DMZ. Everthing else is untouchable, so the Opera trick doesn't work. The next block we have is that we use Netegrity for corporate wide single-sign-on. Every non-public webserver has a Netegrity client installed. To get any document, you need to first authenticate against the Netegrity policy server over SSL.

      There is also the robots.txt file that google will honor, so there is no reason to block google's bots with your firewall.

      Bottom line, is not to put sensitive information on a server that is addressable from the internet. Keep it all on your local network and force users to VPN in if they need to get to that information.

      --
      If Tyranny and Oppression come to this land,
      it will be in the guise of fighting a foreign enemy. -James Madison
    6. Re:Um. by MrNybbles · · Score: 2, Insightful
      Who's to blame for this then? not you.

      Actually I would blame the person who put private information on the Internet. Even with no obvious way for anyone to know it exists, it is still unprotected and out in the open.


      I have found hidden files in directories by looking at the location of images and looking in those directories. Those directories and some of the files were not linked to anywere. They were not private although the person was surprised when I asked about them.


      My philosophy on security: If you security settings are not set to paranoid, they are set to low.

      --
      Losing faith in humanity one person at a time.
    7. Re:Um. by qtp · · Score: 2, Insightful

      which is what .htaccess is for, but I guess you've never configured a webserver.

      There's a lot of this going around lately, whether we're talking webservers or configuring sendmail: a lot of folks with their shiny new CS degrees telling the rest of us that our tools are broken and asking us to trust Mr. Bill to set us straight. I'd be a lot more confident with their advice if they would at least give the impression that they had ever configured the tools they are so ready to throw aside the tools they say are broken.

      --
      Read, L
    8. Re:Um. by Jeremi · · Score: 2, Insightful
      Who's to blame for this then? not you. You've already ensured you hadn't linked to it.


      Absolutely you, because you assumed that not linking to a document would make it private. Bad assumption. Even without Opera's "feature", someone could stumble upon the proper URL by blind luck, or as part of a dictionary attack, or by sniffing HTTP header traffic.


      If you want to keep something private, don't put it on a public web site. Period.

      --


      I don't care if it's 90,000 hectares. That lake was not my doing.
  2. Hardc0re hax0r. by monstroyer · · Score: 2, Insightful

    That googledorsk link... You're telling me if i put the word "googledorks" on my website and wait a few months i will be one because it appears in a google search?

    Is googledorks a real hacker movement or just some random key word any one with a high ranking web page can abuse?

    On another note, the best thing i found that was supposed to be hidden was with the query "quality hentai" This was last year. It has since been secured (by being taken offline).

    What have you found?

  3. Cover of "Privacy" by mobiGeek · · Score: 5, Insightful
    What "privacy"? The information is posted on the WORLD WIDE Web...

    --

    ...Beware the IDEs of Microsoft...

    1. Re:Cover of "Privacy" by IntelliTubbie · · Score: 4, Insightful

      What "privacy"? The information is posted on the WORLD WIDE Web...

      Perhaps a more accurate title would have been "Online Search Engines Remove Delusion Of Privacy."

      Cheers,
      IT

      --

      Power corrupts. PowerPoint corrupts absolutely.

  4. Why Google? by lostchicken · · Score: 4, Insightful

    Why do people always have to drag Google into this sort of thing? Somewhere, someone is pissed off at Google for putting their medical records on the web, and letting people get at them, when they should be angry at the people who posted them to the web in the first place. It's like calling Southwest Bell your partner in crime because you used DSL to steal from an online bank. It just makes SWBell look bad, just as this makes Google look bad.

    --
    -twb
    1. Re:Why Google? by flewp · · Score: 1, Insightful

      Brcause they can? Yes, yes, they shouldn't, but they do.

      Never underestimate the power of blaming someone/thing else instead of your own actions. (Or inaction)

      --
      WWJD.... for a Klondike bar?
    2. Re:Why Google? by agentZ · · Score: 4, Insightful

      Google is a tool, and tools can be used for good or for bad.

    3. Re:Why Google? by lostchicken · · Score: 2, Insightful

      Google is not a "tool" in this sense. A hammer is a tool. I can kill someone with a hammer. The internet is a tool. However, the guy at Sears who tells me where to buy a hammer is not a "tool" (well, he may be, but that's a different kind of tool). He can't be used for good or evil. He can tell me where to find a hammer, which I can then use for good or evil.

      --
      -twb
    4. Re:Why Google? by Simon+Garlick · · Score: 2, Insightful

      You did notice who's publishing the Google-bashing article, didn't you?

  5. Hard to hide by BWJones · · Score: 4, Insightful

    This all brings up one of the central tenets of computer network security: If it is connected to the Internet, it can be accessed, and sometimes the probing computers that are looking leave their little IP footprints all over the place. For instance, I was rather surprised a couple of years ago watching some IP's scroll through while someone/a software bot was accessing my workstation. Whois revealed nothing, but traceroute revealed an IP that allowed me to do a little more poking around to find out the identity as something from a "Special Collections Service" in Maryland. A little more poking around revealed it to be something involving a state department program whereupon I rather quickly decided to stop investigating. I still don't know anything about them or what they do, but it is surprising how hard it can be to be anonymous on the web. Hey, I am sure even all those Slashdot anonymous coward posters are leaving IP's that can and are documented. :-)

    --
    Visit Jonesblog and say hello.
  6. web servers for morons by belmolis · · Score: 5, Insightful

    The real story here is that companies and other organizations and institutions are setting machines up as servers and are too stupid to create an appropriate robots.txt file and/or keep their confidential information elsewhere. Google doesn't just drop in, even on networked machines. I have some sympathy for individuals who don't understand what they are doing when they make their machine a server, but surely any professional sysadmin, even one with limited training and experience, should know better than this. It's the same as leaving your briefcase on the front seat of an unlocked car.

  7. so who owns it, how can we stop it? by HealYourChurchWebSit · · Score: 5, Insightful



    Part of this problem comes out of who owns the daggoned data. For example, let's say a hospital, instead of using clipboards, uses smartcards to hocket about patient records.

    Who own's the data. The hospital, the insurance company paying the bill, or the poor schmuck on the business end of a colonoscopy?

    I ask because without the indiviual having the write to own the data, there seems to me little that can be done to protect oneself other than go through expensive and tedious legal channels.

    And if someone else can own sensitive data about me, then what can we do, as private citizens with limited resources, to make sure larger entities such as insurance companies play by rules like HIPPA?

    --
    --- have you healed your church website?
  8. Interesting Nugget by Slavinski · · Score: 2, Insightful
    What I found interesting was this portion of the
    article:

    Since 2001, the FTC has settled cases with Eli
    Lilly & Co., Microsoft Corp. and clothing maker
    Guess Inc. for not taking "reasonable" measures
    to keep medical or financial information
    secure, said Jessica Rich, assistant
    director of the commission's bureau of consumer
    protection. Letting customer information
    reside on an unsecure server can open
    up a business to such liability.

  9. Geez by Wolfier · · Score: 4, Insightful

    If your information is "sensitive" or "private", do yourself a favor and don't put it on the web.

    Peeps nowadays...

  10. Re:Google threatens privacy and national security by JanneM · · Score: 4, Insightful

    Shouldn't Google take precautions to make sure that sensitive data doesn't fall into the wrong hands?

    No, they should not. They are not in a position to know what _is_ sensitive - and to whom. They can reasonably only assume that anything reachable with an ordinary, polite spider is meant to be accessible to the world at large. If you feel certain information should not be made accessible, bring it up with those actually making it accessible, not with those just indexing it once it is.

    Shooting the messenger is not just pointless, it is counterproductive.

    --
    Trust the Computer. The Computer is your friend.
  11. Re:Google threatens privacy and national security by Concerned+Onlooker · · Score: 4, Insightful
    Sensitive data? Just because it's found through Google online doesn't make it any more sensitive or useful for terrorists. You can walk into any aviation bookstore and buy sectionals for the whole country, and they've got a lot more info than some MapBlast gif file.

    --
    http://www.rootstrikers.org/
  12. docking locations of 804 ships? by usn2fsu03 · · Score: 5, Insightful
    That's more than twice the number of ships currently in service.

    Also, these are not precise locations. Yeah, you can find that the USS Roosevelt (DDG-80) is homeported in Mayport, Florida but you're not going to find the precise pier number.

    As for ships on deployment, one can find their general locations just by looking at the latest issue of the Navy Times and by reading the newspaper of the town that the ship and its battlegroup are from.

    The Navy really tightened up on what get's posted on official ship's websites after 9/11. If there is sensitive information still out there, Google is not at fault, but rather the unit's webmaster, Commanding Officer, and the Operational Security people who are supposed to be looking out for that sort of thing.

  13. This could be earth shattering for google? by saskboy · · Score: 2, Insightful

    Imagine if the US government gets in its head that search engines are a terrorist tool?

    Wouldn't that be interesting?

    --
    Saskboy's blog is good. 9 out of 10 dentists agree.
  14. Fuck that shit by Anonymous Coward · · Score: 4, Insightful

    Maybe they should just use the fricking robots.txt protocol. That's what it's *FOR*. You can put a little file named robots.txt in the directory you want hidden, put text in it that says "i want this hidden, google", and google will ignore your directory forevermore.

    No one has any right to complain if their page is in a search engine unless they followed the robots.txt protocol and the search engine did not.

    1. Re:Fuck that shit by Anonymous Coward · · Score: 3, Insightful

      The problem with this is, anybody can now download your robots.txt and have a list of your unprotected sensitive data.

    2. Re:Fuck that shit by Senior+Frac · · Score: 4, Insightful

      Not if the robots.txt file prevents you from accessing that data, which it does.

      The robots.txt file prevents nothing. It's merely a request that the spider "not go here." It's not a lock on the door. It's a sign that says, "please do not enter my house."

  15. Re:Nothing new by Ivan+the+Terrible · · Score: 4, Insightful

    If Bill Gates is using the same SS # that was leaked in 1995, then he is a total moron. He is not a moron. Therefore he is not using the same SS # that was leaked in 1995. QED

  16. old skool trick by shird · · Score: 4, Insightful

    An old trick I used to do was searching for something along the lines of

    "http://*:*@" member

    and you would get a bunch of sites with direct links into passworded member sites. Microsoft will put a stop to this with their latest update to IE however.

    --
    I.O.U One Sig.
  17. Re:Nothings private by MrNybbles · · Score: 5, Insightful
    Am I just another cynical bastard?
    Yes, you are a cynical bastard, and the world needs more of you.

    And on a totally unrelated thought. . .

    Online search engines lift cover of privacy
    Is Yuki Noguchi on crack? Google does not do anything to privacy. All Google does is make it easier to find publicly available information. Maybe "Online search engines act as a catalyst to find private information" would be more a accurate title. ". . .cover of privacy" makes it sound like it was protected in the first place.
    --
    Losing faith in humanity one person at a time.
  18. Re:I've heard of "cow orkers"... by jridley · · Score: 4, Insightful

    OT:
    How come Homer and Krusty look like clones?

    It's intentional. MG originally intended it to be a joke; Bart didn't respect his dad, but he worshiped a clown who looked exactly like his dad. He mentioned this on an NPR interview last week.

  19. Primary issue is the historical data problem by xant · · Score: 3, Insightful

    Google and the wayback machine, respectively, have memories. Just because you take something off the web doesn't mean it can't be found by those services; it just means it won't respond to your browser's request. Cached results and so forth are dangerous. If there ever was leaked data about the locations of those ships, it can still probably be found somewhere, and if that information hasn't changed since it was taken off the web, it's still a problem.

    This applies to any information that's ever been stored electronically; I call it the "backup tape problem". Someday, that information may (will?) find its way online, a public service will index it, and the genie will be out of the bottle forever.

    --
    It's rare that you're presented with a knob whose only two positions are Make History and Flee Your Glorious Destiny.
  20. Re:wait... by Elwood+P+Dowd · · Score: 2, Insightful

    Right. And then you can complain about that site doing the cross linking, or you can think about putting access controls on that sensitive document that you've put on the world-readable public internet.

    --

    There are no trails. There are no trees out here.
  21. Re:Nothings private by Anonymous Coward · · Score: 2, Insightful

    On the one hand, yes, the information was already out there to be found and sorted. On the other, however, things like Google take such information and make them available to anyone (which they were before) in an incredibly easy-to-use form (which it was most certainly not previously). Say I want to know information about Al G. Trenton, the G is for Greerson. This person went to XYZ school back in high school and held ZYX position. Do a few searches for common phrases, you get some information maybe from a private web side or a town newspaper. Maybe you get parents' names or the name of a college or something; boom, you have more information to refine your search. This once would have taken days, maybe weeks, and preferably a trained professional; now it takes hours, and is easily done by a moderately bright and creative individual.

  22. Re:wait... by djupedal · · Score: 4, Insightful

    Or you realize that putting something on the internet means that it is no longer private..... regardless of how stupid it is to say that google will leave it alone if you just ask..

  23. Good! by ottffssent · · Score: 5, Insightful

    Hopefully this sort of flagrant violation will draw at least a modicum of public attention.

    This isn't some hardened criminal mastermind at work. It's not a seasoned cracker attacking military targets. This isn't even some script kiddie poking at IIS. It's a MACHINE. A machine that respects robots.txt for Eris' sake!

    If medical records and other "real" secrets are this visible, something is terribly wrong and I want to see public floggings. Seriously, this is not a case of weak security, or poor security, or incompetent security. It's a case of there not being so much as a screen door between the public and sensitive information.

    This is actually a case where I think the government (or at least the courts) can do some good. You'll notice banks don't get hacked on a daily basis. That's because they'd lose squintillions of dollars if it happened. But nobody cares about my medical records because it costs money not to have incompetent asses running things. On the other hand, if revealing to without were punishible by a $1000 fine per person, per offense, you'd notice a severe tightening of security in a mighty big hurry.

    It's a shame that suing people is sometimes the only way to get their attention, but with the decline of basic civil responsibility it might be inevitable.

  24. Nothings private unless (unless nobody else knows) by MrNybbles · · Score: 2, Insightful
    I have actually looked up an old girlfriend that way. Her parents had their own business and a website so it was easy. The good news is that she graduated college. Using betterwhois.com I was able to get an address. That took 10 to 15 minutes and I did that a few years ago. I guess it's a good thing that I am not a bad person. ** evil grin **

    I say why blame just one person/group/entity. Let's blame the people who publicly post the personal information AND the people who use that information to hurt people. But let's not blame Google or any other search engine for doing too good of a job.

    --
    Losing faith in humanity one person at a time.
  25. Re:Could happen to you by Anonymous Coward · · Score: 1, Insightful

    Is it bad? Lots of shops etc know my CC#. Thats the way it works.

    Ur protected from losses

  26. Mod down please. by jotaeleemeese · · Score: 1, Insightful

    It is not insightful, it is trool, stupid.

    robots.txt is a polite request not to do something.

    Of course rogue people will not even notice this or will use it to their advantage.

    --
    IANAL but write like a drunk one.
  27. This is *not* Hacking? by DeanFox · · Score: 4, Insightful

    I know this is very late in the discussion.

    But, if I wander into an unprotected system, like a bank or military site, and I start reading confidential documents... Is this not a crime?

    What's the difference if I locate the unprotected documents via a search engine or by using a port scanner with an IP range.

    I think what I'm saying is that port scanning and finding an vunerable system, going into that system and looking around is now a crime.

    But didn't I just describe what's going on with google hacking?

    I don't advocate nor believe any of this is a crime but where and why is a line drawn between them?

    I've often said about hacking that just because I go to the market and forget to lock my front door, that doesn't mean I expect to come home and find someone rumaging through my house.

    If it's an administrator who forgets to lock down a port or one how inadvertantly places confidential materal on the wrong box... Again, Where is the line and how is it drawn, and why, between criminal hacking and "it's on an open system, google found it so it's legal".

    I'm just asking. It's early in the AM and my brain isn't working because it's not seeing the difference. I'm only seeing a very fine line between what one might consider a "public" system versus one that expected to be "private". Is the only difference our "expectation" of privacy that makes one illegal and another a sport?

    1. Re:This is *not* Hacking? by poot_rootbeer · · Score: 2, Insightful

      Consider the analogy of not locking your door and then coming home to find someone rummaging through your house.

      In most of the cases referenced in this article, the sites hosting the sensitive data didn't just leave their doors unlocked, they brought the data outside and dumped it on the curb. If you're walking by and see something worth salvaging in what for all purposes appears to be someone's trash, do you consider it illegal to pick it up and take it with you?