White House Website Limits Iraq-Related Crawling
oscarcar writes "Dan Gillmor is reporting on the White House website's use of its robots.txt file to disable search engines from crawling certain material. Many excluded items in the robots.txt file involve mentions of Iraq, possibly to prevent people from finding changes to past statements and information when archived elsewhere."
whitehouse.com doesn't have that problem.
sulli
RTFJ.
it's good to see the whitehouse embracing technology so much.
!(^((ri)|(mp))aa$)
I have to admit, when I first read the story I thought someone was being paranoid. But you really should RTF robots.txt file before you accuse the poster of being paranoid. The disallowed files are extraordinarily specific. I really can't come up with a plausible explanation beyond simoniker's.
Obviously, they're keeping people from accessing the top-secret teeball Iraq files ! Besides:
check out these other frightening examples of censorship:Truly frightening.
Computer Go: Writing Software to Play the Ancient Game of Go
welcome our White House Robot Overlords. It would be funnier if it weren't true.
- - - If the sun is a star, why can't I see it at night?
Looks like they removed a bunch of files where they were making claims that Saddam was behind 9/11. One could be lead to suspect that now that Bush got his war his doesn't need that lie anymore, and wants to erase all history of it since it undermines his authority.
Peace, or Not?
Downloading the "robot.txt" file and doing a quick ctrl-f on different words, I discovered that there are six instances of "Barney" coming up in the robot.txt:
/holiday/2002/barney/iraq /holiday/2002/barney/text /kids/barney/iraq /kids/barney/text /kids/photoessays/barney/iraq /kids/photoessays/barney/text
Disallow:
Disallow:
Disallow:
Disallow:
Disallow:
Disallow:
Which is the same number as "cheney", "powell" had 4, "saddam" didn't have any and "bush" only comes up with "bushpets".
Clearly, there is something to do with Barney and Iraq that The White House doesn't want you to know about.
myke
Mimetics Inc. Twitter
It really doesn't look like it. It looks like someone screwed up, because none of those directories appear to exist at all. I mean really, what are the chances of /firstlady/photos/2003/01/iraq actually having at some time contained real data?
It looks like someone did a
find . -type d|perl -e 'while(<>){print "${_}/iraq\n"; print "${_}/text\n";}' > robots.txt
I have no idea what the purpose would be, but it seems like a funny thing to do if you were trying to hide something.
By the way, who is going around looking at people's robots.txt files?
Engineering and the Ultimate
How can they be Iraq related if they didn't exsist to begin with?
:-)
A question that GW gets asked all the time.
Karma: Chevy Kavalierma.
http://www.bway.net/~keith/whrobots/disdirs.html And, yes these files *are* relevant.
Melius mori in libertate quam vivere in servitute.