Slashdot Mirror


98% of DNS Queries at the Root Level are Unnecessary

LEPP writes "Scientists at the San Diego Supercomputer Centerfound that 98% of the DNS queries at the root level are unnecessary. This doesn't even take into account the 99.9% of web pages suck or are unnecessary anyways. This means that the remaining 2% of necessary DNS queries are probably not necessary either."

96 of 426 comments (clear)

  1. In other news by Anonymous Coward · · Score: 5, Funny

    99% of slashdot posts are unnecessary.

    1. Re:In other news by apdt · · Score: 3, Funny

      50% of /. posts are duplicates

      --
      I lay awake last night wondering where the sun had gone, then it dawned on me.
    2. Re:In other news by heff · · Score: 2, Funny

      50% of /. posts are duplicates

      --

      --

      |-_-| . o O ( bEef!)

  2. So... they looked at one server.. by Anonymous Coward · · Score: 2, Interesting

    And they assumed the other 12 were exactly the same? Wouldn't looking at 2 at least be merited?

  3. AOL by almeida · · Score: 5, Interesting

    On a similar note, I noticed that AOL causes a lot of DNS lookups. From what I can see from my firewall logs, each TCP connection from an AOL user is handled by a separate proxy. Each proxy then does its own lookup on the host. So, for a normal sized webpage with some images or whatever, you get like 10 TCP connections for the content and 10 UDP connections for the DNS lookup. Seems kind of excessive to me.

    1. Re:AOL by cyb97 · · Score: 5, Interesting
      AOL always screws up webpage statistics (which I guess can be a good thing as the only dufuzes that really really care about statistics are marketers?)...

      I can't count the number of times I've seen a massive spike in number of "unique visitors" just to look at the hosts and find *.proxy.aol.com filling the whole thing....

    2. Re:AOL by micromoog · · Score: 3, Interesting

      Hmmmmmmm, I wonder if this could even constiture fraud. If web publishers believe a larger number of AOL'ers are visiting their site than actually do, wouldn't they be inclined to pay more for adverts on AOL's portal?

    3. Re:AOL by toast0 · · Score: 3, Informative

      i doubt it.

      it is common knowledge that aolusers come through aol's proxies, and the proxy hostnames contain proxy in them, so it should be fairly obvious

      also, anybody who is running web statistics should know the following things:
      1) web statistics are inaccurate
      2) proxies screw up web statistics
      3) not all proxies are visible
      4) refer to 1 and 2

    4. Re:AOL by Goodbyte · · Score: 2, Informative

      Maybe so, but all requests should go to a dns-server at AOL which will cache the results. So if all users make a request for a domain in the same top-domain, there should still only be one request to the root-server.

    5. Re:AOL by shoppa · · Score: 2, Interesting
      First of all, it's mostly a given that AOL's name service is going to suck rocks. But the way you describe is the opposite of the problem they had a few years ago:
      • AOL would cache DNS lookups for much much longer than the expire time
      • Sometimes this cache would live on for weeks past expire time
      Maybe the current situation is an over-reaction to the bad effects caused by the previous screwups.
    6. Re:AOL by HD+Webdev · · Score: 2, Interesting

      Not really. AOL isn't doing it for any Evil Intent.

      Usually, the most important data is the first page hit. WHICH PAGE/SITE REFERRED THE PERSON TO THIS WEB SITE? In most cases, where the person is connecting from is not nearly important as where they found the link.

      An ecommerce example: When showing site statistics, I advise my ecommerce clients to put their money in the referring sites that yield the highest 'bought a product' ratio.*

      Once in a while, the client will be awed by the AOL total hits statistics and want to put money there. I then explain that they will most likely increase their bandwidth use with little return and have to pay AOL for the privilege.

      A site that depends on banner ads example:

      Put money in referrer sites where the referred person viewed the most pages AND clicked the most ads per person. Accurate statistics for that are easy with PHP scripting (or your language of choice). Bonus points for using a script that counts returning visitors and compares that to where they were originally referred from.

      * It has crossed my mind that I could be mean/funny and generate 'how many attempts it takes an AOL user to fill out a form correctly' statistics.

      --
      This is not a dream, not a dream...we are transmitting from the year 1-9-9-9.
    7. Re:AOL by TheTomcat · · Score: 2, Interesting

      AOL is not alone in this. I've seen many (largish) ISPs ignore DNS TTLs. Makes switching IPs a disaster.

      S

    8. Re:AOL by TheTomcat · · Score: 4, Insightful

      This is probably a cyclic argument.

      As an ISP, why not respect a TTL? Because many DNS zones set their TTL values too small (5 mins), when 24 hours, would accomplish the same thing (except in rare circumstances -- if you're planning on moving, set it low a week before, do the move, reset to high ttl).

      As a DNS administrator, it's a pain to keep changing your TTL and the ISPs don't respect it anyway.

      It's useless to have a low TTL because the ISPs generally don't respect it because it's generally set too low because the ISPs don't generally respect it because....

      S

  4. DNS queries are for lamers by Anonymous Coward · · Score: 2, Funny

    Real man know IPs.

    1. Re:DNS queries are for lamers by bbh · · Score: 3, Funny

      I just decided to put the entire Internet into my hosts file.

      bbh

    2. Re:DNS queries are for lamers by jovlinger · · Score: 3, Interesting

      ya know, that's not impossible these days.
      What with the private subnets you can't get to, and coorporations buying up whole class IP blocks, you're not going to need to map every single IP to a set of names.

      Say you need to map 2**30 names. Give each name 256 bytes to list the hosts using that ip. You've just used 256GB. Alot, yes, but I'm willing to bet at least one person reading this has that much storage dedicated to MP3s.

    3. Re:DNS queries are for lamers by buttahead · · Score: 2, Funny

      i do daily backup of archie to a series of paper cups organized in such a manner as to readable by low flying planes.

    4. Re:DNS queries are for lamers by digitalsushi · · Score: 2, Interesting

      Wow crazy. I thought you just made that number up, but if I make a text file with "xxx.xxx.xxx.xxx" in it, find out how big that is, and multiply times the number of hosts in ipv4, I get 268.4 gigabytes. Very interesting. ipv6 is gonna keep that dream a dream.

      --
      slashdot: where everyone yells sarcastic metaphors to themselves to understand the issue
    5. Re:DNS queries are for lamers by jovlinger · · Score: 2, Informative

      I mistyped -- made sense to me at the time, but not what I meant upon rereading it. I wrote:
      > Say you need to map 2**30 names. Give each name 256 bytes to list the hosts using that ip

      I meant to write:
      Say you need to map 2**30 addresses. Give each address 256 bytes to list the host names using that ip

      ok. So 256 is an arbitrary limit. However, assuming that most addresses wont use all 256 bytes, you should be able to borrow some bytes from lines that aren't using 'em. The whole thing was meant as a ballpark estimate. I made up the assumption that only 1 IP in 4 is named, too. Give it a few years, and I won't need to make these assumptions either.

  5. .elvis? by Anonymous Coward · · Score: 2, Funny

    That's a thought! But we'd have to create servers for .vim, .pico, and .emacs as well...

    1. Re:.elvis? by bpfinn · · Score: 2, Funny

      RMS would prefer that you use .nano instead of .pico.

    2. Re:.elvis? by nelsonal · · Score: 2, Funny

      RMS also prefers that you use .GNU/nano over pico.

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
    3. Re:.elvis? by JimDabell · · Score: 2, Funny

      Yes, but would sites in .emacs only be available to broadband users?

    4. Re:.elvis? by Dannon · · Score: 2, Funny

      Now there's an idea! Using "Mork" as a pseudonym, I'll register the domain na.GNU/nano!

      --
      Good judgment comes from experience.
      Experience comes from bad judgment.
  6. Good thinking, Sparky. by grub · · Score: 2, Funny


    98% of the DNS queries at the root level are unnecessary. [...] This means that the remaining 2% of necessary DNS queries are probably not necessary either."

    Uhh... right, eliminate 100% of the root queries, they aren't needed..

    sheeeesh..

    --
    Trolling is a art,
  7. Badly written by matthew.thompson · · Score: 2, Insightful
    The following quote seems badly written to me...
    About 12 percent of the queries received by the root server on Oct. 4, were for nonexistent top-level domains, such as ".elvis", ".corp", and ".localhost". Registered top-level domains include country codes such as ".au" for Australia, ".jp" for Japan, or ".us" for the United States, as well as generic domains such as ".com", ".net", and ".edu". In addition, 7 percent of all the queries already contained an IP address instead of a host name, which made the job of mapping it to an IP address irrelevant.
    Reading through it takes a couple of attempts to realise that they're not classing the ccTLDs and gTLDs as in the 12 percent of nonexistent TLDs but they're providing them as examples of what is a real domain - yet they take more of the paragraph to do this than to explain the nonexistent TLDs.

    Just my 2p/2 worth.
    --
    Matt Thompson - Actuality - Insert product here.
  8. Why... by jascat · · Score: 5, Insightful

    is it that hard to configure a firewall to explicitly allow outgoing traffic rather than allow all? It seems that everyone thinks that the only bad traffic is the stuff coming in from the outside...

    1. Re:Why... by PhxBlue · · Score: 4, Informative

      Excellent point, and I hope whomever has modpoints today will mod the parent up. Your PC is a sieve of information even with nothing more than a web browser and E-mail client. When you install IM applications or, gods forbid, file-sharing applications like KaZaa, the sieve becomes a fount.

      I've made a couple other posts regarding this in the past week or so, to point out that most applications don't need access to port 80, for example. E-mail doesn't need it, and IM programs certainly don't need it. ICQ uses a port in the 400 range somewhere, IIRC, for its message traffic; but it uses port 80 to report usage statistics to Mirabilis and to download banners. So does it really need port 80? Nope--you can save yourself bandwidth and gain privacy by blocking it.

      The list goes on, of course; but my biggest gain from firewalling my PC has been the freedom to restrict outgoing traffic.

      --
      !#@%*)anks for hanging up the phone, dear.
  9. No wonder these servers have so many problems by PiGuy · · Score: 5, Funny

    It's no wonder these servers have so many problems - there's thirteen of them! They need a lucky #14 - a Bilbo Baggins for their horde of dwarves. That'll stop those DoS attacks and unnecessary requests right away!

    1. Re:No wonder these servers have so many problems by wstearns · · Score: 2, Informative

      (I do realize that the post was supposed to be funny, but I suspect that people will wonder why there aren't more if the 13 get overloaded). This was tried a few years back; additional nameservers were put in place. Because the query for the root nameservers no longer fit in a udp packet, dns servers had to fall back to dns/tcp requests just to get the list of root nameservers, and we were reminded that a large number of firewalls block dns/tcp. With so many sites no longer able to make any dns lookups, the number was dropped back to 13 within a day.


      For those that would like to try the dnstop package mentioned on the site, I have signed rpms available.

      --
      Mason, Buildkernel and more: http://www.stearns.org/
    2. Re:No wonder these servers have so many problems by Lord+Sauron · · Score: 2, Funny

      >One server to update them all!

      It's wrong. The correct is
      "One server to BIND them all!"

  10. 99.9% by dirvish · · Score: 5, Insightful

    This doesn't even take into account the 99.9% of web pages suck or are unnecessary anyways.

    What standard is this based on? My website wite sucks and is only necessary for my own amusement but it is similar to my favorite kind of sites on the web. I would use the web a lot less if it wasn't for those 99.9% of web sites. Most blogs, for instance, suck and are unnecessary but at the same time the total of all the blogs is having a big impact on news outlets and the media.

    1. Re:99.9% by bucklesl · · Score: 2, Funny

      Fr tht mttr, mst vwls r nt ncssry eithr.

      --
      help fill in hidden movie endings @ End of the Credits
  11. News you can use by El_Smack · · Score: 5, Interesting

    From the article:
    "Researchers believe that many bad requests occur because organizations have misconfigured packet filters and firewalls, security mechanisms intended to restrict certain types of network traffic. When packet filters and firewalls allow outgoing DNS queries, but block the resulting incoming responses..."
    It's nice to see a story with info I can take and use. This is actually "stuff that matters".
    Kudos to the researchers, and now I am off to check my firewall.

    --


    There are 01 kinds of cars in the world. The General Lee, and everything else.
    1. Re:News you can use by LordNimon · · Score: 2, Interesting

      I have a Linksys cable modem firewall/router. How do I fix mine? It appears that pretty much every firewall allows outgoing requests but block incoming ones.

      --
      And the men who hold high places must be the ones who start
      To mold a new reality... closer to the heart
    2. Re:News you can use by El_Smack · · Score: 2, Insightful

      Well, I was more speaking of the guys who cobbled together their own firewall using 2 NIC's and their OS and software of choice. That's what I did, and I easily could have screwed up an iptables command or a default rule and blocked incoming DNS.
      With a purchased firewall, especially if you can't edit it yourself, I would have to assume (uh oh) that the manufacturer got the basics right, at least. I really don't know of a way to check those. You could try an online port scan like sygate.com offers. But your firewall is probably using a "statefull" method, which would allow DNS to come back if you initiated the request, but it would block a NEW request that originated outside. So it will probably say your port 53 is blocked.

      --


      There are 01 kinds of cars in the world. The General Lee, and everything else.
    3. Re:News you can use by lanner · · Score: 4, Informative


      I crazy about my home network firewall configuration, and when it is under my authority, the firewall rules of the business to which I am employed at any time.

      An important but often left out part of a firewall's configuration is logging. Attempts to do things that should never be done should not just be dropped, they should be logged and then brought to your attention.

      Some examples;

      If your local network is 192.168.2.128/29 then any outgoing packet that does not have a source within the range of 192.168.2.129 and 192.168.2.134 should be dropped AND logged. Someone on YOUR network is either stupid or trying to spoof someone!

      The same thing goes for ports and protocols that should not be outgoing on your network.

      Okay, so getting probed on TCP 80 is getting annoying now that you are logging everything that is not allowed. Fine, explicitly drop it without logging.

      Conform to RFC1918 -- don't route IP private space to or from the Internet. Route it to /dev/null or null0 AND filter it. AND if it came from YOUR network, log it. The quantity of ISPs that fail to conform to this is astounding and scary. You don't need this traffic moving around your ISP -- use GRE or MPLS tunneling instead.

      Also, conform to BCP38 ftp://ftp.rfc-editor.org/in-notes/bcp/bcp38.txt

      After tuning your firewall logging filters, you will find that when new attacks occur or something is up, you notice. Otherwise, you are blind and dumb to what your firewall is doing, which means that you are blind and dumb to what your network is doing.

  12. Ignant by edraven · · Score: 5, Interesting
    In addition, 7 percent of all the queries already contained an IP address instead of a host name, which made the job of mapping it to an IP address irrelevant.


    Is it just me, or is this a description of a reverse lookup? How does that qualify as unnecessary? This is a pretty common step in troubleshooting, and some software does a reverse lookup following a forward lookup to verify that the hostname it gets back is the same one it started with.

    Chuckles
    1. Re:Ignant by deepchasm · · Score: 2, Insightful

      No, I assume the researchers are not that stupid.

      They mean that some software, designed to take a fully qualified domain name as input, *always* looks up the input by DNS, even if someone has typed in an IP rather than a hostname - making the lookup unnecessary.

      If it was a reverse lookup it wouldn't just contain an ip (e.g. "1.2.3.4"), it would be "4.3.2.1.in-addr.arpa", that's how reverse lookup works.

    2. Re:Ignant by dachshund · · Score: 5, Informative
      Is it just me, or is this a description of a reverse lookup? How does that qualify as unnecessary?

      I believe that reverse lookups are identified by an "inverse" status flag in the request header. One can only assume that the authors were not counting this sort of valid query, and were only focusing on the "standard" queries that contained IP addresses. Those certainly would, I think, be rather pointless.

    3. Re:Ignant by pde · · Score: 3, Insightful
      Is it just me, or is this a description of a reverse lookup? How does that qualify as unnecessary?

      I believe that reverse lookups are identified by an "inverse" status flag in the request header. One can only assume that the authors were not counting this sort of valid query, and were only focusing on the "standard" queries that contained IP addresses. Those certainly would, I think, be rather pointless.



      Ummm, no. "inverse" does not in any way shape or forme identify a request for the hostname associated with an IP address.

      And the lookups being described are not reverse loops, either. A 'reverse lookup' for 1.2.3.4 is a query for the PTR RR associated with 4.3.2.1.in-addr.arpa. The queries being described are for the A RR associated with the FQDN 1.2.3.4. There is no such TLD as '4.'

    4. Re:Ignant by DavidTC · · Score: 2, Funny
      The thing to do to fix this is to hack the root servers to pretend that a random number is a TLD, and point then at a fake TLD server that gives out the IP address of 127.0.0.1 for any query.

      All those broken programs would immediately stop working.

      --
      If corporations are people, aren't stockholders guilty of slavery?
  13. Serious question by Anonymous Coward · · Score: 5, Insightful
    This doesn't even take into account the 99.9% of web pages suck or are unnecessary anyways. This means that the remaining 2% of necessary DNS queries are probably not necessary either.

    I see this kind of thing all the time on /.--completely unedited, barely literate, rant-style submissions. Why don't the /. editors tone down or eliminate the rhetoric from submissions about otherwise worthy topics, or at least fix the grammar and typos?

    I know, I'm going to get blasted for saying this, but I'm convinced it's one of those "little things" that makes /. look to the rest of the world more like a bunch of know-nothing kids typing at each other than a group of technically literate activists with something of value to contribute.

    I now return you to your regularly scheduled rant...

    1. Re:Serious question by CharlieO · · Score: 2

      Last time I looked the idea was to collate interesting stories and articles from around the web and discuss them.

      I'm not trying to go high brow or anything, I really enjoy the in jokes and the strong opinions, theres nothing wrong with it.

      Its just I don't feel that a story submission should be full of personal opinion - thats up to the slashdotters to add in the comments where its subject to the ebb and flow of moderating - or am I missing something??

  14. Incorrect top-level domains by jb_nizet · · Score: 5, Interesting
    About 12 percent of the queries received by the root server on Oct. 4, were for nonexistent top-level domains, such as ".elvis", ".corp", and ".localhost"

    Why don't DNS servers have a list of correct top-level domains, in order to answer directly, without going to a root server? The list is short, compared to the information the DNS server caches already, and the content of the list doesn't change so often. This list could be downloaded once in a day or so, from the DNS root servers.

    When packet filters and firewalls allow outgoing DNS queries, but block the resulting incoming responses, software on the inside of the firewall can make the same DNS queries over and over, waiting for responses that can't get through

    Why the hell does a firewall accept outgoing queries to black-listed domain names, if they are configured to block the response to these queries? This seems like a serious misconception to me.

    JB.

    1. Re:Incorrect top-level domains by dfn5 · · Score: 5, Informative
      Why don't DNS servers have a list of correct top-level domains, in order to answer directly, without going to a root server?

      This is actually an excellent idea and one that people who use opennic do already. The root zone "." at OpenNIC is setup to be slaved so my DNS server downloads a copy of the root zone which has all the information for all the top level domains. If the root zones get DOSed I don't care because I don't use them anymore. Everyone should use OpenNIC. It is the Internet friendly thing to do. :)

      --
      -- Thou hast strayed far from the path of the Avatar.
  15. In related news by Walterk · · Score: 4, Funny

    74.4% of all statistics are made up on the spot.

    1. Re:In related news by JonnyElvis42 · · Score: 2, Insightful

      >> 99% of slashdot posts are unnecessary.
      > 74.4% of all statistics are made up on the spot.


      That's right! A complete study would probably have shown that first number to be much higher than 99%.
      Heck, we're 3 for 3 so far :P

    2. Re:In related news by PunchMonkey · · Score: 4, Funny

      100% of all queries would be unnecessary if all the lazy 'net users would just maintain their own hosts file and use IP addresses....

      --
      I'll have something intelligent to add one of these days...
    3. Re:In related news by Zordak · · Score: 4, Funny
      74.4% of all statistics are made up on the spot.
      And 100% of users who post that same old joke again should be shot on the spot.
      --

      Today's Sesame Street was brought to you by the number e.
    4. Re:In related news by ArsonSmith · · Score: 4, Funny

      and 6 out of 5 of them have no basis in reality what so ever.

      --
      Paying taxes to buy civilization is like paying a hooker to buy love.
    5. Re:In related news by Ack_OZ · · Score: 2, Funny

      And 100% of users who post that same old joke again should be shot on the spot.

      and then Profit! ;)

    6. Re:In related news by langed · · Score: 2, Interesting
      Hmm. Well, at the University I used to attend, DNS service was quite problematic, on their very young ethernet network. My solution was to play with dig a little bit and enter a couple root servers into my /etc/resolv.conf.

      I guess you could argue that it was unnecessary--but I was the one laughing when everyone else thought "the internet was down" when they couldn't get to Yahoo Games, Hotmail, and Google.

      Of course, it might be noteworthy to mention I find far more "relevant" uses of my Internet connection. :)

  16. Vatican by Anonymous Coward · · Score: 5, Funny

    "Scientists at the Vatican Praying Center found that 98% of the prayer queries at the God level are unnecessary."

  17. Re:Highlight... by Zeinfeld · · Score: 5, Informative
    About 12 percent of the queries received by the root server on Oct. 4, were for nonexistent top-level domains, such as ".elvis"

    If the authors actually thought how the DNS works they would realise the reason for this. A DNS server that gets a request for .com will consult the root the first time and then cache the result. So even though the server might then get a million hits in .com it won't ask the root again.

    If the server tries to query for a non existent domain it will get back a 'non-existent' response. Now it will cache that response for some time but the chances of getting a cache hit is actually pretty low.

    So if you have a properly configured DNS with a bunch of web surfers that view 1 million pages in 20 TLDs and 1,000 bogus ones they will generate 20 hits they would classify as genuine and 1,000 that were 'unnecessary'.

    That is how the system is meant to work.

    The 70% of repeated requests are likely to include outright attacks as well as misconfigured DNS systems.

    The problem dealing with these issues is that a DNS query is pretty cheap to handle, cheaper in fact than most of the proposed defenses. It is probably more expensive for a DNS server to check IPs against a blacklist than to just return the damn data...

    --
    Looking for an Information Security student project suggestion?
    Try http://dotcrimeManifesto.com/
  18. Not really "broken" queries by dachshund · · Score: 5, Interesting
    About 12 percent of the queries received by the root server on Oct. 4, were for nonexistent top-level domains, such as ".elvis", ".corp", and ".localhost".

    And that's a problem? My understanding was dealing with this sort of thing was exactly the purpose of the root DNS servers. If every ISP's DNS server was pre-configured to recognize valid and invalid top-level domains, you could just set them up to go straight to the specific DNS servers handling those domains (.com, .net, .org, etc.) There would be no need for a root-level system.

    The argument for allowing this kind of cracked query through to the root server is that it makes it easy to add new domains (.elvis, .corp, what have you) without forcing everyone to reconfigure their DNS boxes for each new top-level domain.

    1. Re:Not really "broken" queries by aridhol · · Score: 4, Informative

      Actually, according to RFC 2606 (Reserved Top Level DNS Names), .localhost can be blocked by the local DNS, as it is an invalid name (along with .test, .example, .invalid, and .example.(com|org|net)). These are supposed to be used for testing and documentation, so if they aren't in use, they may as well be blocked.

      --
      I can't say that I don't give a fuck. I've just run out of fuck to give.
    2. Re:Not really "broken" queries by alanjstr · · Score: 3, Informative

      No, but the ISPs are supposed to query once a day or so and cache the results, so that the ROOT server isn't the DNS server that Everyone queries.

  19. Re:The real root of the problem... by TheShadow · · Score: 4, Interesting

    Ummm... what does IPv6 have to do with DNS vanishing? With 128-bit IP addresses in an ugly hex-colon notation... DNS will be even more important when people move to IPv6.

    The problem with DNS (and SMTP) is that they are protocols developed during a time where everyone on the internet was operating in a cooperative mode. Now that there is a proliferation of SPAM and DOS attacks, these old protocols break down because they were not developed with security in mind.

    DNS will not go away. But the protocol will probably change at some point.

    --

    --
    "What do you want me to do? Whack a guy? Off a guy? Whack off a guy? Cause I'm married."
  20. How about by HD+Webdev · · Score: 2, Interesting

    1. Bad request received by a root server

    2. Root server notices it's of the 'non-existent top level domain' variety.

    3. Root server sends back information pointing to an ip that shows a web page with a nicer version of 'either you clicked a FrontPage created link, you are a monkey banging a banana on the keyboard, or your ISP administrators don't have a clue'.

    Advantages: It'll embarrass ISP's. It'll cut down on the traffic to the Root Servers.

    Disadvantages: It'll only be noticeable with web queries.

    --
    This is not a dream, not a dream...we are transmitting from the year 1-9-9-9.
  21. Re:The real root of the problem... by NineNine · · Score: 4, Funny

    DNS *does* suck! I mean, who wants to go through all of the trouble of laboriously remembering and typing "slashdot.org" in a browser when they can much more easily remember and type in "234.54.197.233.90.222"?? I pray for that day, also.

  22. DNS Moderation by MarkGriz · · Score: 5, Funny

    How about coming up with a DNS Moderation system.
    The root servers give say 50 karma points to each IP address issuing a query.
    If the query is unnecessary, it gets modded "-1 redundant".
    When karma hits 0, it stops responding to further queries.
    DNS eventually stops working at that site, admin pulls head out of ass and fixes the problem causing the redundant DNS queries.

    --
    Beauty is in the eye of the beerholder.
    1. Re:DNS Moderation by wayne · · Score: 2, Interesting

      While the moderators seem to think this is a "funny" idea, I personally kind of like it. Only, I would recommend creating increasingly long delays in the response or to increase the number of dropped requests. You want that sysadmin who pulls his head out of his ass to at least be able to download fixes and such.

      --
      SPF support for most open source mail servers can be found at libspf2.
  23. Huh? IPv6 a cure for DNS? by Pii · · Score: 4, Funny
    Maybe this was meant as a joke, but on the off chance that it wasn't, allow me to reiterate the subject of my reply...

    Huh?

    Maybe I've been asleep at the wheel when it comes to all of the advantages of IPv6, but how on earth does it alleviate the need for a functioning DNS service?

    Do you imagine that it will somehow be easier for people to remember IP addresses that are 128 bits in length than it is to remember them in their current 32 bit dotted decimal form?

    I guess these will be what we have to look forward to in your DNS-free world of the future:

    • http://[FEDC:BA98:7654:3210:FEDC:BA98:7654:3210] :80/index.html
    • http://[1080:0:0:0:8:800:200C:417A]/index.html
    • http://[3ffe:2a00:100:7031::1]
    • http://[1080::8:800:200C:417A]/foo
    • http://[::192.9.5.5]/ipng
    • http://[::FFFF:129.144.52.38]:80/index.html
    • http://[2010:836B:4179::836B:4179]

    Riiiight.

    --
    For those that would die defending it, Freedom
    has a sweet taste that the protected will never know.
  24. One factor... by ZoneGray · · Score: 4, Informative

    One factor is that I suspect people are increasingly lowering their TTL's, expires, or whatever that parameter is. Most of the manage-it yourself DNS providers now allow an option toreduce that to a few minutes, which makes it much easier to move hosts around. And while a low setting increases DNS traffic, it rarely if ever incurs an extra cost to the domain holder.

    1. Re:One factor... by Slashed+Otter · · Score: 2, Insightful

      That makes no sense...

      We're talking about the root nameserver here, not the server that handles .com. So if I lower the ttl on my domain, that increases the traffic to the server that handles .com and not the server that handles "."

      Basically, me lowering my ttl on my domain doesn't cause DNS servers to forget which machine is authoritative for all .coms

  25. Original story... by Goodbyte · · Score: 5, Informative
    It' seems this originally came from UCSD, so when the page gets /.:ed, here is another one: Original story, and the interesting pie-chart from original story.

    It obviously seems to be a lot of junk traffic, but the only part we can say is bad requests are part 3 and 4 from the chart. Bad spellings must go to the root since there may be such domains!

    It would be nice to analyze the 70% repeated or identical queries, probably lots of traffic can be explained for (or else there are a bunch of administrators out there who need a good manual on bind).

  26. In other news ... by borgdows · · Score: 2, Funny

    Scientists at the San Diego Supercomputer Center found that 98% of the remaining (necessary) DNS queries are related to porn websites!

  27. Lets go a level deeper by jj_johny · · Score: 2, Insightful
    1. Most web users (and unfortunately lots of admins) don't understand DNS at a theory level also don't understand a lot of other stuff like security but...

    2. The amount of time it takes to set up DNS correctly and effeciently with the existing products, especially BIND, is a lot more than it takes to just get them functioning.

    3. The research would have been more interesting if they had gone and looked at say 1000 random requestors who where doing things screwed up and find out why and how they were screwed up.

    4. It would be nice if the local DNS servers had a list of valid top level domains so that it would kill requests to non-existant ones.

    THAT would be stuff that matters!

  28. Another great source for broken DNS. by FreeLinux · · Score: 4, Interesting

    I'm surprised that they did not mention massive numbers of "broken" requests from Windows 2000/XP systems. I see this all the time due to misconfigurations. Administrators often set up the Windows 2000 DNS servers incorrectly and Windows 2000/XP systems(workstations and servers) configured such that they constantly try dynamic DNS updates to the wrong DNS servers, even the root servers.

    Linux too, has some issues here. Obviously misconfigured DNS servers will always be a problem but, distros like Red Hat have IPv6 support compiled into the BIND RPM, this results in an IPv6 formatted query folllowed by an IPv4 query for every request.

  29. Re:Highlight... by Goodbyte · · Score: 2, Insightful

    Finally someone who makes a relevant comment. Though I wonder how the 'search from address bar'-feature has affected the number of non-existent queries.

  30. Unnecessary Queries? by Eskarel · · Score: 3, Interesting
    About 70 percent of all the queries were either identical, or repeat requests for addresses within the same domain. It is as if a telephone user were dialing directory assistance to get the phone numbers of certain businesses, and repeating the directory-assistance calls again and again.

    This is somewhat of an invalid metaphor for both the way dns works, and the way computer caching works. Pretty much every local DNS server(unless my information is wrong), has some sort of caching system of varying degrees of efficiency. The problem is that unlike humans who are more likely to remember things if they are repeated, caching usually just consists of a series of entries which can quite easily be overwritten, older entries will be overwritten if they aren't updated or caching would never work for new frequently accessed sites. It's quite easy to get an access pattern which would remove even the most frequently accessed files from a list especially on a server with a great deal of users. By providing different servers for each chunk of users you can diminish this problem but then you'll get requests from each server. DNS is an ugly system because it does and ugly job.

  31. Re:Highlight... by Zeinfeld · · Score: 4, Informative
    Though I wonder how the 'search from address bar'-feature has affected the number of non-existent queries.

    A way to tell would be to see how many of the queries were looking for mx records.

    I suspect that people using dummy email addresses like 'a@b.c' for subscriptions are another major cause of the misfires.

    The browsers doing search from the address bar probably reduces the number of misfires. A modern browser will only go to DNS if it sees something like foo.bar. If it just sees foo it will typically try foo.com and then go bang a search engine.

    Another reason I suspect spam is a major issue in the misfires is that lots of spam filters do lookup on sender addresses and those frequently point to non existent domains. Also the spam senders rarely do the most basic filtering on their lists - you can tell that since every now and again you get a spam with a full sender list at the top and you can see the broken addresses right there.

    --
    Looking for an Information Security student project suggestion?
    Try http://dotcrimeManifesto.com/
  32. That's not the same problem... by robbo · · Score: 4, Informative

    That's a local problem, between the user and AOL's DNS servers. The article is descibing a different, higher-level problem between, for example, AOL's DNS servers and the root-level servers. If an AOL user's machine makes ten DNS requests for the same host, only one request should propagate past AOL's nameservers, but instead a misconfigured DNS will propagate all ten.

    I can suddenly see lots of slashdot users thinking-- oh, I should fix my firewall, I have all these DNS requests; but that's normal operation for a client workstation. Your firewall would be broken only if all your DNS queries failed, and you'd know it pretty fast if that were the case.

    --
    So long, and thanks for all the Phish
  33. It's a wash by ShaggyZet · · Score: 2, Interesting

    It all comes out in the end anyway. Say AOL has 100 proxies. If 10,000 AOL users visit your site, then it'll look like only 100 unique visitors. Granted this is more than the 1 unique visitor that it would look like for most proxies, but it's still less than the actual number, not more. Presumably there are significantly less proxies at AOL than there are users. It only really matters to small sites like yours and mine, where we're getting excited about each and every visitor, and 10 all at once makes us need a new keyboard.

  34. Re:Ign(or)ant by anticypher · · Score: 5, Informative
    Its not just you, the two completely different DNS databases require different lookups, a common enough mis-understanding. Consider yourself less ignorant now :-)

    To do a reverse lookup, the resolver sends a different request type, asking for a PTR resource record. The form is to put the IP address (or network address) backwards, and append .in-addr.arpa to the request. All (well, ok, most) IPv4 addresses are mapped under the .in-addr.arpa domain. But these misconfigured resolvers are sending A (address) record requests but with a IP address included instead of a domain name.

    If you have your own DNS server and watch your DNS traffic, you can see these two effects happening differently.

    For a forward (A or MX record) lookup:

    Local server queries root server for an A record

    Root server responds with NS record for the registry of the domain

    Local server contacts registry server for A

    Registry server responds with NS records for the domain

    Local server contacts the domain's server, which responds with an A record

    Local server answers the resolver with the A record.

    For a reverse (PTR) lookup, the resolver traverses the netblock providers:

    Local server queries the root servers with a properly constructed PTR request (z.y.x.w.in-addr.arpa.)

    Root server knows only where major net blocks are allocated, and returns the NS record of a Regional Internet Registry (RIPE, APNIC, etc)

    Local server again queries an RIR NS with the PTR

    RIR NS knows which ISPs hold which blocks, so responds with the ISP NS record

    Local server again queries the ISP NS server, which either has the reverse hostname, or once again returns the NS record of the the local DNS server.

    The two different types of queries follow different paths, either Name Registries or Netblock Providers. This article points out that many resolvers are broken because they allow obvious reverse lookups to pass as forward lookups, and then can't deal with the resulting error messages.

    I have often seen broken resolvers repeatedly query DNS servers I manage, possibly because as the article points out, fucked firewalls allow the requests out, but block the requests from getting back to the resolver. It happens so much I just ignore it when I see it, its not worth notifying the admins because they are usually too clueless to know how to fix the problem.

    the AC

    --
    Hemos is like...sci-fi fans;he thinks technology is cool, but he hasn't bothered to understand the science it's based on
  35. Let's ReCap!! by mcoko · · Score: 2, Interesting

    If the First 98% are unnecessary and the last 2% are unnecessary as well...that's 100%...

    That means that you just explained and wished for the Internet to go away...

    or...you some how figured out how an end user can magically come up with the IP for a Host name from thin air. Go You. Your a Millionaire.

    --
    www.fotoforay.com
  36. DNS2 by emil · · Score: 3, Interesting

    Really, we should have some sort of gnutella-like system for distributing zone files. The problem with DNS is that it was designed a LONG time ago before the more recent advances in P2P networks.

    There shouldn't be much argument at this point that we need DNS2 - the current system is vulnerable to attack.

    The problem is that, if you distribute zone files (or pieces of zone files) among a loosely-connected network, then you will need to establish trust. These zone files would have to be signed, and the certificate authority then becomes the bottleneck.

    It hurts my head.

  37. Re:DNS Needs a redesign.... by jefftp · · Score: 4, Insightful

    The fact that DNS, a 20 year old design, still works after being scaled several magnitudes beyond its original environment is proof that DNS doesn't need to be redesigned. The initial design is nothing short of genius. The extensions to the initial design (dynamic updates) build upon already solid technology.

    I run a DNS server, I've looked at DNS packets, and every time I ask the Internet to tell me who the heck slashdot.org is and it comes back with an IP address I'm amazed. My network asks strangers for help and those strangers say: Hey, try here. Bam! Slashdot.org pops up in my browser.

    You cannot "combine" DNS, DHCP, and Routing all into a single protocol. Hell, get three network engineers together sometime and try to get them to agree upon the best Internal Gateway Routing protocol sometime... EIGRP, OSPF, RIP.

    Routing information is extremely different from domain name information. The two have nothing in common other than IP Addresses. You have to include not only information about who your neighbors are, but also what type of links are between you and your neighbors, and how congested those links are. Now, what about your neighbor's neighbors? Oh, we'll track that to, and also keep a set of tables that show us the next two best reconfigurations should any of the links stop working. Unless you're just talking about RIP for routing.

    DHCP on the other hand is about getting clients configured for a network. They can then use DDNS to update their DNS record in a local DNS server. DHCP can do much more than just say: Here's your IP. It can also tell a client: here's where you should get your operating system from, and here's the voice over IP gateway, and here's the server where you should send your management info to, and here's the best local printer to use. Most people don't have clients that can handle that type of information, however.

    It's not just "if it's not broke, don't fix it" this is a case of "it frelling works great, keep your hands off of it or I'll kick you in the jimmy."

  38. But is this really a problem ? by staaktdenarbeid · · Score: 2, Interesting

    The Internet was shown to be a scale-free network by U. Notre Dame physicist Barabasi. It means that the majority of the Web Page Requests is only for a fraction of the total Web Pages (the 'hubs').
    Thus the 98% DNS Queries might be needed for only a minority of connections (I am assuming that Web Traffic is the bulk of Internet Traffic here).

  39. Re:Highlight... by pde · · Score: 5, Informative
    If the authors actually thought how the DNS works they would realise the reason for this. A DNS server that gets a request for .com will consult the root the first time and then cache the result. So even though the server might then get a million hits in .com it won't ask the root again.

    Well, that's the theory. In practice, however, there are millions of servers out there that do not cache NXDOMAIN at all, and just keep querying, over and over and over again, for TLDs that they've already been told don't exist. Microsoft's name server has been known to do this.

    At one point, f.gtld-servers.net was seeing millions of repeated queries per hour from the same two .mil servers asking the same question and refusing to accept the NXDOMAIN. For long periods, these two servers were asking the same question multiple times per click of F's timer. That's.. ummm.. Bad. I suggest that you read the actual CAIDA paper, and the other papers on the subject that Evi Nemeth and others at CAIDA have produced. They *have* thought about how the DNS actually works in practice. You've only thought about how it would work if every implementation worked perfectly, according to your expectations.

  40. Oh, give me a break... by casmithva · · Score: 2, Insightful
    So let me see if I'm getting this right. According to their article, I've somehow misconfigured my nameserver if a query for slashdot.org goes from my local nameserver to the root, then to a VeriSign gTLD server, and then to a VA Software (or whatever y'all are known as this year) nameserver? Funny, I thought that's how DNS was supposed to work! I suppose they want us to go set up ~300 forward zones in our nameservers to prevent these unnecessary queries...? Yeah, okay, sure, I'll get right on that after lunch. *snicker*

    Repetitive queries from the same nameserver in rapid succession, full-blown email addresses, search engine queries -- those are unnecessary, illegitimate queries that indicate not only bad nameserver configuration, but also bad application software. How many assorted DNS query permutation tricks have the various versions of Netscape Navigator tried over the years?

    1. Re:Oh, give me a break... by kindbud · · Score: 2, Insightful

      So let me see if I'm getting this right.

      You aren't. What you described is how it should happen. But once you have the NS for dot-org, which you received in the reply to your first query for slashdot.org, you need not go back to the roots with any dot-org query name until the NS records expire from your cache.

      If your nameserver repeatedly hits the roots for dot-org names, even after it has received the dot-org NS records, that is broken and those queries are unnecessary. Your nameserver should be hitting the dot-org servers, not the roots.

      --
      Edith Keeler Must Die
    2. Re:Oh, give me a break... by kindbud · · Score: 2, Informative

      You haven't looked at the roots in a while, have you? ;) All TLDs (except in-addr.arpa.) were moved off the roots some time ago. In the old days, the roots were also the dot-com servers (and dot-net and dot-org). They returned the NS for a 2nd level gTLD domain because they were ALSO the NS for that TLD.

      This is no longer the case, and I can't paste an example because of Slashdot's LAME lameness filter. But run this command:

      dig @a.root-servers.net slashdot.org mx

      You will getback a delegation to the .org servers, and not the NS for slashdot.org. It's been this way for almost two years now.

      --
      Edith Keeler Must Die
  41. memory by Anonymous Coward · · Score: 2, Funny

    It should be a pre-requisite to remember each and every ip address before accessing the internet.
    Problem solved.!! :)

    1. Re:memory by boneshintai · · Score: 2, Funny

      1.1.1.1... 1.1.1.2... 1.1.1.3...

  42. Re:Huh? IPv6 a cure for DNS? by fliplap · · Score: 2, Funny

    yes, but now everyone's IP can look like:

    3ffe:10d7:::dead:beef:cafe:babe
    and
    2001:234d: ::be:a:beef:ace

    what exciting words and numbers can you come up with 1234567890ABCDEF!

  43. Re:Ignant...you've got it all wrong. by Agent+Green · · Score: 4, Informative

    Reverse lookups go by sending a PTR request containing an IP address to a DNS server, versus a A request with a name as a snippet from this TCPdump shows a request from one my boxen to my DNS server:

    Reverse:

    12:59:31.814847 defender.licensedaemon > gimpy.domain: 20091+ PTR? 1.65.0.199.in-addr.arpa. (41)
    12:59:31.816003 defender.1029 > arrowroot.arin.net.domain: 19500 [b2&3=0x10] [1au] PTR? 1.65.0.199.in-addr.arpa. (52)

    Forward (complete request cycle from defender to gimpy):

    13:11:54.760484 defender.globe > gimpy.domain: 47604+ A? www.gtei.net. (30)
    13:11:54.761597 gimpy.1029 > dnsauth1.sys.gtei.net.domain: 51438 A? www.gtei.net. (30)
    13:11:54.977584 dnsauth1.sys.gtei.net.domain > gimpy.1029: 51438*- 1/3/3 A 128.11.42.31 (167) (DF)
    13:11:54.978626 gimpy.domain > defender.globe: 47604 1/3/0 A 128.11.42.31 (119)

    DNS & BIND is the first book to use for more info, though.

    --
    // Agent Green (Ian / IU7 / KB1JQO)
    // IEEE 802.3: All 10base Are Belong To Us
  44. Wait a minute, you don't understand the artical by qix · · Score: 4, Informative


    A DNS query for an IP address is a *BAD REQUEST* contrary to what some of these other posters have said. Asking a root server to resolve anything in the first place, is bad - they should only be asked for NS records - and in the second place, an IP address is not a valid domain name (unless ICANN has serripitiously added 256 new top level domains, namely, the numbers 0 thru 255).

    Most networks that I've seen, are badly broken this way. The usual problem is that the network in question may use private address space (192.168.1.0/24 for example), but fail to install reverse dns for these addresses, causing delays and other problems when machines try to get the name associated with their ip address or that of a local machine connecting to them. Yes, you heard right - if you use any of the 192.168.x.x, 10.x.x.x, or 172.16-32.x.x addresses, you are broken unless you install dns to resolve for those addresses! This also goes for any ip netblock in general, although most isp's these days are setting up dummy records for their unused ip space that'll cover their customers allocations ok.

  45. Acronym soup by Linuxathome · · Score: 2, Funny

    Let me get this straight. This is a study of DNS conducted by CAIDA at SDSC at UCSD? I need a host list for these acronyms!

  46. Re:Highlight... by Zeinfeld · · Score: 2, Informative
    Well, that's the theory. In practice, however, there are millions of servers out there that do not cache NXDOMAIN at all,

    That is hardly suprising since a lot of servers don't even cache the positive hits.

    The report said 70% of the hits were repeated requests. Again this is not too suprising, the root zone caches really well. There are less than 200 domains after all and only 20 of those have a significant degree of activity. The TLD configurations change so infrequently that the TTL could be set at a month without inconveniencing anyone.

    So the 'necessary' traffic for the root servers is negligible. Even with a million odd DNS servers out there each root need see no more than a few tens of thousands of hits an hour.

    It makes no real difference since the roots have to be scaled to be able to survive a sustained DDoS attack for at least as long as it takes remediation measures to kick in. Get rid off all the bozo queries and you still need the same size box because of the script kiddies.

    There are a bunch of changes that could be put in place that would reduce the DDoS problem. First we could follow the proposals of Mark Kosters and Paul Vixie to start using anycast (this looks like it is going ahead).

    Another thing we could do is to change the DNS logic so that servers keep records in their cache beyond the TTL and use those as backup if the root or TLD is unavailable. Then even a DDoS that succeeded would have only marginal effect.

    --
    Looking for an Information Security student project suggestion?
    Try http://dotcrimeManifesto.com/
  47. It's the SPAM by haapi · · Score: 3, Insightful

    I'll bet a large percent of the queries, especially for bogus top-level domains, are due to lookups by MTAs when receiving SPAM. Think of the numbers!

    This doesn't mean that even these queries shouldn't be handled better -- just that SPAM lookups cause a bunch of 'em.

    --
    Well, apparently, you only have to fool the majority of people for a little while.
  48. Fess up... by FuzzyDaddy · · Score: 4, Funny
    How many people just went and checked to see if there's an .elvis TLD?


    Actually, I've always had a theory that Microsoft coined ".msn" because they wanted to get their own top level domain.

    --
    It's not wasting time, I'm educating myself.
  49. Re:Highlight... by kirknall · · Score: 2, Interesting

    I've read through every comment on this page, hoping someone could explain this to me, but alas, I guess I'll just ask.
    How does a query containing a valid IP address ever get to a root server?
    It just seems to me that anything with an IP would bypass DNS altogether. Thanks!

  50. This is pathetic and typical of /. by Royster · · Score: 3, Informative

    You don't know what you are talking about, so you rant.

    There is nothing malformed about a .elvis DNS request. ICANN might decide to open up .elvis registrations tomorrow and program the root servers to respond to them. If every DNS server had to be reprogrammed every time a new TLD was added, it would be a maintenance problem whenever the TLDs were expanded.

    The elegant part of the design was to define the protocol to look up unknown TLDs and unrecognized TLDs at the roots. It didn't anticipate a few million monkeys typing search terms into browser address lines.

    The fault for the excess lookups lies in the applications programmers.

    --
    I have discovered a truly marvelous sig, unfortunately the sig limit is too small to contain i
  51. 3rd party DNS servers and experience. by BrookHarty · · Score: 2, Informative

    I can think of some important things to consider about the interaction of your network and root servers.

    1. Some 3rd party DNS programs, like NetIQ and Preside. Require you to have root servers configured. Some will even break if you put false root information into it.

    2. All unknown queries are sent to root servers, like your DMZ'ed networks. (Depends on your software has failsafe mode, but Bind can be disabled.)

    3. Other TLD's are queried by the root domains. .GPRS used for 3G phone networks (example) might be also query the wrong root level servers.

    4. With the security being a hot topic, networks are switching from recursive lookups to iterative mode. Which makes for more visible lookups on a network sniff by increased traffic.

    Also, you can offload dns on your home networks by using a local dns server. Really handy, caches lookups, saves bandwidth, easier to setup than bind, and can use your /etc/hosts file to pretend you have DNS on a nat'ed network. dnsmasq on freshmeat is very nice choice.