How To Evade URL Filters With (Not-So) Fancy Math

Posted by timothy on Tuesday March 23, 2010 @11:22AM from the could-I-have-twice-a-half-dozen dept.

Trailrunner7 writes "In their constant quest to find new and interesting ways to abuse the Internet, attackers recently have begun using an old technique to obfuscate URLs and IP addresses to bypass URL filters and direct users to malicious sites. The technique takes advantage of the fact that modern browsers will allow users to specify IP addresses in formats other than base 10. So a typical IP address that looks something like this — 192.10.10.1 — can also be written in base 8, hexadecimal or a handful of other formats, and the browser will recognize it and take the user to the specified site. What is interesting though is that due to the relative obscurity of using such methods to denote an IP or URL, it is quite feasible that existing security products do not correctly identify the URLs as valid or flag them as malicious when they point to existing known bad websites."

22 of 162 comments

Technical details here by TSHTF · 2010-03-23 11:26 · Score: 4, Informative

The linked article is next to worthless. The real details are in this blog post.
1. Re:Technical details here by AnEducatedNegro · 2010-03-23 11:29 · Score: 5, Funny
  
  don't you mean in this blog post?
2. Re:Technical details here by TheRaven64 · 2010-03-23 11:52 · Score: 4, Informative
  
  OpenDNS is irrelevant. These are IP addresses, they are not domain names, so they don't need to go via DNS to be resolved. None of the links works in Safari on OS X either, but you can ping the IPs in the terminal, so it appears to be a bug (or 'security feature') in libcurl, which is what Safari uses for resolving URLs (earlier versions used CFURL, now WebKit uses libcurl directly). Checking this in the terminal shows the problem is actually deeper; libcurl passes the address to getaddrinfo(), but that fails. Trying the same command on GNU/Linux works correctly, so the glibc implementation of getaddrinfo() does handle this kind of resolution correctly. I presume that on OS X the ping utility handles its own address parsing; telnetting to 0x42.0x66.0x0d.0x63 fails in the host lookup stage.
  
  --
  I am TheRaven on Soylent News
3. Re:Technical details here by moreati · 2010-03-23 12:16 · Score: 4, Interesting
  
  don't you mean in this blog post [3273372964]
  Interesting. Though Slashcode presented your URL as typed by you, hovering over it and right-click-copy in Chromium shows the canonical dotted quad http://195.27.181.36/en/weblog?weblogid=208188044
4. Re:Technical details here by ObitMan · 2010-03-23 12:55 · Score: 3, Informative
  
  never mind. i misread the article, sorry
  
  --
  Who run Barter Town?
5. Re:Technical details here by plover · 2010-03-23 13:27 · Score: 4, Interesting
  
  That blog post even has a variant of obfuscation the author likely didn't intend. He mentioned octal, but used a funny notation in his google.com example:
  http://00000102.00000146.00000015.00000143/
  True octal notation simply requires a single leading zero, like this:
  http://0102.0146.015.0143/
  The cool thing is this opens a new avenue for further defeating the fixed string-based scanners. These are all equivalent:
  http://00000102.00000146.00000015.0143/
  (Slashdot makes me fill the lines with not-repetitive stuff.)
  http://00000102.00000146.00000015.00143/
  (Slashdot makes me fill the lines with not-repetitive stuff.)
  http://00000102.00000146.00000015.000143/
  (Slashdot makes me fill the lines with not-repetitive stuff.)
  http://00000102.00000146.00000015.0000143/
  (Slashdot makes me fill the lines with not-repetitive stuff.)
  http://00000102.00000146.00000015.00000143/
  Sure, a regexp would easily solve the problem, but that seems to be part of the root problem anyway.
  
  --
  John
6. Re:Technical details here by MBCook · 2010-03-23 13:37 · Score: 3, Interesting
  
  I'm on Safari on OS X, and I can tell you that the link doesn't work. I get the standard Safari page saying "Can't find the server 3277....".
  I tried the links in the blog post. The first three don't work; they have the same problem. The fourth link, the one padded with 0s, eventually failed because the server didn't respond (/.ing, I'm guessing).
  This is the first time Safari has failed me in something geeky like this. Safari is the only browser that renders my brother's URL properly. It's one of the Unicode symbols, and Safari shows it that way. Safari shows (snowman).net correctly, but Firefox turns it into xn--n3h.net.
  Of course, /. won't let me post a Unicode character.
  
  --
  Comment forecast: Bits of genius surrounded by a sea of mediocrity.
7. Re:Technical details here by SEWilco · 2010-03-23 13:49 · Score: 3, Insightful
  
  I learned about this back in 2002 in my Network security class
  Those who do not learn history are doomed to repeat it. And issue patches.
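
The notations traded in the replies above (hex like 0x42.0x66.0x0d.0x63, and octal padded with any number of leading zeros) can be checked against each other with a short sketch. The helper names `parse_component` and `to_dotted_quad` are made up for this illustration, and the parsing rules are an assumption modelled on the usual C-style prefixes (0x for hex, a leading 0 for octal), not any particular browser's code.

```python
def parse_component(text: str) -> int:
    """Parse one dot-separated part using C-style prefixes (assumed rules)."""
    if not (text.isascii() and text.isalnum()):
        raise ValueError(f"not a numeric component: {text!r}")
    lowered = text.lower()
    if lowered.startswith("0x"):
        value = int(lowered[2:], 16)      # hexadecimal
    elif len(lowered) > 1 and lowered.startswith("0"):
        value = int(lowered, 8)           # octal; extra leading zeros change nothing
    else:
        value = int(lowered, 10)          # plain decimal
    if not 0 <= value <= 255:
        raise ValueError(f"component out of range: {text!r}")
    return value


def to_dotted_quad(host: str) -> str:
    """Rewrite a four-part host in any mix of bases as decimal a.b.c.d."""
    parts = host.split(".")
    if len(parts) != 4:
        raise ValueError("this sketch only handles four-part addresses")
    return ".".join(str(parse_component(p)) for p in parts)


variants = [
    "0x42.0x66.0x0d.0x63",
    "0102.0146.015.0143",
    "00000102.00000146.00000015.00000143",
    "00000102.00000146.00000015.000143",
]
# Every spelling collapses to one canonical address.
canonical = {to_dotted_quad(v) for v in variants}
print(canonical)
```

A fixed-string scanner sees four different strings here; anything that compares the canonical form sees one address.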
Yeah But... by Greyfox · 2010-03-23 11:30 · Score: 4, Informative

I actually preferred using a url with the 10 digit number that was my base 10 IP address in E-Mails as it got people's attention in an otherwise bland sea of domains. This has been a feature of libc as long as I can remember (in Linux you should be able to ping an IP address in some other number base) but Firefox actually makes an effort to disallow using IP addresses with this notation. So if they're using Firefox, it won't work so well.
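
The 10-digit form is just the four octets read as one 32-bit number. A minimal sketch using Python's standard `ipaddress` module; the address 195.27.181.36 and its integer form 3273372964 are the pair quoted earlier in this thread, and the variable names are only for illustration.

```python
import ipaddress

quad = "195.27.181.36"

# Dotted quad -> single base-10 integer (the "10 digit number" form).
as_integer = int(ipaddress.IPv4Address(quad))
print(as_integer)

# The same value computed by hand: each octet is one base-256 digit.
a, b, c, d = (int(part) for part in quad.split("."))
by_hand = (a << 24) | (b << 16) | (c << 8) | d

# Integer -> dotted quad, the form a filter would normally compare against.
round_trip = str(ipaddress.IPv4Address(as_integer))
print(round_trip)
```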

--
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
Re:102 105 114 115 116 112 111 115 116 33 by bytethese · 2010-03-23 11:32 · Score: 4, Funny

That's the same combination I have on my luggage!
Oh come on by Zouden · 2010-03-23 11:33 · Score: 5, Interesting

It doesn't matter which way you enter the address into your browser, it still resolves to the same IP. If that IP is blocked, you won't get through even if you use this method.
FTFA:

it’s possible to imagine URL filtering tools having the same lack of support.
In other words, no testing has been done at all. What is this poorly-thought-out bit of speculation doing on the front page of Slashdot?

--
"A week in the lab saves an hour in the library"
Works in Chrome by crow · 2010-03-23 11:33 · Score: 3, Interesting

All the alternate methods of specifying IP addresses for URLs work in Chrome. When you mouse over the link, you see it with the traditional decimal IP address, so it's not as obfuscated as it could be. Similarly when you reach the site, the URL displayed is in the traditional format.
Addresses like http://0xdeadbeef/ and http://0xdeadd00d/ are assigned to a Chinese telecom company (they have all of 0xdead....).
And the lesson people don't learn is... by Estanislao Martínez · 2010-03-23 11:37 · Score: 4, Insightful

You can't just do things like this based on the syntax of the input, but rather on the semantics. In this case, to properly block the URLs, you need to parse them and transform them into an abstract representation of what they mean, e.g. a struct that encodes the protocol, host, port, document and query strings, and then examine the parse result to check if it matches the rule.
The IT industry just systematically fails this over and over, because of people's bad habit of doing shit with regular expressions instead of parsing and semantic analysis. See, for example, the gazillion ways that people get around cross-site scripting filters; or if you want to see it from the other angle (generation instead of parsing), see SQL injection.
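
A minimal sketch of the parse-then-match idea, assuming a blocklist keyed on canonical dotted-quad addresses. `ParsedUrl`, `canonical_ipv4`, `parse_url`, `is_blocked` and the example blocklist entry are all hypothetical names and data for illustration; the numeric parsing follows the traditional inet_aton conventions rather than any specific product's behaviour.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlsplit


@dataclass(frozen=True)
class ParsedUrl:
    scheme: str
    host: str            # dotted quad if the host was numeric, else lowercased name
    port: Optional[int]
    path: str
    query: str


def canonical_ipv4(host: str) -> Optional[str]:
    """Return a.b.c.d for inet_aton-style numeric hosts, or None for names."""
    parts = host.split(".")
    if not 1 <= len(parts) <= 4:
        return None
    values = []
    for part in parts:
        if not (part.isascii() and part.isalnum()):
            return None
        try:
            if part.lower().startswith("0x"):
                values.append(int(part[2:], 16))
            elif len(part) > 1 and part.startswith("0"):
                values.append(int(part, 8))
            else:
                values.append(int(part, 10))
        except ValueError:
            return None          # not a number, so treat the host as a name
    # Leading parts are single bytes; the last part fills the remaining bytes.
    *head, tail = values
    if any(not 0 <= v <= 255 for v in head) or not 0 <= tail < 256 ** (4 - len(head)):
        return None
    number = tail
    for index, v in enumerate(head):
        number |= v << (8 * (3 - index))
    return ".".join(str((number >> shift) & 0xFF) for shift in (24, 16, 8, 0))


def parse_url(url: str) -> ParsedUrl:
    split = urlsplit(url)
    raw_host = (split.hostname or "").lower()
    host = canonical_ipv4(raw_host) or raw_host
    return ParsedUrl(split.scheme, host, split.port, split.path, split.query)


BLOCKED_HOSTS = {"66.102.13.99"}   # example entry only


def is_blocked(url: str) -> bool:
    """Match the rule against the parsed meaning, not the raw text."""
    return parse_url(url).host in BLOCKED_HOSTS
```

With this shape, `is_blocked("http://0x42.0x66.0x0d.0x63/")` and `is_blocked("http://0102.0146.015.0143/")` hit the same rule, because the comparison happens after the host has been reduced to one representation.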

--
Are you adequate?
Big problem by Bogtha · 2010-03-23 11:39 · Score: 4, Informative

The problem with this approach is that the requested URL doesn't provide a hostname, just the IP address. As IP addresses are in short supply, it has been an extremely common practice for years to assign multiple websites to a single IP address, otherwise known as name-based virtual hosting. This is common even for large companies. When you specify the URL with an IP address, the browser doesn't provide an appropriate Host: HTTP header, so any web server set up this way won't know which of the many websites it hosts should be returned. This means that anybody browsing the web with this technique will find that some websites work and some won't, seemingly at random to them.
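
A rough sketch of why that happens, assuming a server that picks a site purely from the `Host` header. The site names, the `VIRTUAL_HOSTS` table and `select_site` are invented for illustration; real servers have their own configuration and fallback rules.

```python
from typing import Dict

# Hypothetical name-based virtual hosts sharing one IP address.
VIRTUAL_HOSTS: Dict[str, str] = {
    "shop.example": "/srv/shop",
    "blog.example": "/srv/blog",
}
DEFAULT_SITE = "/srv/default"   # whatever the server falls back to


def select_site(host_header: str) -> str:
    """Pick a document root from the Host header, ignoring any port."""
    name = host_header.split(":", 1)[0].strip().lower()
    return VIRTUAL_HOSTS.get(name, DEFAULT_SITE)


# A request made by name carries the name; one made by IP carries only the IP.
print(select_site("blog.example"))      # matched by name
print(select_site("192.10.10.1"))       # no name to match, so the default is served
```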

--
Bogtha Bogtha Bogtha
Why? by Anonymous Coward · 2010-03-23 11:41 · Score: 4, Insightful

Who thought it was a good idea to allow IP addresses to be entered in so many different formats? Who are you to decide that 0x01 is not a domain name? This is a feature which is hardly ever going to be used legitimately, but the code must be written and tested. KISS. Keep it simple, stupid.
Parent is troll link - don't click. by Anonymous Coward · 2010-03-23 11:41 · Score: 3, Informative

Here is some text to get past the filter.
Welcome to the 20th century by Dachannien · 2010-03-23 11:49 · Score: 4, Informative

I'm glad Slashdot is here to tell us about these things, or else I might not have found this important security bulletin.
Re:We learned this on slashdot. by bakdor · 2010-03-23 12:49 · Score: 4, Funny

We must have had 20 different ways to get to goatse.cx.
I didn't need 20 different ways. I just had it bookmarked for quick and easy viewing.
Re:Simple defense: by DavidRawling · 2010-03-23 13:29 · Score: 3, Insightful

Unfortunately you now cannot configure your ADSL modem until you install and configure local DNS and add the modem to the zone. Hardly something most grandmothers can do.
This is totally going over your head. by Estanislao Martínez · 2010-03-23 13:55 · Score: 3, Insightful

No matter how you try to obfuscate the destination - a base-10 "number", octal, binary, who effing cares how - it still goes out on the wire as an IP packet with a destination address field, either sourced from your desktop or your proxy. Packets don't lie.
Not all IP address filtering is done by IP firewalls. These days many applications, most notably web browsers, consult online databases of known or suspected malicious hosts in order to protect users. I know for a fact that Firefox and Safari do this--if you try to go to a known or suspected malware site, the browser pops up a warning page instead of the page you asked for. Google also do it for their search results--suspected malware site results don't link to the site in question, they link to a warning page. Many websites also have anti-XSS submission filters that perform textual matching against known "bad" addresses, to protect their users from attacks.
Apparently, many such programs are not parsing the textual IP addresses into a canonical form, and are therefore vulnerable to this sort of obfuscation. So the typical result here is that a comment submission system will fail to block a comment that has some XSS in it, and the users' browsers, running on a network whose firewall doesn't filter the IP address in question, will then fetch a malicious script from a known malware site.

--
Are you adequate?
Get prepared to have your mind blown by gqx · 2010-03-23 16:11 · Score: 5, Informative

The author apparently does not realize this, but you can also partly concatenate octets and mix various notations:

http://0x4a.8196963/

And yes, congratulations on being cutting edge: this thing is so old and well-known that it's even explicitly covered in RFC 3986, section 7 ("Security Considerations"), subsection 7.4 ("Rare IP Address Formats").
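
To see why the mixed URL above works, here is a small sketch of the arithmetic, assuming the traditional inet_aton rule that the last part fills all the remaining bytes. `mixed_to_quad` is a hypothetical helper that handles only this two-part shape (first octet, then a 24-bit decimal number).

```python
def mixed_to_quad(host: str) -> str:
    """Expand a two-part 'first-octet.remaining-24-bits' host to a.b.c.d."""
    first_text, rest_text = host.split(".")
    if first_text.lower().startswith("0x"):
        first = int(first_text, 16)       # int() accepts the 0x prefix with base 16
    else:
        first = int(first_text)
    rest = int(rest_text)
    if not 0 <= first <= 255 or not 0 <= rest < 2 ** 24:
        raise ValueError("parts do not fit in 8 and 24 bits")
    # The trailing number is split big-endian across the last three octets.
    octets = [first, *rest.to_bytes(3, "big")]
    return ".".join(str(o) for o in octets)


print(mixed_to_quad("0x4a.8196963"))   # 74.125.19.99
```

The check is 125 * 65536 + 19 * 256 + 99 = 8196963, so the decimal tail is just the last three octets packed together.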
Not quite new by Cyberllama · 2010-03-23 21:41 · Score: 3, Interesting

This is actually just a watered-down version of a very, very old trick wherein you'd take a URL like http://3273372964/en/weblog?weblogid=208188044 and insert www.cnn.com@ before the IP address in long form. This of course meant the browser would try to log in to the "real" website with the login "www.cnn.com". So you'd end up with a URL that looked very much like it was part of CNN's website but was in fact something else entirely. I'd show you a demonstration URL, but Slashdot filters out the obfuscating part of URLs formatted in that way so it would look identical.
At any rate, these days, not only do forums like Slashdot actively weed out those sorts of URLs as obvious attempts at obfuscation, but browsers pretty much universally will throw up a warning before taking you to a website obfuscated in that manner. And as a result, that trick long ago fell out of fashion.
But it seems everything old is new again, if you wait long enough.
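
The old `user@host` trick is easy to spot once the URL is parsed rather than eyeballed. A short sketch with Python's `urllib.parse.urlsplit`; the URL is assembled from the pieces described above and is only an example, and `has_userinfo` is a hypothetical helper name.

```python
from urllib.parse import urlsplit

url = "http://www.cnn.com@3273372964/en/weblog?weblogid=208188044"
parts = urlsplit(url)

# Everything before the '@' is userinfo, not the destination.
print(parts.username)   # www.cnn.com
print(parts.hostname)   # 3273372964


def has_userinfo(candidate: str) -> bool:
    """Flag URLs that carry credentials, a common disguise for the real host."""
    split = urlsplit(candidate)
    return split.username is not None or split.password is not None
```

A filter or forum could treat `has_userinfo(url)` being true as a reason to warn or to strip the userinfo part before displaying the link.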