Google Outlines AI-Based Number Reading For Street View Photos
mikejuk writes "A recent Google research paper outlines how it might use AI to read digits in natural images — specifically Street View photos. The idea is to automatically extract the number of each house as captured by Street View and then use this to improve the geocoding data returned by Google. When you next ask for directions to a particular address the new data could be used to show you a street view looking directly at the house you specified."
does it crash?
I've poked around the US Census data which has geographical coordinates of pretty much everything you can imagine - streets, natural landmarks, including addresses.
How does most mapping software get its dataset? Why does Google need to find street numbers from the photos? (Probably because a public dataset like this isn't available globally) I imagine that Census data is a free alternative but professional geolocation data requires big fees.
http://www.census.gov/geo/www/tiger/
Perhaps I'm not thinking hard enough, but this seems to me to be an development from one of the $InternetSocialMediaOverlords that doesn't seem creepy.
I mean, this is a nice feature that will save time and be useful; but it doesn't go revealing yet more personal information, or infact anything that you couldn't do yourself by browsing streetview a bit.
So yeah, a new shiny that doesn't yet make things worse or closer to 1984.
1. Did OCR report (with confidence >=95 percent) number N, where (0 <= N < 4000)? Choose N.
2. Did OCR for property's left-hand neighbor report N, and for right-hand neighbor report (N+4)? Choose (N+2).
3. Same as #2 with left-hand and right-hand reversed.
4. Did OCR for property's left-hand neighbor report N, and for right-hand neighbor report (N+2)? Choose (N+1).
5. Same as #4 with left-hand and right-hand reversed
6. Did OCR for property's left-hand neighbor report N, where (N % 100) >= 80, and for right-hand neighbor report Q, where (Q / 100) <= 20? Choose (N / 100) * 100.
7. Same as #6 with left-hand and right-hand reversed.
If it can read a digit in a pattern then capchta just became useless as a way of keeping bots out of a website
captcha anyone?
It's clear now.
I've been wondering when they'd make that work.
In retail areas, street numbers tend not to be too prominent. It may be necessary to read business signs and use that data to disambiguate addresses. This would help to clean up the phony-business problem in Google Places. An alternative is to use real estate records, as the USC Geocoder does for some areas, to get a solid lock on address vs. physical position. But that data is only available for some areas. There are also the Census Bureau's TIGER/LINE files, but they're US only and not complete for the entire US.
Outside the US, this is likely to be more useful. If you have a few street numbers and a few business signs per block, you can infer the rest reasonably accurately.
http://cache.gawkerassets.com/assets/images/4/2011/11/8ca4aa827c1151851324895f5ffb778b.jpg
If its able to read the digits in all those varieties, then it will probably be great at reading captchas.
Thanks...
For an advanced Tolitarian state to perpetuate a surveillance society, an AI must exist which is powerful enough to search the infinite amount of data created, and determine it's enemies as efficiently as the market determines winners and losers.
If the proliferation of CCTV using facial recognition could be tied into Google allowing you to Google for faces and where they were last seen.
So using a 3 Dimensional recognition algorithm and saving that data plus the location would then allow you to drag a picture of your friends face into Google (as you already can) then get results of where that face was last seen.
"If any question why we died, Tell them because our fathers lied."
Could humans, with their supremely evolved optic nerve/brain, outperform any A.I. that Google might have now or in the near future? Might it even be cheaper? (Just how much does difficult A.I. cost in the cloud anyway?). Maybe even if A.I. is much cheaper, it would still be useful for using humans for the difficult cases, or as error correction. Best would be for the humans to train the A.I. (and themselves out of a job!)
I always thought this was the only possible reason how the machine intelligences in "The Matrix" could use humans. While they lay in their pods, dreaming their lives away, use the unused portions of thir brains (no, not 90%) to perform tasks that humans are "better" at. Like image recognition or perhaps language. That whole "battery" idea was just stupid.
Then again they should've had their eyeballs plucked out and a connector wired directly to their optic nerve. Kinda would've made making a movie, in which Neo and all the other humans were stumbling around blind, difficult.
It's interesting to note that one of the co author is also the teacher for stanford free classe on Machine Learning and that the last lesson of the course was on this topic...
what if I live at Sales, 20?
I feel like a high beyonder stuck in the slow zone, damn you Pham and your disaster!
Will it work with those annoying recaptcha boxes?
GENERATION 26: The first time you see this, copy it into your sig on any forum and add 1 to the generation.
It's machine learning.
There's a difference.
Check your premises.
After all, your address is public, until it's tied to your name. Even that info is more or less public in a lot of cases (name on the mailbox?) But it does open up the potential for fun new hacks and pranks... It'll be a good test of AI and OCR, ultimately. I sense new captchas on the horizon.
Zombie Killa, Free nerdcore hip-hop novella http://www.smashwords.com/books/view/119619
they were trying to appeal to english speaking olympics visitors in 2008, and the translation server crashed, but they thought the error message was the translation, so we get:
http://boingboing.net/2008/07/15/chinese-restaurant-c.html
so would it be like divide by zero if machines try to map the real world and encounter a bit of the real world mimicking machine world fail?
intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
What the f*** would the point of that be? in the UK they blur out door numbers and street signs which is bloody inconvenient when you're researching visiting a place.
Waterfox - a Firefox fork with legacy extension support, security updates and better privacy by default.
You open a shop in the same road as a very sucessful one and put their number sign (and the shop name, in case they use OCR on it) just when the google car passes by......
n/t
This might path the way for some nice augmented reality gear.
Look at a building though your augmented gear/apps and see that there is a Japanese restaurant inside the building and it is rated 3 stars.
can you opt out?
Or are they now going to show everything they can collect on an adres when you punch it in?
Here is also the facebook, linkedin, email adres, etc.... of the persons of interest on this adress
is 867-5309
Harald
I remember when I first heard that the spy satellites could read a cars license plates.
I thought that's good trick, not only do they have to view small characters they
had to do it sideways.
In Saskatchewan, we have a Crown Corporation that is responsible for maintaining, updating, and distributing that map data for the province. It's accurate data -- collected using actual transits and GPS systems. Google can not improve on the data quality by reading street signs with AI at all.
I suspect it has a lot more to do with licensing issues for Google wanting to republish the data to the general public, something that I'm sure our crown corp would frown upon as it would cut out their geodata sales market. Google wants a royalty-free data source, not better data quality.
I do not fail; I succeed at finding out what does not work.
... and get an army of volunteers to help them.
Most StreetView images are soooooo low res that you can't make out street numbers on most houses.
Maybe businesses with large street number signs, but not houses.
With the first link, the chain is forged.