Slashdot Mirror


Google Outlines AI-Based Number Reading For Street View Photos

mikejuk writes "A recent Google research paper outlines how it might use AI to read digits in natural images — specifically Street View photos. The idea is to automatically extract the number of each house as captured by Street View and then use this to improve the geocoding data returned by Google. When you next ask for directions to a particular address the new data could be used to show you a street view looking directly at the house you specified."

50 of 68 comments (clear)

  1. what happens when it reads 0 / 0? by Joe_Dragon · · Score: 5, Funny

    does it crash?

    1. Re:what happens when it reads 0 / 0? by Ksevio · · Score: 4, Funny

      Might run into problems when it drives by my house at number:
      42'); DROP TABLE street_numbers; --

  2. US Census TIGER/Line shape files? by Anonymous Coward · · Score: 3, Interesting

    I've poked around the US Census data which has geographical coordinates of pretty much everything you can imagine - streets, natural landmarks, including addresses.

    How does most mapping software get its dataset? Why does Google need to find street numbers from the photos? (Probably because a public dataset like this isn't available globally) I imagine that Census data is a free alternative but professional geolocation data requires big fees.

    http://www.census.gov/geo/www/tiger/

    1. Re:US Census TIGER/Line shape files? by djl4570 · · Score: 2

      An underlying reason for this is to better correlate the map with the objects placed on the map. Accurately place the Pita Place on the map between the Jamba Juice and the Charbucks instead of putting it on the other side of the parking lot where there isn't a building.

    2. Re:US Census TIGER/Line shape files? by skids · · Score: 1

      At least some of the current store of street number data came from publishing HITS on Amazon Turk, so it's wetware-derived.

    3. Re:US Census TIGER/Line shape files? by barry99705 · · Score: 1

      A lot of this comes from the R.L. Polk database. Those guys hire out at minimum wage. Would you trust that data? I know a few guys that sat at a Denny's for a week just reusing info from a phone book for their daily logs. This will by a lot more accurate, since I don't know too many people that put the wrong street number on their houses.

    4. Re:US Census TIGER/Line shape files? by chrismcb · · Score: 1

      Why does Google need to find street numbers from the photos?

      Google doesn't NEED to find the street numbers from photos. But evidence suggests that the data Google currently uses isn't quite accurate. Google thinks it can improve on that accuracy, for almost no additional cost.

  3. For once a development that isn't creepy by PSVMOrnot · · Score: 1

    Perhaps I'm not thinking hard enough, but this seems to me to be an development from one of the $InternetSocialMediaOverlords that doesn't seem creepy.

    I mean, this is a nice feature that will save time and be useful; but it doesn't go revealing yet more personal information, or infact anything that you couldn't do yourself by browsing streetview a bit.

    So yeah, a new shiny that doesn't yet make things worse or closer to 1984.

    1. Re:For once a development that isn't creepy by Surt · · Score: 1

      This improves the ability of the gestapo to avoid getting lost on their way to pick you up.

      --
      "Who is the Journal of Quantum Physics going to believe?" --Stephen Hawking
    2. Re:For once a development that isn't creepy by Time_Ngler · · Score: 1

      This isn't a development of the $InternetSocialMediaOverlords. It's just a bunch of nerds who know some algorithms and enjoy solving practical problems who thought this would be fun to attempt, and happen to work for the $InternetSocialMediaOverlords. Trying to look for a conspiracy in this would be akin to looking for a conspiracy in the way the leaves pile up in your driveway in the fall.

    3. Re:For once a development that isn't creepy by Anonymus · · Score: 1

      That's right, this isn't creepy, because StreetView itself is the creepy part. This is just refining it.

  4. Captcha just failed by Anonymous Coward · · Score: 2, Interesting

    If it can read a digit in a pattern then capchta just became useless as a way of keeping bots out of a website

    1. Re:Captcha just failed by djl4570 · · Score: 5, Funny

      Then we'll have to migrate to "Cutest kitten"

    2. Re:Captcha just failed by jdpars · · Score: 2

      Except that house numbers are generally made to be readable at a distance, whereas captchas are made to be readable up close, with scrutiny.

    3. Re:Captcha just failed by d4fseeker · · Score: 1

      Captchas are made to be unradable. The average human success rate at some Captchas is far lower than that of a OCR-based cracker.

    4. Re:Captcha just failed by Baloroth · · Score: 2

      You should look up how the captcha system works (or reCAPTCHA, anyways). It is digitally scanned old print (such as old editions of the New York Times). They feed it through several OCR systems, and the ones that it doesn't work on get thrown into the CAPTCHA system for humans to identify. This allows them to a) digitize massive amounts of old print material (using the humans interpretation of words that can't be read by computer) and b) ensure large sources of CAPTCHAs which are unreadable by present OCR systems. Material that can be read by the new OCR system simply won't end up as captchas.

      --
      "None can love freedom heartily, but good men; the rest love not freedom, but license." --John Milton
    5. Re:Captcha just failed by satuon · · Score: 1

      They can't use the words from old print because they don't know what it reads, so they can't know if you've answered correctly. The captcha that is used for verification is computer generated, that's why reCAPTCHA has 2 words - one is generated (hence they know what it is), and one is from old print. If you answer the computer generated captcha it's assumed that you've answered the other one as well. They also send the same old print word to 2-3 people to see if their answers match. That way Google gets a crowd-sourced human-brain-driven OCR machine for free.

    6. Re:Captcha just failed by justforgetme · · Score: 2

      you are unfamiliar with the tech underlying recatpcha

      why do you think it requests two words?
      one word it has classified one it hasn't.
      If you input the classified word correctly you pass. The input of the unclassified word is buffered.
      Unclassified words that get consistent results are added to the classified word pool and used as validators.

      It is a basic ml application whose computing is outsourced to the humans on the Internet who want to post
      on canHazMcRib.org and a very elegant solution as well. if you want more info on Luis von Ahn's ventures
      start with his ted talk.

      --
      -- no sig today
    7. Re:Captcha just failed by Ihmhi · · Score: 1

      Being pedantic here, but one of the words was probably read by OCR. One word in reCAPTCHA is unknown and the other is verified (but blurred, struck out, color inverted, etc.). It's likely that the machine-readable word was originally caught properly by OCR.

  5. Google is Skynet by itsybitsy · · Score: 2

    It's clear now.

    1. Re:Google is Skynet by Avarist · · Score: 1

      Skynet actually already exists. It's a Belgian company.

      --
      In Capitalist US, the commerce controls the Government.
  6. Street number reading by Animats · · Score: 2

    I've been wondering when they'd make that work.

    In retail areas, street numbers tend not to be too prominent. It may be necessary to read business signs and use that data to disambiguate addresses. This would help to clean up the phony-business problem in Google Places. An alternative is to use real estate records, as the USC Geocoder does for some areas, to get a solid lock on address vs. physical position. But that data is only available for some areas. There are also the Census Bureau's TIGER/LINE files, but they're US only and not complete for the entire US.

    Outside the US, this is likely to be more useful. If you have a few street numbers and a few business signs per block, you can infer the rest reasonably accurately.

    1. Re:Street number reading by ortholattice · · Score: 1

      In retail areas, street numbers tend not to be too prominent.

      I have found that to be true and don't understand it. While businesses loudly proclaim their names and logos, their street numbers are often barely visible if present at all. When dealing with heavy traffic, you often can't just leisurely slow down and rubberneck to search for some street number clue in an unfamiliar area. (Well, pre-GPS anyway.) More than once I've just given up trying to find a business while driving and gone to a competitor I was already familiar with. Is there some marketing reason that makes a street number detrimental to business?

    2. Re:Street number reading by Animats · · Score: 1

      While businesses loudly proclaim their names and logos, their street numbers are often barely visible if present at all.

      That's true, and annoying. Some cities require street address signs but most don't.

  7. I have an idea.... by salva84 · · Score: 4, Funny
    1. Re:I have an idea.... by mapkinase · · Score: 1

      I wonder how passing like this through unsanitized forms works. The final SQL statement will include unclosed quotes and would produce an error.

      --
      I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
    2. Re:I have an idea.... by mapkinase · · Score: 1

      Nevermind, saw the comment in the end.

      --
      I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
    3. Re:I have an idea.... by jheath314 · · Score: 1

      Did you really get the city to change your house number to 404'); DROP TABLE Streetnames;-- ?

      --
      Procrastination Man strikes again!
  8. Is this the paper Norvig mentioned? by Obispus · · Score: 1
    Is this the paper Peter Norvig (quite tangentially) referred to near the end of this article in the MIT Tech Review? Can anybody confirm?

    Thanks...

  9. I wonder by koan · · Score: 1

    If the proliferation of CCTV using facial recognition could be tied into Google allowing you to Google for faces and where they were last seen.

    So using a 3 Dimensional recognition algorithm and saving that data plus the location would then allow you to drag a picture of your friends face into Google (as you already can) then get results of where that face was last seen.

    --
    "If any question why we died, Tell them because our fathers lied."
  10. Isn't this a good job for the mechanical Turk? by wisebabo · · Score: 1

    Could humans, with their supremely evolved optic nerve/brain, outperform any A.I. that Google might have now or in the near future? Might it even be cheaper? (Just how much does difficult A.I. cost in the cloud anyway?). Maybe even if A.I. is much cheaper, it would still be useful for using humans for the difficult cases, or as error correction. Best would be for the humans to train the A.I. (and themselves out of a job!)

    I always thought this was the only possible reason how the machine intelligences in "The Matrix" could use humans. While they lay in their pods, dreaming their lives away, use the unused portions of thir brains (no, not 90%) to perform tasks that humans are "better" at. Like image recognition or perhaps language. That whole "battery" idea was just stupid.

    Then again they should've had their eyeballs plucked out and a connector wired directly to their optic nerve. Kinda would've made making a movie, in which Neo and all the other humans were stumbling around blind, difficult.

    1. Re:Isn't this a good job for the mechanical Turk? by RockDoctor · · Score: 1

      I'm sure that it could be done by the Mechanical Turk. But considering the amount of data that they've got ... possibly it would be too expensive. The Turk may be flexible and relatively cheap, but it's still got a finite cost per computation, and meat-puppets are not particularly cheap. (It might be useful for cross-checking hit rates etc though. Quicker and cheaper than coding and testing a second/ third/ fourth algorithm in detail.)

      --
      Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"
  11. Last lesson of ml_class.org by doudou42 · · Score: 4, Interesting

    It's interesting to note that one of the co author is also the teacher for stanford free classe on Machine Learning and that the last lesson of the course was on this topic...

    1. Re:Last lesson of ml_class.org by Mr.+Shotgun · · Score: 1

      You noticed that to? It's nice of him to pass along some of the stuff he is working on as a lesson for the class.

      --
      Of all tyrannies, a tyranny sincerely exercised for the (supposed) good of its victims may be the most oppressive
    2. Re:Last lesson of ml_class.org by w_dragon · · Score: 1

      Don't most profs do that? Isn't that the whole point of having PhDs doing both research and teaching? I know most of my 4th year profs talked about what they were working on in class...

    3. Re:Last lesson of ml_class.org by RockDoctor · · Score: 1

      nice of him to pass along some of the stuff he is working on as a lesson for the class.

      Almost as immoral as using the Mechanical Turk! (As someone else suggested upthread.)

      --
      Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"
  12. Captcha by numbscholar · · Score: 1

    Will it work with those annoying recaptcha boxes?

    --
    GENERATION 26: The first time you see this, copy it into your sig on any forum and add 1 to the generation.
  13. It's not AI. by forkfail · · Score: 1

    It's machine learning.

    There's a difference.

    --
    Check your premises.
    1. Re:It's not AI. by sanchom · · Score: 1

      It's machine learning.

      There's a difference.

      Please explain the difference, then. At Stanford, UBC, MIT (I assume other places as well), their artificial intelligence labs include their machine learning groups.

      For example, at Stanford, the Artificial Intelligence Laboratory includes the research groups of Daphne Koller, Andrew Ng, and Sebastien Thrun, who all describe their research as Machine Learning.

      Andrew Ng, a machine learning research and instructor for the online machine learning course (http://www.ml-class.org/course/auth/welcome) is director of the artificial intelligence laboratory at Stanford.

      If you say the techniques in the article are machine learning, then they are also artificial intelligence. The original post title is correct.

    2. Re:It's not AI. by sanchom · · Score: 1

      I did google it. That's how I found out the info in my post, and I also the wikipedia article you linked to. The wikipedia article says, as I did in my post, that machine learning is a branch of artificial intelligence. So if you say something is machine learning, it is also artificial intelligence.

      Calling this work "AI-based" is correct.

      The amount of times you'd have to tell an AI something in order for it to learn is irrelevant to this discussion.

  14. there's a famous street sign in china by circletimessquare · · Score: 1

    they were trying to appeal to english speaking olympics visitors in 2008, and the translation server crashed, but they thought the error message was the translation, so we get:

    http://boingboing.net/2008/07/15/chinese-restaurant-c.html

    so would it be like divide by zero if machines try to map the real world and encounter a bit of the real world mimicking machine world fail?

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  15. Eh? by MrL0G1C · · Score: 1

    What the f*** would the point of that be? in the UK they blur out door numbers and street signs which is bloody inconvenient when you're researching visiting a place.

    --
    Waterfox - a Firefox fork with legacy extension support, security updates and better privacy by default.
  16. Hacking the system: by drolli · · Score: 1

    You open a shop in the same road as a very sucessful one and put their number sign (and the shop name, in case they use OCR on it) just when the google car passes by......

  17. Sponsored by the US drone program by Mister+Liberty · · Score: 3, Funny

    n/t

  18. The number most frequently seen by Harald+Paulsen · · Score: 1

    is 867-5309

    --
    Harald
  19. I'm sure the military has already worked this out by Trax3001BBS · · Score: 1

    I remember when I first heard that the spy satellites could read a cars license plates.

    I thought that's good trick, not only do they have to view small characters they
    had to do it sideways.

  20. Licensing issues, more likely by msobkow · · Score: 1

    In Saskatchewan, we have a Crown Corporation that is responsible for maintaining, updating, and distributing that map data for the province. It's accurate data -- collected using actual transits and GPS systems. Google can not improve on the data quality by reading street signs with AI at all.

    I suspect it has a lot more to do with licensing issues for Google wanting to republish the data to the general public, something that I'm sure our crown corp would frown upon as it would cut out their geodata sales market. Google wants a royalty-free data source, not better data quality.

    --
    I do not fail; I succeed at finding out what does not work.
  21. Re:AI required. by RockDoctor · · Score: 1

    For an advanced Tolitarian state

    Is that an oxmoron? Possibly. But I'll let it pass for the moment.

    to perpetuate a surveillance society, an AI must exist

    MUST ? I really hate people who use excessive didacticism ; I think that they should be taken out and shot in front of Jeremy Clarkson's family.

    which is powerful enough to search the infinite amount of data created,

    Consult your dictionary for the difference between "large" and infinite. If you feel brave, or are already under medical care, you may wish to read Georg Cantor's versions of the Necronomicon.

    and determine it's enemies as efficiently as the market determines winners and losers.

    OIC, it's all a joke. Sick fuck.

    --
    Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"
  22. Or they could team up with OpenStreetMap by olau · · Score: 1

    ... and get an army of volunteers to help them.

  23. StreetView is very low-res by LanMan04 · · Score: 1

    Most StreetView images are soooooo low res that you can't make out street numbers on most houses.

    Maybe businesses with large street number signs, but not houses.

    --
    With the first link, the chain is forged.