Cloud-Powered Facial Recognition Is Terrifying
oker sends this quote from The Atlantic:
"With Carnegie Mellon's cloud-centric new mobile app, the process of matching a casual snapshot with a person's online identity takes less than a minute. Tools like PittPatt and other cloud-based facial recognition services rely on finding publicly available pictures of you online, whether it's a profile image for social networks like Facebook and Google Plus or from something more official from a company website or a college athletic portrait. In their most recent round of facial recognition studies, researchers at Carnegie Mellon were able to not only match unidentified profile photos from a dating website (where the vast majority of users operate pseudonymously) with positively identified Facebook photos, but also match pedestrians on a North American college campus with their online identities. ... '[C]onceptually, the goal of Experiment 3 was to show that it is possible to start from an anonymous face in the street, and end up with very sensitive information about that person, in a process of data "accretion." In the context of our experiment, it is this blending of online and offline data — made possible by the convergence of face recognition, social networks, data mining, and cloud computing — that we refer to as augmented reality.'
The first real-world, publicly available use of this will be an app that lets you:
1. Take a picture of someone with your smart phone
2. Find naked pictures of this person online
BRB, heading to the local college campus...
This is why Google shelved their version of this tech. The implications were too big.
Having studied this in college and witnessed many failed implementations of it I casually ask: Where are the recall rates (see also sensitivity and specificity) of these experiments?
Because when I read the articles, I found this instead of hard numbers:
Q. Are these results scalable?
The capabilities of automated face recognition *today* are still limited - but keep improving. Although our studies were completed in the "wild" (that is, with real social networks profiles data, and webcam shots taken in public, and so forth), they are nevertheless the output of a controlled (set of) experiment(s). The results of a controlled experiment do not necessarily translate to reality with the same level of accuracy. However, considering the technological trends in cloud computing, face recognition accuracy, and online self-disclosures, it is hard not to conclude that what today we presented as a proof-of-concept in our study, tomorrow may become as common as everyday's text-based search engine queries.
How you want to decide Google passed on continuing down this road is up to you. Frankly, I would surmise that the type I and type II errors become woefully problematic when applied to an entire population. Facial recognition is not there yet, not until I see some hard numbers that convince me the error rate is low enough. Right now I bet if you were to snap pictures of 10,000 people, you would incorrectly classify at least 100 of them leading to wasted time, violated rights and wasted opportunity (depending on the misclassification).
My work here is dung.