Why Your Online Impersonation of a 16-year Old Girl Won't Last Long
An anonymous reader writes "Can computers pick up your age and gender from your tweets? If you want to give it a try, here's your chance: 'To develop your software for age and gender identification, we provide you with a training data set that consists of blog posts, Twitter tweets, social media texts, as well as hotel reviews.' Well, at least my paid Amazon reviews are safe for the time being..."
I am sure you can pick up on general mood of a person, I am sure you can pick up various clues, but if somebody is set on hiding themselves by providing false information, I don't think you'll be able to identify them without what NSA calls 'meta data', as in pattern of your behaviour. I don't think it will be possible to do a better job than guessing at about 50/50 chance ratio about a person's age and gender if the person in question is actively trying to portray something he or she is not in a single conversation. From a pattern of behaviour? Yes. From a single conversation? ... this assumes too much about people. It assumes that 16 y.o. girls are also not pretending to be something they are not as well...
You can't handle the truth.
here's your chance to write one.
I wish I could shell out 300 euro to have my next million-euro commercial product crowdsourced
Why do we have to help them train the replacement twitter scanners after the EU is trying to cut off their direct pipeline?
-- Tigger warning: This post may contain tiggers! --
How are they going to catch pedophiles now?
For those puzzled by the description here, this is a software contest, with a 300 Euro prize :
>Why Your Online Impersonation of a 16-year Old Girl Won't Last Long
This is one of those posts where I have to read the summary, isn't it.
So you like to chat with 16 year old girls on line?
I'am sick of descriptions and subjects written in gibberish. Reminds me of spam emails.
Do you have a big cock, how big can your cock get?
Win £xxx pounds if your cock can be bigger than their cock.
Is your cock hidden? can people see your cock? CLICK HERE NOW!
How the fuck did this get authed? Not only is it the worst subject and description i've seen in a while, its an insult to the readers here. Seriously we are here for a reason, we dont have an attention span of a fish.
Oh yeah thats right, its a Slashadvert supplied by Corex.
No offence, but if this shit carries on, i may as well start reading my spam emails. Target your audience, not Facebook muppets ffs.
Researcher A: This... This c-can't be right!
Researcher B: What's not right?
Researcher A: Apparently, 4chan really is populated by little girls.
Researcher C: *posting anonymously on /a/* See guys, I told you -- wearing a skirt makes it work. Also, Check'Em.
According to the research, 83% of slashdot posters are 16yr old girls. Go figure.
The "dialectizer" http://www.rinkworks.com/diale... "translates" English to Redneck, Jive, Cockney, Elmer Fudd, Swedish Chef, Moron, Pig Latin, or Hacker. And there's an English to Ebonics translator at http://joel.net/EBONICS/Transl... so it won't be that difficult to get a translator that outputs 16-year-old-girl talk.
I'm not repeating myself
I'm an X window user; I'm an ex-Windows user
OMG!!! Ponnies!
Table-ized A.I.
screw the text content
timestamp, frequency and timing in general will give away much more information
Seems like it will be based too much on stereotypes. When I was 16, I typed in proper case (I wanted most of all to be a writer, and I used every opportunity to improve my grammar and typing), used big words in context, and did not easily use emoticons.
I suppose over time they could develop something keener, but if you're going to base it on the mean of all people in a certain age bracket, there will be enough exceptions to render the most useful applications of the software irrelevant.
OMG! Spelling Nutzis!
I could see a bunch of ways to make tons of money from this, starting with selling it to FaceBook for $19B.
Why would I publish it for 300 Euro again? I know they *claim* it's not published, but if they didn't sign an NDA, you're not going to get a patent out of it outside the U.S., and you're not going to have any protection against them just using your algorithms.
This is a really silly contest.
Didn't Gender Genie already attempt to do this? https://www.google.com/search?...
(I say "attempt" because I found that even in cases where I wasn't trying to fool it, it would often come up with the wrong gender.)
A friend of mine was working on sentiment analysis. They studied the content of yahoo answer and it was quite interesting all the correlations that you can make. The study is of course not enough to provide a direct identification, but it shows how many parameters you need to keep in mind when building a "virtual identity".
http://www.cse.ohio-state.edu/...
Now both people and computers will call me a girl.
You remind me of my wife, and this kind of "logic" is why women are arguably slightly worse writers.
If you think that is nonsense, than the whole original sexist line of argument is as well.
So what's your excuse?
Three hundred euro? The contest sparks my interest, but 300 is about what it would take to get me to fill out the entry form. To develop an effective NEW algorithm, code it, and test it in HOPES of winning the prize? Maybe for 300,000, maybe. 3,000,000 would be more like it.
I've developed exactly two truly innovative products. One I sold over $1 million worth, the other still provides $3,000 / month in net income . Why would I, or anyone skilled and innovative, touch this for 300 euro?
I managed to get my impersonation of a 16-year-old girl into the training set.
I second that!
Honestly no one uses usenet anymore for anything but porn and piracy.
Your comment run through the "Jive" filter at the link you provided:
Then that out put filtered with the "Elmer Fudd":
Then, since a moose once bit my sister, that output run through the "Swedish Chef" filter:
Your Text, Dialectized (bork)
Yuoov Text, Deeevectized (jeefe-a)
De-a "deeevectizoo" http://vvv/ [vvv] su coot me-a sume-a sveck, Jeck.veenkvu'ks. Um gesh dee bork, bork! Ooh, det scvooy vebbeet! cum/deeeve-a... [veenkvu'ks. Um gesh dee bork, bork! Ooh, det scvooy vebbeet! cum] "tvunsvetes" Ingveesh t'Vedneck, JIBE, Cuckney, Ivme' Foodd, Svedeesh Cheff, Mu'un, Peeg Veteen, oo' Heckoo. Eh be-a beeed. Bork bork bork!.. Und dooe's un Ingveesh t'Ebuneecs tvunsvetu' et http://messa/ [messa] v.net/IBONICS/Tvunsv... [Messe' v.net] su's it vun't be-a det deefffficoovt t'get sume-a tvunsvetu' det ooootpoots 16-yeev-oovd-guet vep.
--
I'm nut vepeeteen' meh'sooff
I'm un X veendoo usoo; I'm un 'es-Veendoos usoo
Aannd, then run that output back through the "Jive" filter:
All of that proves that.....I'm easily entertained. :-)
Too bad there was not a Japanese anime filter as well.
Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
1. You can assume that someone is female if they act like a man without reason or accountability. 2. Anyone that shows interest in you sexually is an undercover police officer.
Really, these people need their dick sizes predicted by comments on SlashDot. Maybe then they'd understand that any form of profiling is equally bad, regardless of what you profile on.
I was promised a flying car. Where is my flying car?
What kind of person is willing to spend 300 euros for detecting 16 year olds online, mmmm?
Computers are very bad at judging these things. It isn't their strength.
I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
your approach is "genetic programming", some sort of unsupervised learning / reeinforcement learning.
grammar nazi, its called a grammar nazi
You are going to have to pay me more than 300 euro for that. ~400 should do it.
Just read the competition, and thought I should point this out:
participants will be asked to classify the author of a set of tweets as journalist, politician, activist, professional, client, company, authority or citizen, since the fact of belonging to a certain category could determine the importance of the user's opinions.
If you submit a solution, know that it will be used to classify journalist and activist communications.
What's my motovation to hide as a 16 year old girl?
Look at the original proposal by Turing. I imagine Turing was very conscious about gender issues since after winning wwii for the old boy network they decided he was a felon. It seems to me the five eyes gives us Turing test success and...wait for it ... means that is not really what we meant by human intelligence.
I have come to the conclusion that it is because the world is filled with f*cking morons and that most people in charge of large sums of cash are the stupidest of the whole lot. I mean Whatsapp is a crippled xmpp chat app making no profit and it gets sold for tens of billions of dollars and I watch tv and see economists say whatsapp should have held longer because they could have gotten more money... what the hell. half to three quarters the people on slashdot could write a better chat app and none of us would make enough money off of it to pay for the webhosting for it. How much longer can it be until this mobile/cloud version of the .com bubble bursts and the market gains some sanity?
---Saying gnome 3 is better than windows 8 not so much a compliment as it is damning with light praise.
Togged to the bricks, and all they offer me is a trip for biscuits. Just wanted a ring-a-ding ding and some Bruno slips me a Micky Finn in a clip joint. Next thing I know, I'm waking up in a flop with some Lunger wearing iron who suddenly runs to the window and starts drilling beans at a tin can full of Joes. So I make tracks before the coppers turn up with a meat wagon and someone ends leaving in a Chicago overcoat.
Sure enough, the cow costume was hanging up next to the superhero outfit and sailors uniform. (S,Spud)