George Mason University Speech Accent Archive
JT Olds writes "Apparently George Mason University is running a project to document differences in speech and accents from different backgrounds and the like. They have a paragraph that 306 sample readers have read and recorded, and all of these sound files are categorized by background, gender, age, and other things. They say that this is primarily for teaching and learning, and is especially useful for any linguists out there, but I just thought it was cool. The sound bytes are released under the Creative Commons license. Of course, the Google cache of the main frame is here.
As a side note, I did get the link to this from Penny Arcade's Jerry Holkins."
No cockney support? Insensitive clods...
It's nice they named a University after him, after all, he did save Jack Bauer's life by swapping seats on the plane with the nuke...
The spoke's on JOO!
We were just talking about how the British English language was the true "natural" English language, all other derived languages that were English with an accent. For example, If I (a person who lives in America and speaks US English; no born American (thank goodness)) were to go to England and converse with an Englishman; who would have the accent, me or him? The obvious answer, as a lot of Americans fail to realize, is me.
I think I just found a great tool for my role playing game ...
Robert Anton Wilson
Getting speakers of English as a foreign language to repeat a standard English phrase. It's highly unlikely that this produces accents in the sense of two speakers of the same language would recognise. I.e. would a Flemish Dutch speaker recognise the accent of a Dutch speaker from Amsterdam when mangled through an English phrase? Somehow, I don't think so.
It might be useful for tracing people's origins when they are in an Anglosaxon country. But you might as well just ask them.
What would be more useful, perhaps, is a study of the relative differences in accents between native speakers of the "same" language, and how these differences come about.
Ceci n'est pas une signature
This is a really wonderful idea. However, I worry that it has a copule significant problems for researchers. First, for computer analysis work, a paragraph is likely too short to be useful. It can take a *lot* of audio data to make up for one-time variations. Second, cleanliness of the recording. Since anyone can submit a recording, not only will the recording environments and devices differ, but it is unlikely that any recordings will be made in the kind of studio-quality or lab-quality environment that would make these most useful for analysis work.
I'm not a speech synth/recognition researcher, but I do know that generally, for speech research, much stricter constraints are placed on audio being acquired. The extreme variety of the site is nice, but I'm not sure that it outweighs the drawbacks.
May we never see th
When I found this site a few days ago (linked on Penny Arcade), one of the first things that came to mind was how useful it could be to an actor who has to learn how to do a certain accent. In some of the more common accents they even have a list of rules on how most speakers of that other language speak (e.g. many Japanese speakers reverse their R's and L's).
I belong to the ______ generation.
Actors/voice actors have "dialect tapes" which they study to learn accents. I have a few and generally they start by giving vowel substitutions, common phrases and syntax, and then move on to insanely boring phrases you must repeat while trying to copy their accents and inflections.
Religion is a gateway psychosis. -- Dave Foley
Have you even been to the UK? There are a LOT of different accents there :)
you think the Klingon thing has been played out already?
...who finds these inexplicably hilarious?
Should I feel guilty for doing so?
I believe the correct answer is both. Everyone has an accent. An accent is the part of speech which is neither specific to an individual or to the language. It varies by region, background, or time period. If you were to go back to the Old English days (there is no "single" English language as it has evolved over time) it is unlikely that anyone would understand you. The same for the Brits.
Maybe the question you meant was which is closer to "correct". If you consider correct to be closer to the root of the evolutionary language tree then the Brittish are probably closer since the Americans' language changed more quickly since the split.
Great they are all encoded in a proprietary format :(.
Anyone know how to get quicktime working in Firefox on Linux (Gentoo)?
----
Japanese people don't REVERSE L and R, they just can't pronounce L at all. A lot of people (stupid people) imitating japanese accents reverse the l an r because they think it sounds japanese. It doesn't. It's justs stupid. BTW, they call it "Engrish" because they just can't say "English." Its just like how I cannot roll my R's no matter how hard I try. Thus, when I speak Spanish, I sound funny when saying words containing rr. If you want proof of this, just look on any Japanese Katakana or Hirigana chart. These contain all the phonetic sounds in the Japanese language. notice there is no L.
Sound samples? Stepford Earth? *hides*
( o ) one could say I'm rather baked
They say that the first accent was a grave mistake...
And while there I saw a chalkboard outside a cornerstore with a joke on it:
;))
A foreigner was at a sheep farm watching them shear the wool off the sheep. Knowing a better way he said "Here, let me show you how to shear your sheep"
The Kiwi replied "I'm not shearing with anybody!"
Never let it be said that Kiwi's don't know how to laugh at themselves! (and for this instance we'll forgive them their rediculous accents
it doesn't work on my machine, where do i downloa
This stuff is cool, IMHO. In case anyone's interested, here's the Swedish version of the concept: http://swedia.ling.umu.se/
In SweDia you can listen to 100 Swedish dialects recorded 1998-2000. Hurty flurty schnipp schnipp!
Nintendo sues "George Mason University" for their "Speech Accent Archive", saying that the university is guilty of trademark infringment on nintendo's patented "Hellooo it's meeee Marrrrrioooo " and that they're trying to take advantage of the copyrighted italian accent in their work...
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
Here are some nice examples of this and more. It also goes the other way.
At web page looking at, catalog creating they are different speeches and styles on planet Earth I see. English language evolved on other planets what about analyzing? Mmmmmmm...
The Collection won't be complete w/ out Father Guido Sarducci Or the Jive Guys from Airplane
Sanity is the trademark of a weak mind. -- Mark Harrold
This sounds like a perfect collection to train a neural network on! Anyone tried that yet?
Sure - there is no 'high/pure language' outthere. Everything else is linugistics aristocracy. The archive is an example of the great variety among L2 English speakers (those who didn't acquire the language 'naturally' before the age of about 13). However, the meta-data about the speakers collected seems somewhat insufficient, and if you would actually want to use the corpus scientifically, you would want a greater consistency among the samples: only people that haven't lived outside of their original region and learned English 'academically', for example. The phonetic transcriptions, however, seem very useful!
I like role playing, too. I'm German (and native German speaker), I've spent some time in the U.S. when I was around 16-17, now, at the age of 26, I've been living in Ireland. Most people will know that the Irish accents are pretty distinctive (non-linguistics might say that every sentence sounds a bit like a complaint). I have found that most Americans that I talk to guess that I am Irish, or at least they say that I have an Irish accent. Most Irish people say that my accent is American. A couple of months ago all around the world in Oz, some cab driver guessed after five minutes: are you from Ireland? In most cases, when people talk to me for more than a few minutes, they wonder where I'm from, as they can't tell. It's strange, how adaptive we are. I assume that the fact that I'm not a native speaker helps me in adapting. For example, in my native German, I didn't assume a Berlin accent after four years there.
No. Use a real OS.
They missed quite a few accents.
Bill Shatner
Christopher Walken
Dana Carvey's Ross Perot
James Stewart
I you havent done anything wrong, then you have nothing to fear.
..as in, stay a while and yours will change, probably end up somewhere mid-atlantic. You too could end up sounding like Gross Lloydman.
I'm English, with pretty much a 'received-pronounciation' accent. Following 3 years spent working in Asia, when I returned home 2/3rds of the people I met for the first 6 months asked if I was Australian. Afetr a while I was sorely tempted to just reply 'Ah shit yeah.'
emerge xine-lib xine-ui
I'm not a prophet or a stone-age man,
I'm just a mortal with potential of a super man.
L and R can lead to problems in everyday life. I worked in Asia for three years where during introductions my surname, Clark, usually caused facial expressions akin to a bulldog chewing a wasp.
Of course its reciprocal. I paused the first time I met a Mr. Ng.
(scottish accent)
Ack! Me retirement grease!
(/scottish accent)
due to moving around the states... So, am I unique, or just a mutt?
I call computer-illiteracy job security
For doctors practicing in Boston:
Doc "Ok, open your mouth and say 'R'"
patient "Ahhhhhhhhhhhhhhhhhhhhhhhhhh"
Doc "Good!"
http://github.com/gbook/nidb
The IDEA archive has a far more complete collection of accents and voice samples. Excellent source material for geeks who work in film, TV or theater.
... at least in the view of Robert Pirsig. In Lila he called it "plain" or "plains" speaking, characterized by an economy of words and minumum of affect. That is, the accent and style typical of Midwesterners and Westerners.
Interestingly, most U.S. newscasters either come from the Midwest or cultivate that neutral way of speaking, so there must be a preference for it.
America isn't the lingistic island it once was though. This is mostly good but some asinine things seem to creep in. For example, it eats my ass that U.S. news media force the British style of a/an. While clearly this is recognition of the global market for our news 'product', we Americans aren't in the habit of dropping our "H"s so saying "an historic" just sounds affected - and wrong.
by the way, hear me as english #49, dude...
There is an error in the transcription of the Afrikaans speaker from Pretoria. Near the beginning, the first vowel in the pronounciation of "ask" is wrong. It's transcribed as a backwards c. That's appropriate for the vowel in "call" prior, but in "ask", it's pronounced as an unrounded vowel, and it's lower.
I thought creative commons idea was a smorgasbord of license subparts that you could pick and choose amongst to create "a" CC license.
Our strategory is to maintain the new world order with our nucular capableness while simultanically hunting down criminables like Osama Bin Laden to stop their terroristic practicings.
The government doesn't agree with that. Reach for the sky, unperson!
Just out of curiosity:
When Dick van Dyke adopted a "cockney" accent in Mary Poppins, he was beloved by Americans but panned by the English. Yet most people didn't realize that Monty Python's Terry Gilliam wasn't English and that his accent wasn't natural, or if they did, they didn't hold it against him. For years, I thought Peter Jennings, who was based in London for ABC news for many years, was British because he spoke with an accent at that time.
If you adopted an English accent,
a) Would the British people recognize it as being "fake"?
b) Would they treat you more favorably? Would they view it as offensive (such as a person trying to fake their way into a higher social status)?
there is no one CC license, there are at least 5 or 6: http://creativecommons.org/license/
tasty electronic music vittles
Surely this would make good reference material for voice recognition systems?
They need to sample the New Orleans (Nawlins) taxi radio chatter. It's like a peat bog crammed with living fossils of American history.
--
make install -not war
In English there isn't an official accent (BBC "Received English" notwithstanding). Other languages have different conventions.
For example, German. There is an official "High German" (Hochdeutch) that is learned in school and is considered "correct." Other dialects, of which there are many of course, are considered "nonstandard." This is more than just a Texan being proud of speaking Texan, they are really considered different. Someone who speaks Hochdeutch natively (there are a small number) are considered by others to have "no accent."
Remember: this is a language that standardises its spelling, hyphenation, capitalisation and comma usage by international treaty. Making one accent official is comparatively speaking, trivial.
As a native english speaker myself, I find this all all a bit berserk. But other people, other ways.
A friend of mine speaks German fluently and worked for a while in Germany. Sometimes just for fun he would speak German with an exaggerated Southern (American) accent. It was always puzzling to the Germans because they could easily understand him, but could not place the accent. "Are you from Austria?" was the most common response.
Q: What does the "B." in Benoit B. Mandelbrot stand for? A: Benoit B. Mandelbrot
uhoh.....an AC who's all Freudian.
(MoFoQ reads original post) Hmmm...interesting, but I wonder what the IP-mafia (RIAA, MPAA, etc.) here in the states think about it.
It's too bad the entire contents of the Library of Congress isn't digitized and shared (something like out of ST:TNG).
or maybe it would help if we all stopped being broke...o wait...the economy (especially the tech sector) is down Fat Bastard's (the "corn in crap" guy from the Austin Powers movies) toilet....so it's a bit hard to "recover".
(long period of dumbfounded silence)
aawww...gawd....now I have to scrub my mind with bleach....thx alot for the imagery, my sic mind....
either way, I don't see the correlation between the archive and being too "cheap to get a mac".....or maybe there was a speaker at GMU who talked about macs and all.
I'm not that far into the DVD series yet. Thanks, asshole.
I concur on your second point. I've never tried the third one, as I don't hang out with any of the American expats here, and even if I did they would want to speak English, not Norwegian.
People say I'm crazy, I got diamonds on the soles of my shoes...
Still if you listen hard and know the accents already you can almost imagine what the real versions sound like. I hope they continue to gather a more realistic sampling.
Moderating a post as "-1 Troll" isn't always a sign of ignorance, but clearly it was in this case. Someone in apparent ignorance of Creative Commons decided to punish you -- and the /. reader community in general -- by using their moderation power to flaunt their ignorance. Another red letter day for Slashdot's moderation system.
As you accurately and Informatively pointed out, CC is a set of licenses, not one license. The person who submitted the story is apparently just as ignorant about that as the person behind the mod down of your message.
Unfortunately, when you go to the site, there's no indication of any CC license (at least none that I can see, if one applies they're certainly not making it obvious.)
No Laughing Allowed!
I'll give them a sober one, which is a very strange blend of a deep South Texas accent, Nawlins ( New Orleans to all yall ), and a bit of Australian picked up off of one of my best friends and some co-workers who happen to be from there. Accent, syntax, etc. is a total muddle of the three ( shit, trying saying Zed for Z in the US, noone knows what the fsck you're talking about ).
Of course, get me drunk and I'm a prime candidate for a remake of Hee-Haw. If you're not a Southerner, and I'm drunk, good luck understanding a fucking word I'm saying.
PC moderators can suck my White pierced, tattooed dick. If you think pride == hate, s/dick/Aryan meat mallet/g.
I've read that Canada is considered the best country to learn English in because it doesn't vary much across the country*, it sounds enough like American to avoid prejudice, and it is the easiest to understand by other English speakers. (Does anyone know if it's the case that a Texan and a Scot will understand a Ontarian better than each other?) I wonder if it also has something to do with not covering up other accents, for example Europeans who learned English from Brits have a hybrid accent (eg: actor Frederik de Groot, ING Direct Spokesperson)?
* My Canada does not include Quebec or Newfoundland.
Does the study show if software pirates say "arr" more often than other people ?
Forget magic. Any technology distinguishable from divine power is insufficiently advanced.
That these soundbytes are under creative commons license is cool; might it be possible to extract the phonetic sounds from each of these readers and create a shit-hot speech synthesizer?
...
nick
The BBC has an interesting bit on this very fact. Check out the Real clip, "How Did People Speak Then?". Towards the end, they play a reconstructed accent circa Bill the Bard's time. Those of you familiar with accents from the American South will hear something creepily familiar in the way the woman speaks.
I didn't think the house band in Hell would play this badly.
So, I submitted Fred, from Cupertino, CA spoken via Apple's OS X Core Audio, recorded by Ambrosia Software's WireTap. Fred, as you may, know is also the voice of Stephan Hawking and has done numerous cameo appearances on TV and film. I'm merely questioning what makes a voice "human."
PegQuin--I've got a sneakin' suspicion
Romajii is an *approximation* not a direct translation. Do you really think kung fu is how the words are said in Chinese? Or even Volkswagen in German?
Of course not. Hell, I said as much in my post. Did you, perhaps, mean to attach this comment to the parent, and not to my post?
If you don't like Americans, what are you doing here, you insolent swine?
-ccm
(PS. Before you reply, you should know that I was born in England, and thank my lucky stars that I am now an American citizen.)
Too much Law; not enough Order.
hmmm, mebbe u r on2 something there...
My father grew up about twenty miles from where Strom Thurmond was born, and I about twenty miles from there. But after I've been in deep conversation with non-Southerners, you would never know I was from the South unless you caught a particular turn of phrase. My father has a gentle lowland drawl (more Charleston than highcountry), and I once heard the century-old Thurmond speak, and to me he was damned near incomprehensible.
This is not my sandwich.
For some reason I have a very strong urge to go watch "A Streetcare Named Desire" right now, or failing that, "Oh Streetcar"