The Science of Word Recognition

AAAAAARRGGHHH, I'm going blind! by rock_climbing_guy · 2004-09-01 21:07 · Score: 5, Funny

Would one of those stupid comments about the colour scheme on /. be on-topic now?

--
Wh47 d1d j00 541, 31337 15n't t3h r0xor5 ne m0r3???

Honest!!! by TheWingThing · 2004-09-01 21:11 · Score: 5, Funny

I was reading what was written on her T-shirt!

Re:Honest!!! by rock_climbing_guy · 2004-09-01 21:16 · Score: 5, Funny

I liked the t-shirt that said,
(in big letters) If you can read this,
(in slightly smaller letters)you obviously must have
(in still smaller letters)very good eyesight.
(in smaller letters)While you're down here, why don't you give me a blow job?

--
Wh47 d1d j00 541, 31337 15n't t3h r0xor5 ne m0r3???

Eye movements? by ImaLamer · 2004-09-01 21:19 · Score: 4, Interesting

"With the assistance of fancy eye-tracking cameras researchers have been able to devise several clever experiments to give us new insight into how reading works."

Oh they must have been using EyeQ....

I can read at 44692 words per minute! Thanks for posting that long article for me to read, I needed the exercise.

And thank you EyeQ! You're the greatest!

Really though, they say that more letters/words mean faster reading times. It's true. Think about a book or article you've read. When the words are together on the page it's easier to read because your eyes can jump around, letting your brain fill in the blanks.

Ever read something that made sense but you couldn't quote it word for word? It's likely because you read in this same way.

--
Get your Unix fortune now!

Quotation by Anonymous Coward · 2004-09-01 21:20 · Score: 5, Funny

"Evidence from the last 20 years of work in cognitive psychology indicates that we use the letters within a word to recognize a word."

Man, I'm so glad they finally figured this out...

I love how by FS1 · 2004-09-01 21:20 · Score: 5, Insightful

Does anyone else think that merely analyzing how English is read is very closed-minded? I'm pretty sure only a very small percentage of the world speaks and reads English.

I would love to see a study comparing how English is read to how Chinese is read by native speakers. Very interesting, I would gather.

--
A Fatal OE Exception has occurred, Sig will now reboot.

Re:I love how by ImaLamer · 2004-09-01 21:44 · Score: 4, Interesting

You're right. It would seem that for better analysis comparing Hebrew/Chinese to English would be better.

Maybe we can learn even more about our way of reading, like: Is it the most efficient?

Is right to left or left to right the best way to go?

Interesting side note (don't know why I'm bringing this up...): President #20, James A. Garfield, could write in both Latin and Greek at the same time.

--
Get your Unix fortune now!

Re:I love how by dave420 · 2004-09-01 22:22 · Score: 4, Insightful

There are roughly 400 million people with English as their first language, true, but there are even more with English as a second language. If you're looking to select a language to base a study on, and you want it to be accessible, then you choose English. It really is that simple.

Also, Chinese is character-based, not letter-based, so the research would be completely different. Kind of like asking someone who's studying jet aircraft to study cars as more people have them.

Re:I love how by dave420 · 2004-09-01 23:27 · Score: 4, Interesting

No, there's lots of study on the matter, and it's shown that Chinese people interpret their written language in a completely different part of the brain than English-reading people. That fact alone means a completely different method is at work... :)

Reading about how we read by DrFrasierCrane · 2004-09-01 21:22 · Score: 5, Interesting

While reading the article, I suddenly became hyper-aware of how I was reading the article. :-)

Don't let the Microsoft name scare you off - the article makes for a fascinating look (pun intended) into how we read. I wonder, though, if these findings are duplicated with written Oriental languages.

--
You call this a signature?

What about other writing systems? by mocm · 2004-09-01 21:22 · Score: 4, Interesting

Since most people in the world don't use the Latin alphabet, it would be interesting to find out how word recognition works for them. And how they read words in our alphabet.

--
***Quis custodiet ipsos custodes***

Re:What about other writing systems? by ajs318 · 2004-09-01 23:23 · Score: 5, Interesting

They probably have already written papers on it ..... in their own languages.

Want my theory? I think the brain uses multiple techniques in parallel, then releases resources from the ones found to be going nowhere. So at any one time you may be trying to read a word letter-by-letter, recognising the word from the Bouma shape, and picking likely words from context. The different techniques will have different successes depending on various factors (clean type vs. messy handwriting, familiar vs unfamiliar words, &c). So my theory is that the brain is trying various methods at the same time, each narrowing down the possibilities, and just goes with whatever produces a result first. As soon as that happens, any half-finished tests in progress are scrapped and their resources deallocated. The eye movements may well have something to do with this ..... different reading techniques require different resolutions, the eye is great at recognising outlines but needs to zero-in on details, once a clue is established from the word envelope. There is evidence that fonts such as Times are more readable than Helvetica, so maybe serifs add recognisability in their own way? And if this is what is happening, then it would explain some of the test results in the article too, since they were looking for a single technique in use at any one time.

If all this sounds inefficient, you have to remember that human beings are optimised for non-optimum conditions ..... for instance, we have kidneys that pack up if you drink nothing but de-mineralised water, and an immune system that goes berserk and tries to poison you with histamine if it doesn't get enough germs to fight off.
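
For the programmers reading along, the "run several techniques at once, go with the first result, scrap the rest" idea above can be sketched roughly as below. This is only an analogy in code, not a model of the brain: the strategy names and the delays are made-up placeholders, not measurements from the article.

```python
import asyncio

async def strategy(name, delay, answer):
    # Stand-in for one recognition technique; `delay` is an invented
    # placeholder for how long that technique takes on this input.
    await asyncio.sleep(delay)
    return name, answer

async def recognise():
    # Start all techniques at the same time.
    tasks = [
        asyncio.create_task(strategy("letter-by-letter", 0.3, "word")),
        asyncio.create_task(strategy("bouma-shape", 0.01, "word")),
        asyncio.create_task(strategy("context", 0.2, "word")),
    ]
    # Wait only until the first technique produces a result.
    done, pending = await asyncio.wait(tasks, return_when=asyncio.FIRST_COMPLETED)
    # Scrap the half-finished ones and release their resources.
    for task in pending:
        task.cancel()
    await asyncio.gather(*pending, return_exceptions=True)
    return done.pop().result(), len(pending)

winner, cancelled = asyncio.run(recognise())
print(winner, cancelled)
```

Which technique "wins" here depends entirely on the invented delays; the point is only the race-then-cancel structure.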

--
Je fume. Tu fumes. Nous fûmes!

So ... by Pegasus · 2004-09-01 21:40 · Score: 4, Insightful

when are they going to repeat these experiments in, let's say, China or Japan? I'm *very* interested in what the conclusions would be there.
From what I know about Japanese, they don't use spaces between 'words'. A single kanji represents the whole word and their outline is always more or less square. So the whole bouma theory fails here, as he finds out.
I'm sure they could learn more interesting things in other writing systems ...

Thought comes before language by alanxyzzy · 2004-09-01 21:42 · Score: 4, Informative

I would love to see a study comparing how English is read to how Chinese is read by native speakers.

There is an interesting article at the Harvard Gazette about research which seems to show that thought comes before language. The Korean language distinguishes between two meanings of "in" - fitting loosely or tightly.

Research shows that

Infants of English-speaking parents easily grasp the Korean distinction between a cylinder fitting loosely or tightly into a container. In other words, children come into the world with the ability to describe what's on their young minds in English, Korean, or any other language. But differences in niceties of thought not reflected in a language go unspoken when they get older.

Focuses on 1 script, 1 language by kahei · 2004-09-01 21:46 · Score: 4, Insightful

While some of the results here are interesting (but old), the fact that the entire study focuses on exactly 1 script and 1 language basically renders the conclusions worthless (as conclusions about cognition in general... I suppose they still have value as conclusions about English and the Latin script).

What has happened here is:

1 -- Observe people reading a given language/script

2 -- See how they make use of features of that particular language/script, such as tall letters, case, and the occurrence of 'skippable' words such as articles

3 -- Describe the way they use these local features, and call that a theory of reading in general.

I don't really understand how to apply a theory of reading based on word and letter shapes when there are so many people reading text in which:

--There are no letter boundaries, and/or
--There are no word boundaries, and/or
--Letters all have the same form factor

The experiments described would probably generalize very well to Arabic and Greek scripts, pretty well to Cyrillic (no tall/short letters to speak of), badly to Devanagari-type scripts, very badly to Chinese and Japanese, and not at all to hieroglyphics (though I agree that there may never have been a reader of hieroglyphics who was fluent by modern standards).

To pretend that these experiments apply to humanity in general rather than the author's own language/script choice is silly. It's an interesting article and I'm glad the research was done but unfortunately a certain failure to 'get' the multilingual nature of humanity, which I don't really expect to find in MS work, is in evidence here.

--
Whence? Hence. Whither? Thither.

Re:Focuses on 1 script, 1 language by hazem · 2004-09-01 22:05 · Score: 5, Insightful

Everybody seems to be giving this guy a hard time because he did his research for reading only English. My guess is that the guy reads/speaks English and has ready access to people who do the same. This research is a good start and seems to have valuable results.

Now someone else can work on a PhD Thesis by taking his work and seeing if it applies in other languages.

Isn't this how science works? You do research, try to make some conclusions, and publish the results. If you wait to publish until you've found the Grand Unified Theory of Everything, then nobody publishes anything and science doesn't advance at all.

I'm not sure that he missed anything. He has started with what he knows and has resources to study.

Re:Focuses on 1 script, 1 language by olau · 2004-09-01 23:43 · Score: 5, Insightful

To pretend that these experiments apply to humanity in general rather than the author's own language/script choice is silly.

You know what is also silly? To pretend that this was the conclusion, although clearly the paper nowhere stated that it had found the grand unified theory of how people read. Here's a hint: when the paper talks about reading, it is obviously talking about reading English.

Yes, the paper would be even more interesting if it included studies of other scripts, and the failure to acknowledge the existence of other scripts should be criticised. But the rest of your criticism is unfounded.

Article in short... by uss_valiant · 2004-09-01 21:55 · Score: 4, Informative

Further examination of the evidence used to support the word shape model has demonstrated that the case for the word shape model was not as strong as it seemed. The word superiority effect is caused by familiar letter sequences and not word shapes. Uppercase is faster than lowercase because of practice. Letter shape similarities rather than word shape similarities drive mistakes in the proofreading task. And pseudowords also suffer from decreased reading speed with alternating case text. All of these findings make more sense with the parallel letter recognition model of reading than the word shape model.

Of course he describes all the models before he concludes that from the three models, Word Shape Recognition (oldest), Serial Letter Recognition and Parallel Letter Recognition (newest), the latter is the one that is today the most accepted model.

Re:How we read... by Johan+Veenstra · 2004-09-01 22:02 · Score: 5, Informative

The example:

Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer in waht oredr the ltteers in a wrod are, the olny iprmoetnt tihng is taht the frist and lsat ltteer be at the rghit pclae. The rset can be a toatl mses and you can sitll raed it wouthit porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe.

But soon enough there was a counter example:

Anidroccg to crad cniyrrag lcitsiugnis planoissefors at an uemannd, utisreviny in Bsitirh Cibmuloa, and crartnoy to the duoibus cmials of the ueticnd rcraeseh, a slpmie, macinahcel ioisrevnn of ianretnl cretcarahs araepps sneiciffut to csufnoe the eadyrevy oekoolnr.

In the counter example, the letters are not randomly scrambled; they are in reverse order, except for the first and last letters.
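
If you want to try both transformations yourself, here is a minimal Python sketch. The function names are my own, and it assumes a "word" is simply a run of ASCII letters, so an apostrophe splits a word in two.

```python
import random
import re

def scramble_inner(word, rng):
    """Shuffle the interior letters, keeping the first and last in place."""
    if len(word) <= 3:
        return word  # nothing to rearrange
    inner = list(word[1:-1])
    rng.shuffle(inner)
    return word[0] + "".join(inner) + word[-1]

def reverse_inner(word):
    """Reverse the interior letters, keeping the first and last in place."""
    if len(word) <= 3:
        return word
    return word[0] + word[1:-1][::-1] + word[-1]

def transform(text, fn):
    # Apply fn to every run of letters; spaces and punctuation pass through.
    return re.sub(r"[A-Za-z]+", lambda m: fn(m.group(0)), text)

rng = random.Random(0)  # seeded so the shuffle is repeatable
print(transform("According to research", lambda w: scramble_inner(w, rng)))
print(transform("According to card carrying linguistics", reverse_inner))
# second line prints: Anidroccg to crad cniyrrag lcitsiugnis
```

The reversed version reproduces the opening words of the counter example, which shows that keeping the first and last letters fixed is not enough on its own to keep text easy to read.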

Re:aaah!! eyes hurt! by The+Grassy+Knoll · 2004-09-01 22:06 · Score: 5, Funny

>renerding on firefox

re-nerding! ha ha. Best... typo... ever...

--
They will never know the simple pleasure of a monkey knife fight

Re:Reduced Redudancy by Placido · 2004-09-01 22:07 · Score: 4, Interesting

>> How else would be understand a sentence like "The boy ate a ham___er" (with a few letters obscured)?

What a way to prove your point. I kept thinking "hamster", "hammer" and then eventually realised that I didn't spot your misspelling of 'we' and that I read right over it and filled in the blank.

--

Pinky: "What are we going to do tomorrow night Brain?"
Brain: "I would tell you Pinky but this 120 char limi

or maybe it's both? by Illserve · 2004-09-01 22:14 · Score: 4, Interesting

If there's one real take-home lesson of brain-design from cognitive science, it's that the brain tends to do everything several different ways in parallel, and then use the results from all of them.

Obviously it can't all be shape, there are plenty of words with identical shapes and yet these are distinguishable.

But it could certainly be true that we use shape and parallel letter recognition at the same time. Shape narrows the field of possibilities from millions to a small handful, and then parallel recognition chooses one of the options.

Whatever happens, you can be sure it's terribly complicated, extremely robust and very efficient.
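
A toy sketch of that two-stage idea, for what it's worth: first narrow the field by outline, then pick among the survivors by letters. Everything here is an assumption for illustration: the three-way letter classification (ascender, descender, x-height) is a crude stand-in for word shape, and the lexicon is a made-up handful of words.

```python
ASCENDERS = set("bdfhklt")
DESCENDERS = set("gjpqy")

def shape(word):
    # Crude outline: A = ascender, D = descender, x = x-height letter.
    out = []
    for ch in word.lower():
        if ch in ASCENDERS:
            out.append("A")
        elif ch in DESCENDERS:
            out.append("D")
        else:
            out.append("x")
    return "".join(out)

# Hypothetical mini-lexicon, just for the demo.
LEXICON = ["test", "tent", "text", "best", "fast", "dog", "cat", "bog"]

def recognise(seen):
    # Stage 1: shape narrows the field to words with the same outline.
    shortlist = [w for w in LEXICON if shape(w) == shape(seen)]
    if not shortlist:
        return shortlist, None
    # Stage 2: compare letters position by position and keep the best match.
    best = max(shortlist, key=lambda w: sum(a == b for a, b in zip(w, seen)))
    return shortlist, best

print(shape("test"))      # AxxA
print(recognise("text"))  # (['test', 'tent', 'text', 'best', 'fast'], 'text')
```

Note that five of the eight words share the outline AxxA, which is the "plenty of words with identical shapes" problem in miniature: shape alone cannot finish the job.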

Don't shout! by meckardt · 2004-09-01 22:34 · Score: 4, Interesting

From the article: ...lowercase text is read faster than uppercase text. This could also explain why nobody likes to read email where the other person uses all caps.

Re:Don't shout! by Seahawk · 2004-09-01 23:31 · Score: 4, Informative

And if you had read the rest of the article, you would know that this is just because 99% of all we read is lowercase.

People can easily be trained to read text in caps as fast as lowercase text - or mirrored text.

What I fail to understand is how randomizing the middle letters of a word doesn't affect reading much. I had hoped he would use that as an example.

Tihs is a emxpale of the efecft.

Re:aaah!! eyes hurt! by DrSkwid · 2004-09-01 22:40 · Score: 4, Informative

dunno, firefox / moz has one of my favourite features

tools ... options ... general .... fonts & colours .... minimum font size : 14

great for annoying "web site designers" who can't design for shit

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter

Microsoft Research Web Site by Numen · 2004-09-01 22:56 · Score: 5, Informative

If you've shied away from Microsoft, well, because they're Microsoft, you might not be aware of http://research.microsoft.com, which, regardless of which side of various fences you sit on, has some very interesting material and is generally worth tracking over time.

Apologies for the tangent; the back of this article seemed an apt place to point to the MS Research site for those who might not have been aware of it.

Re:I'm not sure I buy it. by ideonode · 2004-09-01 23:52 · Score: 5, Interesting

I cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdanieg - the phaonmneal pweor of the hmuan mnid. Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer inwaht oredr the ltteers in a wrod are, the olny iprmoatnt tihng is taht the frist and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe.

Re:I'm not sure I buy it. by maxwell+demon · 2004-09-02 00:36 · Score: 5, Insightful

Actuallythesplittingintowordsisnotnecessarytounder standwhatiswritteniftheorderoflettersiscorrect.Thi s"proves"thatyouarereadingbytheletter,notbytheword .(relyingonslashcodetoinsertameaninglessspaceevery nowandthen:-))

--
The Tao of math: The numbers you can count are not the real numbers.

amusing test... by zozzi · 2004-09-02 00:43 · Score: 5, Interesting

I enjoy giving people this test: Write a long sentence and make sure that the last word of the sentence is a filler word. Then write that filler word again at the start of the next sentence and write some more. Eg:

Yesterday I went to the beach and saw the the boat I always dreamt about.

~ 7 out of 10 people fail to spot it, even if told beforehand there's an obvious error. Somehow music people are more prone to spot the error straight away.
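
A machine has no such blind spot. As a small sketch (the function name is made up), a regular expression with a backreference catches the doubled word, including when the repeat straddles a line break:

```python
import re

# A word, some whitespace (spaces or a newline), then the same word again.
DOUBLED = re.compile(r"\b(\w+)\s+\1\b", re.IGNORECASE)

def find_doubled_words(text):
    """Return each word that is immediately repeated in the text."""
    return [m.group(1) for m in DOUBLED.finditer(text)]

sentence = "Yesterday I went to the beach and saw the the boat I always dreamt about."
print(find_doubled_words(sentence))  # ['the']
```

It will also flag legitimate repeats such as "had had", so treat the result as a list of candidates to check, not as proof of an error.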

--
---

Re:REKANYZE! by TarlCabbot · 2004-09-02 01:12 · Score: 5, Insightful

I am sure that we've seen this e-mail floating around. Doesn't it seem like we read in shapes?

I cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdgnieg The phaonmneal pweor of the hmuan mnid Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer inwaht oredr the ltteers in a wrod are, the olny iprmoatnt tihng is taht the frist and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe. Amzanig huh? yaeh and I awlyas thought slpeling was ipmorantt!

Interesting observation by lazyl · 2004-09-02 01:13 · Score: 4, Interesting

It makes a big difference whether your messed-up words use common letter patterns (what the article calls 'pseudowords') or not.

Example:

'uesdnatnrd' wasn't too hard to recognize because 'uesd' and 'tnrd' aren't letter patterns that exist in real words. So the mind works quicker to rearrange the letters to find a real word.

'aulaclty' was much harder because it's almost pronounceable. 'lac' and 'lty' are common patterns from real words, and 'aul' might not be common but it's pronounceable.

Just an observation.

--
Aw crap, ninjas!

Re:REKANYZE! by iamacat · 2004-09-02 02:41 · Score: 5, Interesting

Don't give any ideas to spammers on how to sneak their "pneis elnraegemnt ceram" past the filters. I do suspect that the effect is local to the small group of letters and long words that are totally randomized will be difficult to read.

Re:I'm not sure I buy it. by Orne · 2004-09-02 03:05 · Score: 4, Insightful

I'm no linguist (elec eng w/ neural net studies), but I would argue that the ability to perceive concatenated sentences like that is a function of the ability of the brain/eye to focus on a particular range and filter out "distractions" (letters to the left and right). Padding our words with spaces helps the brain define the focus boundaries more quickly, after which we can process the text range for meaning...

I imagine the brain's focus as little perception boxes, scanning up and down the concatenated sentence until enough symbols are aligned to fire a recognition signal... As I read your post above, I find my eyes darting about a little more, actually darting to the center of the "word" once recognition is made.

runonsentencewithlowercase -- here's your letter by letter scan "mode"

runonsentencewithcoloring -- slightly easier to define word boundaries by color

runonSENTENCEwithuppercase -- it's easier to locate the word SENTENCE because we perceive a boundary between lowercase and uppercase letters.

runo nsente ncewit hbads pacing -- pain in the ass, but we still comprehend

run on sentence with lowercase -- whitespace speeds comprehension.
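
For comparison, here is roughly how a program would recover the word boundaries in the examples above: a dictionary lookup plus dynamic programming. This is a sketch of a standard text-segmentation technique, not a claim about how the brain does it, and the vocabulary is a tiny hypothetical one chosen to fit the examples.

```python
def segment(text, vocabulary):
    """Split text into vocabulary words; returns None if no full split exists."""
    text = text.lower()
    max_len = max(map(len, vocabulary))
    # best[i] holds one segmentation of text[:i], or None if none was found.
    best = [None] * (len(text) + 1)
    best[0] = []
    for end in range(1, len(text) + 1):
        # Try the longest candidate word ending at `end` first.
        for start in range(max(0, end - max_len), end):
            if best[start] is not None and text[start:end] in vocabulary:
                best[end] = best[start] + [text[start:end]]
                break
    return best[len(text)]

# Hypothetical vocabulary, just large enough for the demo.
VOCAB = {"run", "on", "sentence", "sent", "with", "lowercase", "lower",
         "case", "bad", "spacing"}

print(segment("runonsentencewithlowercase", VOCAB))
# ['run', 'on', 'sentence', 'with', 'lowercase']

# Misplaced spaces carry no information, so drop them before segmenting.
print(segment("runo nsente ncewit hbads pacing".replace(" ", ""), VOCAB))
# ['run', 'on', 'sentence', 'with', 'bad', 'spacing']
```

With a realistic dictionary there are usually several valid splits, and a real segmenter would score them by word frequency; this version just returns the first one it finds.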

Re:Thought comes before language by alanxyzzy · 2004-09-02 04:41 · Score: 4, Interesting

why has English since then been stealing words from other languages like a slum rat during a riot in a shopping mall?

"The problem with defending the purity of the English language is that English is about as pure as a cribhouse whore. We don't just borrow words; on occasion, English has pursued other languages down alleyways to beat them unconscious and rifle their pockets for new vocabulary."
- James D. Nicoll

Slashdot Mirror
