The Science of Word Recognition

AAAAAARRGGHHH, I'm going blind! by rock_climbing_guy · 2004-09-01 21:07 · Score: 5, Funny

Would one of those stupid comments about the colour scheme on /. be on-topic now?

--
Wh47 d1d j00 541, 31337 15n't t3h r0xor5 ne m0r3???

Re:AAAAAARRGGHHH, I'm going blind! by tahii · 2004-09-01 22:03 · Score: 2, Funny

Yes, yes it would.

For all those wanting this post in an eye-shattering colour, Click here

Honest!!! by TheWingThing · 2004-09-01 21:11 · Score: 5, Funny

I was reading what was written on her T-shirt!

Re:Honest!!! by rock_climbing_guy · 2004-09-01 21:16 · Score: 5, Funny

I liked the t-shirt that said,
(in big letters) If you can read this,
(in slightly smaller letters)you obviously must have
(in still smaller letters)very good eyesight.
(in smaller letters)While you're down here, why don't you give me a blow job?

--
Wh47 d1d j00 541, 31337 15n't t3h r0xor5 ne m0r3???

Oh no! by barcodez · 2004-09-01 21:12 · Score: 3, Funny

So are Microsoft going to patent the way we read and then sue?

"If you are reading this then you owe Microsoft royalies"

--

----

I'm not sure I buy it. by Anonymous Coward · 2004-09-01 21:15 · Score: 2, Interesting

His "word shape" matrix with "than" "tban" "tnan", etc; could be more easily explained by saying that people pay more attention to tall letters than short ones. That would explain why 'tban' gets caught more than 'tnan' just as well as word-shape arguments.

To make it more obvious, stick a tall letter in a word that only has short letters and you'll come away thinking word shape does matter.

(or did he explain it... there were way to many words and way too few glossy pictures in that article for me to comprehend it)

Re:I'm not sure I buy it. by SammyTheSnake · 2004-09-01 23:22 · Score: 2, Interesting

I once saw a little article in a free-on-the-train paper demonstrating that we mostly read the top half of a line of text. Try it some time, cover up the bottom half of a line of text and read it, then cover up the top half of the next line of text and read that. Which is easier?

Cheers & God bless
Sam "SammyTheSnake" Penny
Re:I'm not sure I buy it. by ideonode · 2004-09-01 23:52 · Score: 5, Interesting

I cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdanieg - the phaonmneal pweor of the hmuan mnid. Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer inwaht oredr the ltteers in a wrod are, the olny iprmoatnt tihng is taht the frist and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe.
Re:I'm not sure I buy it. by maxwell+demon · 2004-09-02 00:36 · Score: 5, Insightful

Actuallythesplittingintowordsisnotnecessarytounder standwhatiswritteniftheorderoflettersiscorrect.Thi s"proves"thatyouarereadingbytheletter,notbytheword .(relyingonslashcodetoinsertameaninglessspaceevery nowandthen:-))

--
The Tao of math: The numbers you can count are not the real numbers.
Re:I'm not sure I buy it. by Orne · 2004-09-02 03:05 · Score: 4, Insightful

I'm no linguist (elec eng w/ neural net studies), but I would argue that the ability to perceive concatenated sentences like that is a function of the ability of the brain/eye to focus on a particular range and filter out "distractions" (letters to the left and right). Padding our words with spaces helps the brain to quicker define the focus boundaries, after which we can process the text range for meaning...

I imagine the brain's focus as little perception boxes, scanning up and down the concatenated sentence until enough symbols are aligned to fire a recognition signal... As I read your post above, I find my eyes darting about a little more, actually darting to the center of the "word" once recognition is made.

runonsentencewithlowercase -- here's your letter by letter scan "mode"

runonsentencewithcoloring -- slightly easier to define word boundaries by color

runonSENTENCEwithuppercase -- it's easier to locate the word SENTENCE because we perceive a boundary beween small letters and upper letters.

runo nsente ncewit hbads pacing -- pain in the ass, but we still comprehend

run on sentence with lowercase -- whitespace speeds compehension.

In related news... by Zorilla · 2004-09-01 21:19 · Score: 3, Funny

New technology will soon be revealed that will instruct Slashdot users on the proper spelling of "lose".

The USSGN (Union of Slashdot Spelling and Grammar Nazis) is expected to stage protests against the new product in the interest of keeping their jobs.

--

It would be cool if it didn't suck.

Eye movements? by ImaLamer · 2004-09-01 21:19 · Score: 4, Interesting

With the assistance of fancy eye-tracking cameras researchers have been able to devise several clever experiments to give us new insight into how reading works."

Oh they must have been using EyeQ....

I can read at 44692 words per minute! Thanks for posting that long article for me to read, I needed the exercise.

And thank you EyeQ! Your the greatest!

Really though, they say that the more letters/words mean faster reading times. It's true. Think about a book or article you've read. When the words are together on the page it's easier to read because your eyes can jump around letting your brain fill in the blanks.

Ever read something that made sense but you couldn't quote it word for word? It's likely because you read in this same way.

--
Get your Unix fortune now!

Quotation by Anonymous Coward · 2004-09-01 21:20 · Score: 5, Funny

"Evidence from the last 20 years of work in cognitive psychology indicates that we use the letters within a word to recognize a word."

Man, I'm so glad they finally figured this out...

Re:Quotation by Richard_at_work · 2004-09-01 23:10 · Score: 2, Interesting

There was a slashdot story a while back which basically stated exactly that quote. Basically you could easily read an entire book where the words were made up of the correct starting character, the correct ending character, but the middle of the word it didnt matter what order the characters came in.

For example: "sadhoslt nwes for nrdes. Sfutf taht mrttaes". (I think ive got that correct, someone will obviously correct me if not :))

Your brain didnt need the middle of the word to understand the word when placed in a sentance. So long as the word was the correct length, your brain could extrapolate the meaning and create a meaningful sentance out of it.
Re:Quotation by Lars+Clausen · 2004-09-02 00:14 · Score: 3, Interesting

It also turned out to be mostly urban legend. There was some related research, but none that stated that claim. Bdeeiss, if taht was true, we cloud imoprve ceioomprssn aghilmorts by sinortg the mddile leertts aaabcehilllpty, scine tehir piinoosts are iaaeimmrtl.

-Lars
Re:Quotation by danila · 2004-09-02 00:55 · Score: 2, Informative

But teh gsit of teh sotry was ture. Terhe is a lot of rdeandncuy in the lagnuage nad if th rerhesecars are rghit, yuor bairn reelis mroe on crroect ltteres awanyy.

--
Future Wiki -- If you don't think about the future, you cannot have one.
Re:Quotation by Epistax · 2004-09-02 01:45 · Score: 2, Insightful

"Evidence from the last 20 years of work in cognitive psychology indicates that we use the letters within a word to recognize a word."

Very strange because if y_u r____r we d__'t n__d a_l those l_____s.

I love how by FS1 · 2004-09-01 21:20 · Score: 5, Insightful

Does anyone else think that merely analyzing how english is read is very closed minded? I'm pretty sure only a very small percentage of the world speaks and reads english.

I would love to see a study comparing how english is read to how chinese is read by native speakers. Very interesting i would gather.

--
A Fatal OE Exception has occurred, Sig will now reboot.

Re:I love how by defMan · 2004-09-01 21:34 · Score: 3, Interesting

I would personally be very interested in seeing english compared to dutch or german. In those languages (i'm a native dutch speaker) the word order is much more flexible and the determining verb often comes very late in the sentence. In german this is more prominent than in dutch.
I just searched around on google and these documents come up
Word Order in German
Kathol's analysis of German Word Order
Re:I love how by ImaLamer · 2004-09-01 21:44 · Score: 4, Interesting

You're right. It would seem that for better analysis comparing Hebrew/Chinese to English would be better.

Maybe we can learn even more about our way of reading, like: Is it the most efficient?

Is right to left, or left to right the best way to go.

Interesting side note (don't know why I'm bringing this up...) President #20, James A. Garfield could write in both Latin and Greek at the same time?

--
Get your Unix fortune now!
Re:I love how by dave420 · 2004-09-01 22:22 · Score: 4, Insightful

There are roughly 400 million people with English as their first language, true, but there are even more with English as a second language. If you're looking to select a language to base a study on, and you want it to be accessible, then you choose English. It really is that simple.
Also, Chinese is character-based, not letter-based, so the research would be completely different. Kind of like asking someone who's studying jet aircraft to study cars as more people have them.
Re:I love how by dave420 · 2004-09-01 23:27 · Score: 4, Interesting

No, there's lots of study on the matter, and it's shown that Chinese people interpret their written language in a completely different part of the brain than english-reading people. That fact alone means a completely different method is at work... :)
Re:I love how by julesh · 2004-09-02 00:39 · Score: 3, Interesting

Is right to left, or left to right the best way to go.

I remember reading about an interesting study into this. Apparently, there are a small number of people who have a particular form of brain damage which effectively reverses their perception. These people, if they were originally educated to read/write left to right, would afterwards naturally read/write right to left, or vice versa.

Apparently, once they get used to using their right hand with a style similar to that a left-hander would use (or vice-versa) they can read & write in the opposite direction at roughly the same rate a normal person can in the usual direction. The conclusion: the difference is not noticeable; neither left to right nor right to left is substantially more efficient (or any difference is also negated by the brain damage these people have suffered).

No, I can't cite references. I just came across it about 10 years ago, I don't even remember what I was studying at the time.
Re:I love how by antifoidulus · 2004-09-02 00:59 · Score: 2, Interesting

I wish they would have expounded on what they meant. Because the "sound" of a word in Chinese is pretty much the same as it's meaning. Yes, the characters do have meanings, but in Chinese(for the most part, there are some exceptions) each character only has 1 sound. The sound is exactly how you would pronounce the word if you were speaking, so I'm not sure what they mean by saying that children process the sound and the meaning seperately. Or maybe it's the difference between how a person understands a language when it is being taught as a 2nd language vs. a native one.
It gets a bit different for Japanese though. In Chinese(once again, for the most part) you can process character by character and read the sentence. However, because Japan had to force the Chinese character system to it's language(and also borrowed readings, then newer readings without changing the old ones) each character has usually 2 readings, some only 1, some much more. How you read the character depends on the characters around it. To a certain extent it's like English, you really cannot read it character for character phonetically, you have to process blocks. I wonder if the Japanese reader uses the same parts of the brain as the English reader, or Chinese reader, or both.

--
Monstar L
Re:I love how by QuickFox · 2004-09-02 01:38 · Score: 2, Funny

Puzzling.... anyway, it's good to know that at least some presidents have some skills

Wow, you're even more pessimistic than me about these things! You find it puzzling that a president has skills?

--
Terrorists can't threaten a country's freedom and democracy. Only lawmakers and voters can do that.
Re:I love how by Sunnan · 2004-09-02 03:28 · Score: 2, Informative

Is right to left, or left to right the best way to go.

If you're right-handed, you'll smudge the text with your hand if you write right-to-left.
Re:I love how by oneself · 2004-09-02 03:46 · Score: 2, Informative

> Is right to left, or left to right the best way to go.

The difference between right-to-left and left-to-right is not in the
reading but in the writing. Right-to-left languages, including
semitic languages that stemmed from Aramaic, were created in a time
before paper. These languages were usually chiseled into hard
materials like marble. Since most people are right handed, they tend
to hold the chisel with that hand, to give themselves more control.
That makes writing from right-to-left easier to read _as you write_.
Try it out.

Left-to-right languages were invented much later, when technology
improved, and the use of paper was common. In those cases writing
from left-to-right made more sense. Since, again, you can read as you
right. I'm not sure what the rational for top-down languages is, but
I'll bet it's something similar to this logic. In most cases in
history, the people who dictate (pun indented) the rules are usually
the creators, and not the users.
Re:I love how by Colazar · 2004-09-02 05:42 · Score: 2, Interesting

In German, in certain situations, the verb moves to the end. Moreover, certain verbs get even split in certain situations.
This actually happens in English, too, but we've been trained not to think of it that way.
* I'd like to hang up that picture.
* I don't know where to hang that picture up.
* His friends are going to move out at the end of the month.
* Is he going to help move his friends out?
Most of the time when you are "ending a sentence with a preposition," you are actually doing no such thing--you are using a seperable verb. But because we write them as two separate words even when they are next to each other, we don't really think of them as being that closely related anymore. Also the fact that those "particles" (the technical term for them) look and sound exactly like prepositions helped lead to the confusion.
I remember in my introductory syntax class, we spent about a week proving that the sentence structures for English and German were virtually identical, but mirror images of each other. (things that German tended to move to the back, English tended to move to the front, and vice versa) but we had to diagram an awful lot of sentences to get to that point.

--
He decided to just watch the government, and kind of scale it down to size, and run his life that way. --Laurie Anderson

Reading about how we read by DrFrasierCrane · 2004-09-01 21:22 · Score: 5, Interesting

While reading the article, I suddenly become hyper-aware about how I was reading the article. :-)

Don't let the Microsoft name scare you off - the article makes for a fascinating look (pun intended) into how we read. I wonder, though, if these findings are duplicated with written Oriental languages.

--
You call this a signature?

What about other writing systems? by mocm · 2004-09-01 21:22 · Score: 4, Interesting

Since most people in the world don't use the latin alphabet, it would be interesting to find out how word recognition works for them. And how they read words in our alphabet.

--
***Quis custodiet ipsos custodes***

Re:What about other writing systems? by ajs318 · 2004-09-01 23:23 · Score: 5, Interesting

They probably have already written papers on it ..... in their own languages.

Want my theory? I think the brain uses multiple techniques in parallel, then releases resources from the ones found to be going nowhere. So at any one time you may be trying to read a word letter-by-letter, recognising the word from the Bouma shape, and picking likely words from context. The different techniques will have different successes depending on various factors (clean type vs. messy handwriting, familiar vs unfamiliar words, &c). So my theory is that the brain is trying various methods at the same time, each narrowing down the possibilities, and just goes with whatever produces a result first. As soon as that happens, any half-finished tests in progress are scrapped and their resources deallocated. The eye movements may well have something to do with this ..... different reading techniques require different resolutions, the eye is great at recognising outlines but needs to zero-in on details, once a clue is established from the word envelope. There is evidence that fonts such as Times are more readable than Helvetica, so maybe serifs add recognisability in their own way? And if this is what is happening, then it would explain some of the test results in the article too, since they were looking for a single technique in use at any one time.

If all this sounds inefficient, you have to remember that human beings are optimised for non-optimum conditions ..... for instance, we have kidneys that pack up if you drink nothing but de-mineralised water, and an immune system that goes berserk and tries to poison you with histamine if it doesn't get enough germs to fight off.

--
Je fume. Tu fumes. Nous fûmes!
Re:What about other writing systems? by mocm · 2004-09-01 23:36 · Score: 2, Informative

If you look at the languages sorted by native speakers, I guess you can say that it will be a pretty close race between those that use the latin alphabet and extensions thereof and those that don't. Here you have a list of all the different writing systems.

--
***Quis custodiet ipsos custodes***

Reduced Redudancy by plasticmillion · 2004-09-01 21:26 · Score: 3, Informative

This got slashdotted!? The idea of recognizing words by "word shape" seems so silly to me that I almost feel as if the author is attacking a straw man rather than a widely accepted linguistic theory.

The final conclusions are similar to what I learned in my college linguistics classes 15 years ago. Language contains a lot of redundancy. The reason is that we often encounter situations of so-called "reduced redundancy". For example, someone might have sloppy handwriting so you can't make out all of the letters. Or you might be talking to someone while they brush their teeth. If language were highly optimized, we wouldn't understand a thing in these situations, but because of redundancy we can usually communicate very effectively.

The same applies to reading. The conclusions of the paper seem trivial to me. Of course, reading exploits "visual" and "contextual" information. How else would be understand a sentence like "The boy ate a ham___er" (with a few letters obscured)?

The fact that the brain's neural net adds up the weighted lexicographic, syntactic, semantic (and even pragmatic) information available to it in order to interpret language should be familiar to anyone who's read Goedel, Escher, Bach. And that was published in 1979...

--
Peer Pressure

Re:Reduced Redudancy by Placido · 2004-09-01 22:07 · Score: 4, Interesting

>> How else would be understand a sentence like "The boy ate a ham___er" (with a few letters obscured)?

What a way to prove your point. I kept thinking "hamster", "hammer" and then eventually realised that I didn't spot your miss-spelling of 'we' and that I read right over it and filled in the blank.

--

Pinky: "What are we going to do tomorrow night Brain?"
Brain: "I would tell you Pinky but this 120 char limi
Re:Reduced Redudancy by Da+Twink+Daddy · 2004-09-01 22:16 · Score: 2, Interesting

I filled in the blank with hamster [Making it: "The boy ate a hamster"], but maybe I'm just an oddity.
Re:Reduced Redudancy by TheWormThatFlies · 2004-09-01 23:00 · Score: 3, Informative

This got slashdotted!? The idea of recognizing words by "word shape" seems so silly to me that I almost feel as if the author is attacking a straw man rather than a widely accepted linguistic theory.
The author is aiming the article at typographers, not linguists and psychologists. It seems that while everyone who does scientific research into the way that we read has known for a long time that the word shape theory is full of crap, the theory persists as a kind of urban myth among typographers. So the paper is a scientific literature review for the benefit of people working in typography.
Re:Reduced Redudancy by nine-times · 2004-09-02 03:30 · Score: 2, Interesting

>> How else would be understand a sentence like "The boy ate a ham___er" (with a few letters obscured)?
What a way to prove your point. I kept thinking "hamster", "hammer" and then eventually realised that I didn't spot your miss-spelling of 'we' and that I read right over it and filled in the blank.
Wow. Not only did I do what you did, but not-even-reading your post, I picked out "ham___er", "hamster", "hammer", and "we", and tried to figure out if you were suggesting that "we" fit in the missing space, and he meant to say "hamweer".

How we read... by stupid_is · 2004-09-01 21:34 · Score: 2, Interesting

A while ago I was emailed something that stuck out from the usual chain/joke/... flood. Basically it had a very long and badly spelled sentence, where the only rules followed were that the first and last letter in the word were in the correct position. You could read it easily. Go figure!

Hree is an epamxle of jsut taht, it's qitue esay to raed, ins't it? Agulohth it can get plluartraicy hrad wtih the lgnoer wdros.

--
-- Intelligence is soluble in alcohol

Re:How we read... by Johan+Veenstra · 2004-09-01 22:02 · Score: 5, Informative

The example:

Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer in waht oredr the ltteers in a wrod are, the olny iprmoetnt tihng is taht the frist and lsat ltteer be at the rghit pclae. The rset can be a toatl mses and you can sitll raed it wouthit porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe.

But soon enough there was a counter example:

Anidroccg to crad cniyrrag lcitsiugnis planoissefors at an uemannd, utisreviny in Bsitirh Cibmuloa, and crartnoy to the duoibus cmials of the ueticnd rcraeseh, a slpmie, macinahcel ioisrevnn of ianretnl cretcarahs araepps sneiciffut to csufnoe the eadyrevy oekoolnr.

In the counter example, the letters are not randomly scrabled, the letters are in reverse order, except the first and last letters.
Re:How we read... by djmurdoch · 2004-09-01 22:32 · Score: 2, Funny

Anidroccg to crad cniyrrag lcitsiugnis planoissefors at an uemannd, utisreviny in Bsitirh Cibmuloa, and crartnoy to the duoibus cmials of the ueticnd rcraeseh, a slpmie, macinahcel ioisrevnn of ianretnl cretcarahs araepps sneiciffut to csufnoe the eadyrevy oekoolnr.

This would be a lot easier to read without that misplaced comma.
Re:How we read... by mikael · 2004-09-01 23:06 · Score: 2, Interesting

Interesting. Maybe word recognition uses a small cache to perform error correction if characters are swapped around by 2-3 spaces. In the case of the reversed characters, this won't work.

--
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads

So ... by Pegasus · 2004-09-01 21:40 · Score: 4, Insightful

when are they going to repeat these experiments in let say China or Japan? I'm *very* interested in what would the conclusions be there.
For what i know abaout japanese, they don't use spaces between 'words'. A single kanji represents the whole word and their outline is always more or less square. So the whole bouma theory fails here, as he finds out.
I'm sure they could leard more interesting things in other writing sysmtems ...

Re:So ... by macshit · 2004-09-01 23:57 · Score: 3, Informative

Kanji = picture-based
English = character-based

It's like comparing apples and oranges - two completely different ways a written language is interpreted.

I think they're not quite as different as many people seem to think though.

Most kanji are composed of more primitive components. From observing myself reading Japanese, I've noticed that I make many of the same mistakes in recognition, and use similar tricks in recognizing unknown kanji, as I do when reading english. For instance, I frequently confuse two kanji because they have mostly the same primitive components, but differ in one (often the radical -- even though it's arguably the most important part of a kanji, I find I tend to ignore it when reading!).

In my opinion it's not unreasonable to think of the parts of a kanji as being like letters and the whole thing as being like a word.

--
We live, as we dream -- alone....

Though comes before language by alanxyzzy · 2004-09-01 21:42 · Score: 4, Informative

I would love to see a study comparing how english is read to how chinese is read by native speakers.

There is an interesting article at the Harvard Gazette about research which seems to show that thought comes before language. The Korean language distinguishes between two meanings of "in" - fitting loosely or tightly.

Research shows that

Infants of English-speaking parents easily grasp the Korean distinction between a cylinder fitting loosely or tightly into a container. In other words, children come into the world with the ability to describe what's on their young minds in English, Korean, or any other language. But differences in niceties of thought not reflected in a language go unspoken when they get older.

Re:Though comes before language by julesh · 2004-09-02 00:50 · Score: 2, Insightful

The relationship is probably a lot more complicated than "thought comes before language". I suspect they are both highly dependent on each other.

For instance, it is clear that many non-verbal animals are able to think, in at least some limited fashion. Larger rodents, for instance, are able to build models of their world and solve simple problems (not limited to learning by trial and error). It is exactly this kind of modelling that concepts like one object being inside another stem from -- spatial reasoning is almost certainly the most deeply embedded and instinctive part of thought, and therefore the least likely to depend on language.

However, the ability to form complex theories and plans may or may not be entirely dependent on our ability to express them. Could primitive man, for instance, have looked at the weather and decided whether it would be best to go hunting today or finish building that shelter first, if he didn't have words for 'rain', 'shelter', and 'later'? The question might be too complex to approach without some kind of symbolism that can be internalised. Or it might not. Its very hard to tell.

This was a very interesting paper. by PotatoHead · 2004-09-01 21:42 · Score: 3, Interesting

I found myself becoming aware of how I read while I read. Fun! I agree with the author regarding letter recognition. The parallel aspect of word recognition is very interesting as well because it begins to explain why we are albe ot raed srcambled txet os eaisly!

Also, more work needs to be done to consider the visual cues outside the focus of attention. It is here that, I believe, shape and form cue the reader, more than letter shapes do, as to the potential content of the text to come. (Exactly how is for the geniuses.)

--
Blogging because I can...

Read it... by zoney_ie · 2004-09-01 21:45 · Score: 2, Insightful

Read it, it's interesting. It does get a wee bit weird when it's describing how you read as you read... a sort of super-conciousness about my eye movements. It's like when you become aware of your breathing or something and then have to conciously pay attention to it for a while to make sure it doesn't stop!

--
-- *~()____) This message will self-destruct in 5 seconds...

Focuses on 1 script, 1 language by kahei · 2004-09-01 21:46 · Score: 4, Insightful

While some of the results here are interesting (but old), the fact that the entire study focuses on exactly 1 script and 1 language basically renders the conclusions worthless (as conclusions about cognition in general... I suppose they still have value as conclusions about English and the Latin script).

What has happened here is:

1 -- Observe people reading a given language/script

2 -- See how they make use of features of that particular language/script, such as tall letters, case, and the occurrence of 'skippable' words such as articles

3 -- Describe the way they use these local features, and call that a theory of reading in general.

I don't really understand how to apply a theory of reading based on word and letter shapes when there are so many people reading text in which:

--There are no letter boundaries, and/or
--There are no word boundaries, and/or
--Letters all have the same form factor

The experiments described would probably generalize very well to arabic and greek scripts, pretty well to cyrillic (no tall/short letters to speak of), badly to devanagari-type scripts, very badly to Chinese and Japanese, and not at all to hieroglyphics (though I agree that there may never have been a reader of hieroglyphics who was fluent by modern standards).

To pretend that these experiments apply to humanity in general rather than the author's own language/script choice is silly. It's an interesting article and I'm glad the research was done but unfortunately a certain failure to 'get' the multilingual nature of humanity, which I don't really expect to find in MS work, is in evidence here.

--
Whence? Hence. Whither? Thither.

Re:Focuses on 1 script, 1 language by hazem · 2004-09-01 22:05 · Score: 5, Insightful

Everybody seems to be giving this guy a hard time because he did his research for reading only English. My guess is that the guy reads/speaks English and has ready access to people who do the same. This research is a good start and seems to have valuable results.

Now someone else can work on a PhD Thesis by taking his work and seeing if it applies in other languages.

Isn't this how science works? You do research, try to make some conclusions, and publish the results. If you wait to publish until you've found the Grand Unified Theory of Everything, then nobody publishes anything and science doesn't advance at all.

I'm not sure that he missed anything. He has started with what he knows and has resources to study.
Re:Focuses on 1 script, 1 language by olau · 2004-09-01 23:43 · Score: 5, Insightful

To pretend that these experiments apply to humanity in general rather than the author's own language/script choice is silly.

You know what is also silly? To pretend that this was the conclusion, although clearly the paper nowhere stated that it had found the grand unified theory of how people read. Here's a hint: when the paper talks about reading, it is obviously talking about reading English.

Yes, the paper would be even more interesting if it included studies of other scripts, and the failure to acknowledge the existence of other scripts should be criticised. But the rest of your criticism is unfounded.

Please by tgv · 2004-09-01 21:46 · Score: 2, Informative

Although it is nice to see mentioning of my trade a /., this paper has about the status of a student's essay. It doesn't even mention literature after 1998!

Article in short... by uss_valiant · 2004-09-01 21:55 · Score: 4, Informative

Further examination of the evidence used to support the word shape model has demonstrated that the case for the word shape model was not as strong as it seemed. The word superiority effect is caused by familiar letter sequences and not word shapes. Uppercase is faster than lowercase because of practice. Letter shape similarities rather than word shape similarities drive mistakes in the proofreading task. And pseudowords also suffer from decreased reading speed with alternating case text. All of these findings make more sense with the parallel letter recognition model of reading than the word shape model.

Of course he describes all the models before he concludes that from the three models, Word Shape Recognition (oldest), Serial Letter Recognition and Parallel Letter Recognition (newest), the latter is the one that is today the most accepted model.

Re:Comments by t0c · 2004-09-01 22:02 · Score: 2, Informative

http://news.bbc.co.uk/1/hi/health/3618060.stm is a good read about dyslexia it isn't exactly related but you might be interested :)

Re:aaah!! eyes hurt! by The+Grassy+Knoll · 2004-09-01 22:06 · Score: 5, Funny

>renerding on firefox

re-nerding! ha ha. Best... typo... ever...

--
They will never know the simple pleasure of a monkey knife fight

or maybe it's both? by Illserve · 2004-09-01 22:14 · Score: 4, Interesting

If there's one real take-home lesson of brain-design from cognitive science, it's that the brain tends to do everything several different ways in parallel, and then use the results from all of them.

Obviously it can't all be shape, there are plenty of words with identical shapes and yet these are distinguishable.

But it could certainly be true that we use shape and parallel letter recognition at the same time. Shape narrows the field of possibilities from millions to a small handful, and then parallel recognition chooses one of the options.

Whatever happens, you can be sure it's terribly complicated, extremely robust and very efficient.

Don't shout! by meckardt · 2004-09-01 22:34 · Score: 4, Interesting

From the article: ...lowercase text is read faster than uppercase text. This could also explain why nobody likes to read email where the other person uses all caps.

Re:Don't shout! by Seahawk · 2004-09-01 23:31 · Score: 4, Informative

And if you had read the rest of the article, you would know that this is just because 99% of all we read is lowercase.

People can easily be trained to read text in caps as fast as lowercase text - or mirrored text.

What I fail to understand is how randomizing the middle letters of a word doesnt affect reading much. I had hoped he would use that as an example.

Tihs is a emxpale of the efecft.

Re:aaah!! eyes hurt! by DrSkwid · 2004-09-01 22:40 · Score: 4, Informative

dunno, firefox / moz has one of my favourite features

tools ... options ... general .... fonts & colours .... minimum font size : 14

great for annoying "web site designers" who can't design for shit

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter

FTA... by Anonymous+Writer · 2004-09-01 22:45 · Score: 2, Interesting

Why I wrote this paper

I am a psychologist who has been working for Microsoft in different capacities since 1996. In 2000 I completed my PhD in cognitive psychology from the University of Texas at Austin studying word recognition and reading acquisition. I joined the ClearType team in 2002 to help get a better scientific understanding of the benefits of ClearType and other reading technologies with the goal of achieving a great on-screen reading experience.

I'm surprised this guy is actually working with ClearType. That is just a simple way of making characters appear better by using sub-pixels to increase character resolution. I would think this type of work would be better applied in optical character recognition, maybe even with cursive handwriting.

Well, it does in a way. by BigRedFish · 2004-09-01 22:51 · Score: 3, Interesting

The FArticle does, in fact, address this, though not directly - it puts forth a theory that all letters in a word are absorbed simultaneously, and the brain re-orders them. This is given as theory #3, admittedly a ways down.

This gets me thinking, though, about the importance of context. If you drew the letters PLEORBM in a Scrabble game, it might take a while to see the word staring at you. But in the context of a (mangled) sentence: "you can sitll raed tish wouthit a pleorbm," it much more easily jumps out. Interesting.

Re:Thought comes before language by achurch · 2004-09-01 22:55 · Score: 3, Interesting

Infants of English-speaking parents easily grasp the Korean distinction between a cylinder fitting loosely or tightly into a container. In other words, children come into the world with the ability to describe what's on their young minds in English, Korean, or any other language. But differences in niceties of thought not reflected in a language go unspoken when they get older.

Absolutely. And adults can "relearn" those distinctions, too; I found that as my Japanese studies progressed (started at 19, pretty close to native now) the range of things I was able to think about expanded considerably--so much so that now I sometimes have trouble speaking to people in English because English doesn't have a word for the concept I'm thinking about.

Microsoft Research Web Site by Numen · 2004-09-01 22:56 · Score: 5, Informative

If there's those that have shied away from Microsoft, well because they're Microsoft, you might not be aware of http://research.microsoft.com which regardless of which side of various fences you might sit has some very interesting material and is generally worth tracking over time.

Aplogise for the tangent, on the back of this article seemed an apt place to point to the MS research site for those that might not of been aware of it.

Phonemic information mandatory? by jsebrech · 2004-09-01 23:41 · Score: 2, Insightful

The internal representations for these models convert the letter information to phonemic information, which is seen as a mandatory step for word recognition. It is well known that words that have a consistent spelling to sound correspondence such as mint, tint, and hint are recognized faster than words that have an inconsistent spelling to sound correspondence such as pint

I can not believe this is in a serious paper. Mandatory? Please. What about people born deaf? Are they all unable to read?

Reading Cognizance Level by Gallenod · 2004-09-02 00:17 · Score: 2, Insightful

I'm wondering about the competence level of the readers used in the various tests. People have, in my experience, a wide range of reading ability levels ranging from those who still have to "spell out" many words letter by letter up to speed readers who can read entire phrases (or even whole sentences) as easily as most people read single words. If we divide them into three groups (phonetic readers, whole word readers, and "cognitive chunkers"), would these results be consistent from group to group?

Learning to read, like learning higher math, is a process of internalizing certain reflexes. Most people alive today do not understand calculus. Most will also never learn to read much faster than they can speak aloud.

Ultimately, 80% of the people voting in the next presidential election are of average or lower intelligence.

The rest of us are Slashdot members. :)

--

TLR

A man no more knows his destiny than a tea leaf knows the history of the East India Company

Re:Thought comes before language by Lars+Clausen · 2004-09-02 00:37 · Score: 3, Interesting

As a non-native (but fluent) speaker of English, and the husband of a fluent English speaker learning Danish, I can tell you quite well that there are many concepts that have a single word describing them in Danish but not in English, and vice-versa. Some words are normally considered equivalent but have slightly different extents ("pink" covers more colors than the common translation "lyserød", for instance).

The grandparent also didn't say "couldn't be expressed", but "has no word". Given enough verbiage, you can (probably) express any word in one language in any other language, but that's not what you want to do in conversation.

And if the "language of Shakespeare" is so all-encompassing, why has English since then been stealing words from other languages like a slum rat during a riot in a shopping mall? Mind you, I think this is a good feature that adds expressiveness to the language, but it clearly shows that there are things that English speakers consider important enough to be able to express succinctly that they'll bring in foreign words for it.

-Lars

amusing test... by zozzi · 2004-09-02 00:43 · Score: 5, Interesting

I enjoy giving people this test: Write a long sentence and make sure that the last word of the sentence is a filler word. Then write that filler word again at the start of the next sentence and write some more. Eg:

Yesterday I went to the beach and saw the the boat I always dreamt about.

~ 7 out of 10 people fail to spot it, even if told beforehand there's an obvious error. Somehow music people are more prone to spot the error straight away.

--
---

Source code? by maxwell+demon · 2004-09-02 01:00 · Score: 2, Interesting

While the study is certainly about reading English texts, could one draw some conclusions about the readability of source code? I guess at least the finding that whitespace governs the jumps of our eyes might have some relevance here.

--
The Tao of math: The numbers you can count are not the real numbers.

I think Chinese is similar by peter303 · 2004-09-02 01:11 · Score: 2, Interesting

I am somewhat fluent in Chinese. Though syllables in Chinese (and Korean) approximately fit into squares, they share two characteristics with alphabetic word shapes:

First, Chinese characters are often composed of several smaller characters, 500 or so, instead of the 70'ish letters and numerals (including capitals) in English. We say such a character may have a "moon" sub-character on the left, a "white" on the right and so on. The sub-characters can be partial clues to meaning and pronunciation (e.g. a left moon usually signifies part of a body, and the a right moon means its sounds like "bai"), but there are no steadfast rules. Just like in English where the pronunciation can vary from the spelling, and the whole meaning vary from those of the prefix and root. But Chinese breaks the rules more often, probably since many of the characters have been around 3000 years- seven times longer than modern English spelling rules. The Korean writing system is totally planned and recent, so it is very logical. The sub-characters represent the beginning, vowel, and end of a syllable, gracefully packed into a square.

Second, you can trace the boundaries of Chinese characters too and see distinguishing characteristics. They might have a gap in one corner, a ragged stroke defining an edge, etc. Just like in English words a learner will move from observing the strokes and sub-characters into the gestalt of the whole character. Just like any other language, chinese characters are contextual. Combinations are sematically constrained to one to four syllable word. And they are grammatically constrained to expect nouns, verbs, modifiers, etc, in certain sequences.

Re:REKANYZE! by TarlCabbot · 2004-09-02 01:12 · Score: 5, Insightful

I am sure that we've seen this e-mail floating around. Doesn't it seem like we read in shapes?

I cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdgnieg The phaonmneal pweor of the hmuan mnid Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer inwaht oredr the ltteers in a wrod are, the olny iprmoatnt tihng is taht the frist and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe. Amzanig huh? yaeh and I awlyas thought slpeling was ipmorantt!

Interesting observation by lazyl · 2004-09-02 01:13 · Score: 4, Interesting

It makes a big difference if your messed up words use common letter patterns (what, in the article he called 'Psuedowords'), or not.

Example:

'uesdnatnrd' wasn't to hard to recognize beacuase 'uesd' and 'tnrd' aren't letter patterns that exist in real words. So the mind works quicker to rearrange the letters to find a real word.

'aulaclty' was much harder because it's almost pronouncable. 'lac' and 'lty' are common patterns from real words, and 'aul' might not be common but it's pronouncable.

Just an observation.

--
Aw crap, ninjas!

why is MicroSoft research so disconnected? by peter303 · 2004-09-02 01:21 · Score: 2, Insightful

I notice MS Research doing lots of basic research that has never been productized. Its rare to see corporations being so liberal with their resources. Even Google's very imaginative projects seem to be directed towards a commercial goal.

This suggests an interesting contradiction in MS product strategy. MS has a long history of "clone and conquer", e.g. Excel copies off VisiCalc and Lotus 123. Just this week MS cloned Apple iTunes. Yet MS Research is conducting some very interesting basic research. Go figure!

Larson has his detractors by toby · 2004-09-02 01:47 · Score: 2, Interesting

Larson presented this paper at the 2nd international Conference on Typography and Visual Communication in Thessaloniki, July this year. Other speakers, in particular Peter Enneson, questioned some of his methods and conclusions, and the paper - as convincing as it appears at first glance - should definitely be taken with a grain of salt.

Following a day's sessions on legibility and word recognition, the Thessaloniki conference held a round table on legibility and the processes of word recognition, chaired by John Hudson and participants Mary Dyson, Hrant Papazian, Kevin Larson and Peter Enneson.

--
you had me at #!

Re:REKANYZE! by iamacat · 2004-09-02 02:41 · Score: 5, Interesting

Don't give any ideas to spammers on how to sneak their "pneis elnraegemnt ceram" past the filters. I do suspect that the effect is local to the small group of letters and long words that are totally randomized will be difficult to read.

Re:Thought comes before language by MegaFur · 2004-09-02 03:03 · Score: 2, Interesting

The grandparent also didn't say "couldn't be expressed", but "has no word". Given enough verbiage, you can (probably) express any word in one language in any other language, but that's not what you want to do in conversation.

The approprately clunky sounding phrase to express the thought related in your second sentence is "concatenative assemblage".

--
Furry cows moo and decompress.

we disagree by Doc+Ruby · 2004-09-02 03:09 · Score: 2, Insightful

One problem with deciding "word shape" vs. "letters" as the method a reader uses to recognize a word comes from treatment of the reader as "atomic". I am a proficient reader. When I read a word, whether written by another, or by myself (as I type), I have multiple subcurrents of consciousness. A typo in a word might leave me with recognition of the word, and a sense that "something's wrong", simultaneously - it sometimes takes me several seconds to detect the typo, especially if it's one I often make myself. Likewise, some spelling mistakes derive from the difference my spoken accent makes with the written conjugated spelling, most often in the case of syllables separated by an "e" that is pronounced as a "schwa", easily confused with some pronunciations of "i" or "a", and sometimes "y".

Reading words silently, I sometimes notice an inner chorus pronouncing the words, with one or two discordant notes, even from poorly organized structure or unparseable punctuation. Deciding how people recognize words must also account for how people's minds are organized. The myth of the "undivided self" gets in the way of understanding not only how complex we are "under the hood", where media is digested, but denies credit to our grand integrator, which juggles these partial selves into one face with which to confront the world. As machine intelligence benefits from multiple simultaneous processing, why should they have all the fun? As we mimic our own minds in computer simulations, why should we have all the fun?

--

--
make install -not war

Fantasy and Science Fiction writers take notice! by cmpalmer · 2004-09-02 04:19 · Score: 2, Interesting

I read really fast. I also read quite a bit of fantasy and science fiction. I have noticed the effect that weird alien and fantasy names (N'kalogh or Xyztle) are like driving over speedbumps. The higher the density of unfamiliar and nearly unpronounceable names, the more likely I am not to finish the book (or even pick it up).

"N'kalogh leapt onto his mighty huyloch and rode across the plains of V'looth'u". Next please.

This paper gives a convincing pyschological model about why this occurs and it is pretty much what I had surmised on my own.

So, from now on, please name all of your aliens Bob, Larry, Bubba, or Charles.

--
-- stream of did I lock the front door consciousness

My own experiments with an eye-tracking camera... by pandrijeczko · 2004-09-02 04:19 · Score: 2, Funny

...revealed that whenever I read the word "Microsoft", my pupils dilate and when I read the word "Longhorn", I fall into a deep sl...

ZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZ

--
Gentoo Linux - another day, another USE flag.

Re:Eye tracker experiments by TheLink · 2004-09-02 04:21 · Score: 2, Interesting

Here's one thing you might want to try if you do that. Get someone to track when you blink whilst reading and how often you blink, and whether it's consistent for a particular person, and compare it with other people.

It seems if you don't blink whilst reading it's like trying to eat food in big chunks... At least for some people. Then again it may be the stress of keeping your eyes open distracting you from reading?

My postulate is that the brain takes the blink time to dedicate more resources to processing and understanding what is read.

Coz it's amazing how much "brain CPU" vision/sight takes up for most people. I've got people (kids + adults) to try to do certain coordination tricks - like drawing a circle with the right hand whilst drawing a square with the right foot, or doing an OK sign with the left hand palm facing upwards, L sign with index and thumb of right hand, palm facing downwards, and then rolling hands over - switching to L with left palm down and and OK sign with right palm up, and back again.

I find that most people find it easier to learn how to do such stuff if they have their eyes closed and visualize it whilst doing it. Once they get it, they can do it with their eyes open.

Unfortunately many tasks that require high coordination require your eyes to be open :).

There seems to be a "brain state" difference between having my eyes open or closed. I find it harder to stand and balance on one foot with my eyes closed compared to with my eyes open EVEN THOUGH it is totally dark and my eyesight is useless, or I am blindfolded. (Of course it is much easier with my eyes open and the surroundings visible :) ).

I am not a scientist, and I haven't done a formal study on these items. So it's just anecdotal, but feel free to go do one - would be good if the resulting study is published somewhere on the internet.

--

Too many replies beneath your current threshold

Re:Thought comes before language by alanxyzzy · 2004-09-02 04:41 · Score: 4, Interesting

why has English since then been stealing words from other languages like a slum rat during a riot in a shopping mall?

"The problem with defending the purity of the English language is that English is about as pure as a cribhouse whore. We don't just borrow words; on occasion, English has pursued other languages down alleyways to beat them unconscious and rifle their pockets for new vocabulary."
- James D. Nicoll

Re:Eye tracker experiments by Smallpond · 2004-09-02 09:36 · Score: 2, Interesting

That gives me an even better idea. Its probably much easier to detect when someone blinks then to track eye motion. So just change a few words of text whenever they blink. I suspect the result could be pretty funny.

As for why people blink at a specific rate, and whether that changes based on level of concentration, that's been studied.

"Studies have measured the blink rate and tearing on computer workers and noted that the blink rate dropped very significantly during work at a computer compared to before and after work. There was no significant change in tearing. The data support the fact that blink rate decreases during computer use, but also show that other tasks can decrease the blink rate.

Possible explanations for the decreased blink rate include concentration on the task or a relatively limited range of eye movements. Although both book reading and computer work result in significantly decreased blink rates, a difference between them is that computer work usually requires a higher gaze angle, resulting in an increased rate of tear evaporation. Since the main route of tear elimination is through evaporation and the amount of evaporation roughly relates to eye opening, the higher gaze angle when viewing a computer screen results in faster tear loss. It is also likely that the higher gaze angle results in a greater percentage of blinks that are incomplete. It has been suggested that incomplete blinks are not effective because the tear layer being replenished is 'defective' and not a full tear layer."

Which suggests that my blink rate may go up now that I have bifocals, since I have to look through the bottom of them for close-up work (lower gaze angle). I think a lot of the dry eye that I get is from the a/c, anyway.

Slashdot Mirror

The Science of Word Recognition

80 of 430 comments (clear)