Are Two Spaces After a Period Better Than One? (arstechnica.com)
Researchers at Skidmore College conducted an eye-tracking experiment with 60 Skidmore students and found that two spaces at the end of a period slightly improved the processing of text during reading. Ars Technica reports the findings: Previous cognitive science research has been divided on the issue. Some research has suggested closer spacing of the beginning of a new sentence may allow a reader to capture more characters in their parafoveal vision -- the area of the retina just outside the area of focus, or fovea -- and thus start processing the information sooner (though experimental evidence of that was not very strong). Other prior research has inferred that an extra space prevents lateral interference in processing text, making it easier for the reader to identify the word in focus. But no prior research found by [study authors] Johnson, Bui, and Schmitt actually measured reader performance with each typographic scheme.
First, they divided their group of 60 research subjects by way of a keyboard task -- the subjects typed text dictated to them into a computer and were sorted into "one-spacers" (39 regularly put a single space between sentences) and "two-spacers" (21 hit that space bar twice consistently after a period). Every student subject used but a single space after each comma. Having identified subjects' proclivities, the researchers then gave them 21 paragraphs to read (including one practice paragraph) on a computer screen and tracked their eye movement as they read using an Eyelink 1000 video-based eye tracking system. [...] The "one-spacers" were, as a group, slower readers across the board (by about 10 words per minute), and they showed statistically insignificant variation across all four spacing practices. And "two-spacers" saw a three-percent increase in reading speed for paragraphs in their own favored spacing scheme. The controversial part of the study has to do with the 14 point Courier New font that the researchers presented to the students. "Courier New is a fixed-width font that resembles typewritten text -- used by hardly anyone for documents," reports Ars. "Even the APA suggests using 12 point Times Roman, a proportional-width font. Fixed-width fonts make a double-space more pronounced."
First, they divided their group of 60 research subjects by way of a keyboard task -- the subjects typed text dictated to them into a computer and were sorted into "one-spacers" (39 regularly put a single space between sentences) and "two-spacers" (21 hit that space bar twice consistently after a period). Every student subject used but a single space after each comma. Having identified subjects' proclivities, the researchers then gave them 21 paragraphs to read (including one practice paragraph) on a computer screen and tracked their eye movement as they read using an Eyelink 1000 video-based eye tracking system. [...] The "one-spacers" were, as a group, slower readers across the board (by about 10 words per minute), and they showed statistically insignificant variation across all four spacing practices. And "two-spacers" saw a three-percent increase in reading speed for paragraphs in their own favored spacing scheme. The controversial part of the study has to do with the 14 point Courier New font that the researchers presented to the students. "Courier New is a fixed-width font that resembles typewritten text -- used by hardly anyone for documents," reports Ars. "Even the APA suggests using 12 point Times Roman, a proportional-width font. Fixed-width fonts make a double-space more pronounced."
IMHO, it is better to use a tab.
So you're angry that the "old type-writer-using geezer" is reading ten words per minute more than you and you're trying to sabotage his productivity?
And millennials wonder why nobody likes them...
Because you have humans using the system.
Which is why you "ignore whitespace" and code it as an option wherever it's needed. If anything, it actually makes searching FASTER, because an exact-whitespace match takes longer to find.
When you then put in Unicode, other languages, non-breaking spaces, paragraph marks, and you're working on human-entered data, you're really onto a loser from the start if you have coded anything on a byte-for-byte matching process.
Also, your system works against you in more ways than one. Someone creates an entry with one space. Someone else doesn't see it so they create it with two. Now you have two entirely different entries with different data referring to different database rows, but both "look" identical.
Ignore whitespace, and the problem solves itself.
Oh, look at that.... Slashdot replaces double-space after period with single space. ...
It is not Slashdot. It is a browser and the HTML. Two or three spaces are always one in the browser https://stackoverflow.com/ques...
So a client of an Internet shop sees always one space, and it is confusing for her/him why the product does not sort properly.
In fact, I had a lot of problems with these multiple spaces coming with product titles from suppliers. And I do not work only with English language.
Not really, computer cannot really do this automatically except very complicated language-specific programs with lots of exceptions, because the "." is also used within sentences for abbreviations.
Sentence boundary disambiguation has always been a rather annoying problem.