Slashdot Mirror


Amazon "Suppresses" Book With Too Many Hyphens

An anonymous reader writes Author Graeme Reynolds found his novel withdrawn from Amazon because of excessive use of hyphens. He received an email from Amazon about his werewolf novel, High Moor 2: Moonstruck, because a reader had complained that there were too many hyphens. "When they ran an automated spell check against the manuscript they found that over 100 words in the 90,000-word novel contained that dreaded little line," he says. "This, apparently 'significantly impacts the readability of your book' and, as a result, 'We have suppressed the book because of the combined impact to customers.'"

59 of 292 comments (clear)

  1. from the what-until-they-get-a-load-of-this dept. by BarbaraHudson · · Score: 4, Funny

    You-should-ask-slashdot-to-publish-the-book-they-LIKE-hyphens.

    --
    "Transparent" is a shit show that trades on every stereotype going. A man in drag is NOT a transsexual.
  2. f-o-x-t-r-o-t-u-n-i-f-o-r-m-c-h-a-r-l-i-e-k-i-l-o by Anonymous Coward · · Score: 3, Funny

    f-o-x-t-r-o-t-u-n-i-f-o-r-m-c-h-a-r-l-i-e-k-i-l-o-y-o-u-a-m-a-z-o-n

  3. LOL ... w00t? by gstoddart · · Score: 2, Insightful

    So, Amazon is now the grammar police?

    I'm sure there are hundreds (if not thousands) of books on Amazon which have absolutely shit grammar and punctuation.

    To quote the author of the book ... what the actual fuck?

    --
    Lost at C:>. Found at C.
    1. Re:LOL ... w00t? by ma++i+ude · · Score: 4, Informative

      Actually, it looks like Amazon is the typography police. For whatever reason, the book seems to use en-dashes instead of hyphens (check the preview on Amazon). That is an abomination. Where the message changed from "please replace en-dashes with hyphens" to "don't use hyphens" is anyone's guess.

      --
      You can't shut us down! The Internet is about the free exchange and sale of other people's ideas!
    2. Re:LOL ... w00t? by Zocalo · · Score: 2

      Not with any consistency it seems. They are apparently fine with Ernest Vincent Wright's Gadsby which doesn't even include the letter "e" once in the main text (there's a nice bit of humour/irony in there being an ebook version though), with all the readabilty issues you might expect that to bring. The works of James Joyce also still seem to be listed, come to that, so I'm somewhat curious as to just how this "readability filter" get applied. I sure hope it's not just based on reader comments, because if it is a group like Anonymous or /b/ is about to have a book censoring field day.

      On the otherhand, if they can start with some of the religious dogma out there...

      --
      UNIX? They're not even circumcised! Savages!
    3. Re:LOL ... w00t? by ma++i+ude · · Score: 5, Informative

      Addendum: It turns out the author used the minus sign instead of the hyphen. That (a) looks wrong on the page, (b) breaks screen readers, (c) confuses readability scores and (d) makes this not news.

      --
      You can't shut us down! The Internet is about the free exchange and sale of other people's ideas!
    4. Re:LOL ... w00t? by Sperbels · · Score: 3, Informative
      I always thought the en-dash and hyphen were the same thing and the em-dash was the long one. Apparently there's hyphen, en-dash, and em-dash and the text of the book does indeed use the en-dash...and looks a little weird.

      Fangs burst through her gums as her jaw elongated into a razor–filled [razor-filled] muzzle and her ears elongated. After less than thirty seconds, the woman had been replaced by sleek, muscular, brown–furred [brown-furred] monster.

    5. Re:LOL ... w00t? by gnupun · · Score: 3, Informative

      An en-dash is much longer than a hyphen.

    6. Re:LOL ... w00t? by ma++i+ude · · Score: 4, Informative

      So, on a standard US keyboard, is this sign a minus or a hyphen?: -

      It's a hyphen. A standard keyboard layout has no minus sign, not even in the keypad. The author of the book explicitly specified a Unicode minus sign wherever a hyphen should've been because "I try to avoid using direct ascii hash codes because some ereaders can misinterpret them"

      --
      You can't shut us down! The Internet is about the free exchange and sale of other people's ideas!
    7. Re:LOL ... w00t? by Anonymous Coward · · Score: 3, Informative

      Hyphen.

      As well:

      — is an em-dash
      – is an en-dash
        is a minus, which you cannot see in Slashdot...

    8. Re:LOL ... w00t? by Anonymous Coward · · Score: 5, Informative

      A standard keyboard layout has no minus sign, not even in the keypad.

      That is very arguable. In fact it's just wrong. The key is a "minus" key, labelled with a "minus" sign, and in Windows at least it produces a scan code the constant for which is VK_SUBTRACT. What may ultimately be rendered in various text-entry contexts as a result of pressing that key may or may not be a minus sign, but the key is most definitely a minus key with a minus sign on it.

    9. Re:LOL ... w00t? by zippthorne · · Score: 2

      In his blog, there are a number of comments about the HTML entity he used instead of the hyphen character. There is speculation that text-to-speech accessibility features were mis-interpreting things as a result.

      On the TTS note, It seems like HTML (or at least the dialect used for ebooks, but why not everywhere?) should have a tag for providing pronunciation overrides, which would improve accessibility and finally allow us to know how the authors intended the pronunciation of all those apostrophe'd names.

      --
      Can you be Even More Awesome?!
    10. Re:LOL ... w00t? by Pope+Hagbard · · Score: 5, Insightful

      It's so not-news that it was debunked on Reddit and other places a week ago.

      Slashdot's given up on news for nerds, and it's giving up on stuff that matters.

    11. Re:LOL ... w00t? by ma++i+ude · · Score: 3, Interesting

      You're probably right. As soon as I hit 'Submit' I regretted having used the term 'sign'. It's better to distinguish between keycaps, scancodes, character codes and glyphs.

      --
      You can't shut us down! The Internet is about the free exchange and sale of other people's ideas!
    12. Re:LOL ... w00t? by pla · · Score: 2, Insightful

      So, on a standard US keyboard, is this sign a minus or a hyphen?: -

      Gonna piss off the typography police here, but...

      Yes. They mean the same damned thing, and don't give me any crap about one looking a little longer than the other. A hyphen is a dash is a minus sign is any mid-height horizontal line.

      Readability scores? Seriously? I will damned well use whatever character comes out when I press the key between "0" and "=" on my keyboard, and to hell with your broken automated readers that can't deal with the default character produced by 99.9% of keyboards in the English-speaking world.

    13. Re:LOL ... w00t? by Dishevel · · Score: 2

      Most people probably do not want Amazon to edit their books.

      --
      Why is it so hard to only have politicians for a few years, then have them go away?
    14. Re:LOL ... w00t? by pthisis · · Score: 5, Informative

      Propose such a "simple" perl script.

      Here are some cases it should know how to deal with:

      Between numbers (note that slashdot eats some of these characters; the numbers below all have different dashes or related symbols between "555" and "1000"):
      "Pages 555–1000 discuss this matter" (this should be an internumeral dash, which is typically an en dash, U+2013).
      "Her phone number is 5551000" (this should be a figure dash, U+2012).
      "There were actually a lot more of them than the estimated 555—1000, to be precise" (this should be an em dash, U+2014).
      "The teacher asked me to solve 5551000. I told him negative 455 was the answer." (this should be a minus sign, U+2212)

      Between letters/words you have a similar problem: even if you know it shouldn't be a minus sign (which symbolic algebra makes tough to know for sure, but suppose you could surmount that), you generally have no idea what kind of dash or hyphen it should be turned into.

      --
      rage, rage against the dying of the light
    15. Re:LOL ... w00t? by plover · · Score: 2

      What's news then, is that Amazon can't deploy a simple perl script to fix common typography errors such as these.

      There is nothing simple about typography, and a script such as you describe would cause more damage than it would fix. Any editor has to fully understand English, to know which word is the right choice, to understand syntax and grammar, and to know when a writer is deliberately or playfully bending the rules.

      If you want to see what the state of the art in automated editing looks like, try using Word's grammar checker. If all of its advice is followed, it can make any interesting story read as blandly as an 8th grader's essay paper on the history of frogs.

      --
      John
  4. If readability was a crime... by xxxJonBoyxxx · · Score: 5, Funny

    ...there would be no Slashdot summaries.

    1. Re:If readability was a crime... by amalcolm · · Score: 2

      The subjunctive is alive and living on Slashdot!!

      --
      Time for bed, said Zebedee - boing
  5. Link to the source by OzPeter · · Score: 4, Informative

    At least link to the actual story, rather than the discussion of the story.

    Hyphen Hate? When Amazon went to war against punctuation.

    Jeez. That was in the second paragraph of TFA.

    --
    I am Slashdot. Are you Slashdot as well?
    1. Re:Link to the source by Solandri · · Score: 4, Informative

      Thanks. Here's the rather important last line od the author's blog you linked:

      "UPDATE: The book is now back on sale. Common sense seems to have prevailed "

  6. About time Amazon cracked down on this by 91degrees · · Score: 5, Funny

    I also think it's about time they take down down on Mozart's "The Marriage of Figaro" from their mp3 store until someone can do something about the number of notes.

    1. Re:About time Amazon cracked down on this by The+Grim+Reefer · · Score: 3

      Which notes did you have in mind?

  7. Welcome to what happens.... by iamwhoiamtoday · · Score: 3, Insightful

    When you host your content on someone else's systems.

    1. Re:Welcome to what happens.... by ArcadeMan · · Score: 3, Funny

      My foot, your ass. See how that works? - Red Forman

  8. Re:Once every page and a half... by moosehooey · · Score: 2

    They might mean over 100 unique words...

  9. It was probably the wrong kind of "hyphen" by Anonymous Coward · · Score: 2, Insightful

    There are unfortunately lots of Unicode characters with the graphical appearance of a horizontal line at roughly the height of the middle line of a capital E. If you use the wrong one then it might look right for you but disastrously wrong for some readers. I suspect this may have happened in this case.

  10. What's next? by __aaclcg7560 · · Score: 2

    Amazon will be telling authors to break their novels into chapters and paragraphs.

    1. Re:What's next? by dead_user · · Score: 2

      IMNSHO,
      Chapter breaks allow for a reader to be able to step away from the novel knowing they haven't left a cliffhanger on the next page. Think of it like a scene change in a movie. I can't stand just picking some random place to stop reading a book because I have to go. If I decide to quit reading in the middle of a chapter, I check to see how much is left. If it is just a page or two more, I'll finish the chapter.

      Paragraph breaks let my brain codify smaller chunks of data for parsing more efficiently than long run-on paragraphs. It just works better.

  11. Re:from the what-until-they-get-a-load-of-this dep by Anonymous Coward · · Score: 2, Insightful

    The only way Slashdot would publish a book is if Bennett Haselton wrote it.

  12. I recently bought a book from Amazon... by QuietLagoon · · Score: 2
    ... that had too much use of the word "and," leading to an excessive amount of run-on sentences.

    .
    Maybe I should start hitting the Amazon reviews and flagging all the books whose grammar usage I find confusing.

    Let's see, this book uses strange and confusing Capitalization, making it difficult to read. Maybe Amazon should suppress it as well.

    1. Re:I recently bought a book from Amazon... by mrbester · · Score: 2

      "I bought this book of poetry in good faith based on recommendations. Has this 'e e cummings' never heard of capital letters? I demand a refund."

      --
      "Wait. Something's happening. It's opening up! My God, it's full of apricots!"
  13. Re:from the what-until-they-get-a-load-of-this dep by Attila+Dimedici · · Score: 3, Interesting

    Haven't they already published several of his books?

    --
    The truth is that all men having power ought to be mistrusted. James Madison
  14. Re:from the what-until-they-get-a-load-of-this dep by Mostly+a+lurker · · Score: 5, Informative

    Actually, your title is an (admittedly exaggerated) example of how hyphens can assist readability. The hyphens make clear that you are using a compound adjective. In fact, a common error in writing is omitting hyphens when they are necessary. For example, someone writing I saw a man eating alligator probably meant I saw a man-eating alligator .

  15. Re:from the what-until-they-get-a-load-of-this dep by Oligonicella · · Score: 4, Insightful

    It's self-publishing. No, they don't need to proof read.

  16. Amazon was being dumb by clovis · · Score: 4, Informative

    Looks like Amazon was being dumb.
    The problem was not too many hyphens, but rather that there were no hyphens. He had used the minus sign and that breaks some text-speech readers.
    Graeme has already fixed it.

    This is Graeme's blog telling the story, the problem, and the fix.
    https://graemereynolds.wordpre...

  17. Re:Why hyphenation in an e-text? by lgw · · Score: 5, Funny

    There is a unicode character known as a soft hyphen.

    Hey, this is Slashdot: we don't know about Unicode and we like it that way!

    --
    Socialism: a lie told by totalitarians and believed by fools.
  18. Re:from the what-until-they-get-a-load-of-this dep by Shadow+of+Eternity · · Score: 4, Funny

    The difference between the two being particularly notable as someone who lives in Florida... the former is most often a native and the latter most often a tourist.

    --
    A bullet may have your name on it but splash damage is addressed "To whom it may concern."
  19. Could be worse. by Richy_T · · Score: 3, Funny

    Could have been a proctology book rejected for too many colons.

  20. Re:from the what-until-they-get-a-load-of-this dep by Count+Fenring · · Score: 2

    Most man-eating alligators are natives... their food, on the other hand...

  21. Dammit! by frank_adrian314159 · · Score: 2

    There goes my book in morse code!

    --
    That is all.
  22. Re:What a horrible first world problem by Archangel+Michael · · Score: 2

    And "The Interview" was a problem for Kim (North Korea, not Kardashian) so what? The problem with listening to every whiner is that they get too much power in the process, and normal people start being impacted by all the various "rules" the whiners come up with that serve no purpose other than to annoy everyone else.

    Hey, I just described political correctness :-D

    --
    Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
  23. While we're on the subject... by ggraham412 · · Score: 2

    Not handling hyphens, minus signs or whatever: it doesn't surprise me in the least.

    Why don't eBook publishers use a typesetting system based on TeX or LaTeX? Good grief. I was formatting complex mathematical formulas and pretty printing them to Postscript and PDF before the lot of you were born. And not just text with mere hyphens.

    Is there something I'm missing, or are eBooks a major step backwards in formatting? Really. I can't tell you how many computer science and mathematics eBooks I've returned to Amazon or B&N because of the sh***y formatting of code and math formulas. Not just when eBooks first came out, but on and on, year after year, and it doesn't get better. It strikes me as the laziness of corporations.

    1. Re:While we're on the subject... by dgatwood · · Score: 2

      Why don't eBook publishers use a typesetting system based on TeX or LaTeX?

      Let me count the reasons.

      • Books in electronic form must be reflowable, to accommodate variations in device size, and to accommodate rotation. What this means is that page numbers can change continuously. If I rotate a reader from portrait to landscape mode, then flip to the next page, then rotate it back into portrait orientation, there's no guarantee that the page boundaries are the same as they were in portrait orientation previously. So imagine having to run LaTeX on your entire document more than once per second, with a page offset, rendering only a subset of the content.
      • Books in electronic form require the ability to reliably link between documents. Good luck with that in LaTeX, much less in a hypothetical reflowing LaTeX.
      • LaTeX's font handling and Unicode glyph handling are dreadfully subpar even in XeLaTeX. Line breaks around em dashes and en dashes are as broken as they were in OS 9, requiring use of \hspace{0.001pt} after them if you want LaTeX to wrap correctly.
      • LaTeX is, IMO, terrible for anything that involves even basic custom formatting. I've used it for fiction book publishing. I have over 2,400 lines of custom LaTeX macros to prove it. By contrast, even when working around the quirks of multiple EPUB readers, I have only about 1,000 lines of simple CSS that gives almost exactly the same results as those 2,400 lines of seriously complex macro code in LaTeX. To be fair, there's a bit of Perl content translation code that replaces a little bit of macro code, so the difference isn't quite as extreme as it sounds, but it's a lot easier to do math computation in Perl than in LaTeX macro code, and a good chunk of that math was only required because LaTeX lacks some fairly basic formatting functionality, such as an equivalent for the CSS min-width property. LaTeX is positively primitive when compared with HTML and CSS, IMO.
      • LaTeX is a write-only language. Like Tom Christenson said about Perl, nothing can parse LaTeX other than LaTeX. It is basically impossible to properly translate LaTeX into any other form, which makes it a terrible source language. Most people writing content want to write once and reuse in different formats, so you're better off starting in a proper semantic markup language like XML. And if you're starting from XML, it's easy to spit out HTML. It is butt ugly to spit out LaTeX. Been there, done that. I have almost 800 lines of custom XSL on top of the existing DocBook2XML code to prove it.

      There are probably many more reasons I could think of if I took the time, but that's just what comes to mind off the top of my head. Knowing what I know now, if I had to do my latest project over again, I would have written a custom typesetter in JavaScript that runs in a custom WebKit-based app. It would have been faster, and I would have had more control. I can't even imagine trying to use LaTeX in an eBook reader. It would be like printing a book using a million Chinese workers with quill pens.

      What math and CS book publishers should be doing is formatting formulas using LaTeX and converting the PDFs to SVG images. That should give you nicely formatted formulas whose text is still searchable. The rest of the content should be HTML, just like any other eBook. It isn't rocket science; the publishers you've dealt with just don't care about those books enough to do the job correctly. :-)

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

  24. Re:from the what-until-they-get-a-load-of-this dep by Hognoxious · · Score: 2

    I thought, they preferred, commas, that have, no logical reason, to be, there.

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  25. Re: from the what-until-they-get-a-load-of-this de by Anonymous Coward · · Score: 2, Informative

    The book was professionally edited. It's by a British author using British english. While some of the hyphens could have been removed they were not grammatically incorrect. The problem seems to be that Amazon does not recognise British grammar and punctuation differences.

  26. Re:from the what-until-they-get-a-load-of-this dep by Enry · · Score: 5, Informative

    In traditional book publishing, the author gets about 5% of the list. The publisher sells the book to a retailer for 50% of the list price and the author typically get about 10% of what the publisher sells it for. At least that's what it was in my case. So getting 70% on a self-published book isn't a bad deal. Though editors are still important.

  27. Re:from the what-until-they-get-a-load-of-this dep by BarbaraHudson · · Score: 5, Funny

    I imagine #they'd totally $freak at a @book about &perl.

    --
    "Transparent" is a shit show that trades on every stereotype going. A man in drag is NOT a transsexual.
  28. it wasn't about text-to-speech by epine · · Score: 2

    From Hyphen Hate? When Amazon went to war against punctuation

    A ridiculous number of people have gotten caught up in the whole âoehe used a minus sign instead of an ascii hyphen! The bastardâ controversy that has followed this thread around and has spilled over into any number of internet message boards. First of all, let me be clear. The issue was not with my use of a minus sign. The issue Amazon had was that someone had complained about hyphenation. Second, I have since gone back and checked the original file on the Kindle text-to-speech app and it renders fine. No issues. [my emph.]

    <acerbic>
    These days 75% of all Slashdot posts seem to involve drilling down to get the original story straight. Tell me, when did a mass-confusion clusterfuck become the new nerd foreplay? Kindle typography, meet declining Slashdot editorial standards. You've got more in common than you think.
    </acerbic>

  29. Re:Good luck not doing that by dgatwood · · Score: 2

    The Kindle can load its ebooks from anywhere. Sure, it's perhaps easier/faster to get them straight from Amazon, but there are no technical barriers whatever to loading any file you want onto your Kindle.

    Actually, there are technical barriers, and steep ones. Amazon does not use a standard eBook format, but rather uses its own custom binary blob. Because Amazon does not publish information about that format, there is exactly one tool that is known to generate this format in a guaranteed forward-compatible way. That tool, kindlegen, was written by Amazon, and the licensing terms from 2.0 onwards (the first version to support nontrivial formatting) do not allow you to use it for creating content that is sold outside Amazon's store. So in order to distribute content elsewhere, you have to either:

    • give it away for free,
    • violate Amazon's license,
    • use an unauthorized tool that produces content based on reverse engineering, which may or may not be even remotely correct, or
    • sell an EPUB book and require your readers to convert it to Kindle format themselves.

    None of these choices is viable, IMO. As such, I consider Kindle to be by far the single most locked-down eBook reader on the market today. At least when Apple puts licensing terms like that into their book generation software, they have the decency to document the format so that you aren't forced to use their toolchain....

    --

    Check out my sci-fi/humor trilogy at PatriotsBooks.

  30. Re:Why hyphenation in an e-text? by ThePhilips · · Score: 2

    There is a unicode character known as a soft hyphen. The soft hyphen indicates where to break a word if it doesn't all fit on a line. This character should be used instead of a hard hyphen most of the time.

    Too bad eBook readers are very inconsistent in their support for that. Some readers display an icon indicating an unknown glyph, many fail to insert the hyphen....

    Alas.

    That soft hyphen would have been a blessing for the German e-books. Some texts are flush with the overly long words, making them very hard to read.

    But Kindle (last time I checked) doesn't support it.

    Neither the Calibre and few other e-book viewers/editors I have tried in the past.

    In other words, in my experience the support is uniform and consistent: no support whatsoever, sadly.

    P.S. On top of it, the Kindle devices I have, also have the rendering and text selection bugs when displaying/selecting the text around words (even if they are hyphenated) which are longer than the single visible line.

    --
    All hope abandon ye who enter here.
  31. Re:from the what-until-they-get-a-load-of-this dep by ClickOnThis · · Score: 4, Interesting

    The hyphens make clear that you are using a compound adjective. In fact, a common error in writing is omitting hyphens when they are necessary. For example, someone writing I saw a man eating alligator probably meant I saw a man-eating alligator .

    This, this and this.

    Awhile ago, we saw a story on this site about a chocolate printer. Of course this was actually a chocolate-printer, a device that prints using chocolate. However, without the hyphen, it refers to a printer that is made out of chocolate. Without the hyphen, what are we to make of The Chocolate Lover's Cookbook?

    Hyphens are also important when one needs to disambiguate between compound adjectives and compound nouns. What's a high school building? A building that's a high school (a high-school building) or a school building that is high (a high school-building)?

    Hyphens are just another example of how we treat punctuation marks as though they were boogers, something to be expunged and discarded, kept away from ourselves and others. But without them, we cannot distinguish a panda bear who eats shoots and leaves from a mob hit-man who eats, shoots and leaves.

    --
    If it weren't for deadlines, nothing would be late.
  32. Re:from the what-until-they-get-a-load-of-this dep by macs4all · · Score: 2

    I imagine #they'd totally $freak at a @book about &perl.

    Or even better, APL.

  33. Re:Why hyphenation in an e-text? by macs4all · · Score: 2

    Doesn't the dam-ned text get re-flowed by the devi-ce or so-mething? That be-ing said, this is ridi-culous, all my prin-ted books have a few hy-phens, and I've ne-ver had any dif-ficulty rea-ding them. Maybe Ama-zon should just add "don't hyphenate" setting on their reading device and end it once and for all?

    The real question is, did he "hard-hyphen" the words, such that they wouldn't re-flow correctly; or did he just have lots of compound-adjectives, etc. that would actually call for hyphenation?

  34. Re:from the what-until-they-get-a-load-of-this dep by ClickOnThis · · Score: 2

    I imagine #they'd totally $freak at a @book about &perl.

    Or even better, APL.

    Or Brainfuck, more politely known as B****fuck.

    --
    If it weren't for deadlines, nothing would be late.
  35. Re:from the what-until-they-get-a-load-of-this dep by Immerman · · Score: 4, Informative

    Spoken language contains many variations of timing and inflection that clarify such things. Punctuation exists specifically to impart a rough approximation of those subtleties to the comparatively crude written language.

    --
    --- Most topics have many sides worth arguing, allow me to take one opposite you.
  36. Re:from the what-until-they-get-a-load-of-this dep by narcc · · Score: 5, Insightful

    without them, we cannot distinguish a panda bear who eats shoots and leaves from a mob hit-man who eats, shoots and leaves.

    No Oxford comma? Mod parent down!

  37. Re:from the what-until-they-get-a-load-of-this dep by narcc · · Score: 3, Funny

    It probably has a better chance than my book: Whitespace by example