Slashdot Mirror


Universal Ebook Format Debated

Amy Hsieh writes "A well-known ebook industry expert, Jon Noring, recently wrote an interesting article for eBookWeb, formally calling upon the ebook industry to adopt a single universal ebook distribution format. Right now there's a plethora of essentially incompatible ebook formats, and this format 'babel' is hampering the growth of the ebook industry. In the article, Mr. Noring proposes a promising open-standards candidate which appears to meet a list of basic requirements: The Open eBook Forum's OEBPS Specification. Andy Oram, a Linux programming editor for O'Reilly, wrote an interesting reply to the article that should also be read." On the other hand, Noring's proposal has also met with some skepticism elsewhere.

277 comments

  1. Babel? by Hogwash+McFly · · Score: 1, Funny

    Does this 'babel' format have any relationship with Babelfish? Please don't tell me it's used to translate books into different languages!

    Harry's Potter take ups his sword to slain the evil Mould a Wart

    --
    Mother, do you think they'll like this sig?
    1. Re:Babel? by aldousd666 · · Score: 5, Informative
      I thought we already had a standard: HTML

      Shows what I know.

      A couple of side notes: And how can you not know what babel is? Babel: Tower of babel: a story from the bible where King Nebekenezur (there is no correct spelling for that in english, just commonly accepted ones) wanted to build a tower to god, so god being jealous, put a spell on everyone, and they all ended up speaking a different language. It's how the christians believe that there came to be multiple languages.

      Now the website babelfish gets its name from 'The Hitchikers Guide to the Galaxy' by Douglas Adams, where the characters 'stick a babelfish in their ear' to act as a universal translator.

      --
      Speak for yourself.
    2. Re:Babel? by Anonymous Coward · · Score: 0
      I thought we already had a standard: HTML
      Which is utterly unsuitable as you get very little control over the presentation of the content. Presentation is incredibly important in publishing, whatever webmonkeys will tell you.
    3. Re:Babel? by Kris+Thalamus · · Score: 0

      No, they are referring to the tower of Babel.

      According to the bible, the entire human species once spoke the same language (a bit like how in fantasy novels a character can speak elfish or orc). Anyway, the bible says that some enterprising humans made an attempt to build a tower high enough to reach heaven. They almost made it there until Yahweh got nervous and decided to destroy the tower and give us incompatible languages so that we might never work collectively on a similar engineering project.

    4. Re:Babel? by jpmahala · · Score: 2, Informative

      BZZZT. Wrong.

      Nebuchadnezzar lived during the time of Daniel. The events of the Tower of Babel are chronicled in the book of Genesis.

    5. Re:Babel? by Anonymous Coward · · Score: 0

      Bur Nebuchadnezzar built the temple which is agreed to be the source of the Tower of Babel story. There's stuff from it in the Pergamon Museum in Berlin.

    6. Re:Babel? by Kris+Thalamus · · Score: 0

      "Come, let us build ourselves a city, and a tower with its top in the heavens, and let us make a name for ourselves, lest we be scattered abroad upon the face of the whole earth."

      And the Lord came down to see the city and the tower, which the sons of men had built. And the Lord said, "Behold, they are one people, and they have all one language; and this is only the beginning of what they will do; and nothing that they propose to do will now be impossible for them. Come, let us go down, and there confuse their language, that they may not understand one another's speech."

      So the Lord scattered them abroad from there over the face of the earth, and they left off building the city. Therefore its name was called Babel, because there the Lord confused the language of all the earth; and from there the Lord scattered them abroad over the face of the earth. - Genesis 11.

    7. Re:Babel? by Pxtl · · Score: 1

      Its a book. People read it. Hence, the text is whats important. Yes, if its a textbook or a graphic novel, presentation is important. For that, I accept that we need a standard (*cough* *rtf* *cough*) but otherwise, HTML or (god forbid) plain text works fine.

    8. Re:Babel? by Anonymous Coward · · Score: 0

      Nope, the presentation's important as well, sorry. If you've learnt anything today, you've learnt that.

    9. Re:Babel? by aldousd666 · · Score: 1

      Nimrod the Hunter. My bad.

      --
      Speak for yourself.
    10. Re:Babel? by Kris+Thalamus · · Score: 0

      Does this 'babel' format have any relationship with Babelfish?

      "Babel" is comes from two words, "baa" meaning "gate" and "el," meaning "god". It follows that it is "the gate of god." A related word in Hebrew, "balal" means "confusion".

    11. Re:Babel? by xtermz · · Score: 1

      Do you still use chamber pots for your bathroom, wood burning stoves to cook on, and ride a horse drawn cart to work?

      See, true inovation doesnt come from "it works now, why mess with it" mentality.

      What about an eBook about graphic layout, where rich text, graphics, and presentation make the book. There just are some things you cant do with HTML.

      And how to do you propose you encrypt and protect from distribution a bunch of html files and images? Thats the whole point of eBooks - a rich, open , and protectable standard.

      --


      I lost my concept of community when my community lost all concept of me.
    12. Re:Babel? by hammy · · Score: 1

      With CSS there's not a lot HTML can't do with layouts.

      How do expect protect something from distribution and cracking something that needs to decrypted to in order to be read? It's a Sisyphian task...

    13. Re:Babel? by Alan+Partridge · · Score: 1

      "It's how the christians believe that there came to be multiple languages."

      I don't think that a Christian neccessarily believes that crap - a Christian needs believe nothing more than that Christ's approach to living is something that they want to strive to emulate, all else is mumbo-jumbo of the worst sort.

      --
      That was classic intercourse!
    14. Re:Babel? by Alan+Partridge · · Score: 1

      My goodness, what a load of nonsense.

      --
      That was classic intercourse!
    15. Re:Babel? by jo42 · · Score: 1

      s/mumbo-jumbo/brain wash/

    16. Re:Babel? by Anonymous Coward · · Score: 0
      Microsoft eBooks are just HTML files wrapped in a compressed/encrypted container format which can be decrypted and decompressed with the appropriate reader key. Microsoft has never revealed the file format or the encryption, but it's widely known and anyone with the CLIT (Convert LIT) tool can unpack into the original HTML.

      A ZIP file with a password on it does exactly the same thing, except using open standards. Given that MS's private formats are equally "insecure" after someone becomes motivated and takes 3 days (at most) to do the reverse-engineering, why not just use open formats?

    17. Re:Babel? by Cinematique · · Score: 1

      Nevermind the fact that HTML (True HTML, not a hack or Internet Explorer function) doesn't have a tag for indenting... among other problems...

    18. Re:Babel? by dkragen2002 · · Score: 1

      but it's widely known and anyone with the CLIT (Convert LIT) tool

      Which MS uber-genius came us with that acronym? MS wouldn't use it, you can't achive world-domination with only 50% market share :)

      Dave

    19. Re:Babel? by X_Caffeine · · Score: 1
      What about

      ? With a CSS declaration of text-indent:5% of course.

      When you use HTML properly (i.e. wrapping paragraphs in -- gasp -- paragraph tags), it works great.

      --
      // I will show you fear in a handful of jellybeans.
    20. Re:Babel? by Cinematique · · Score: 1

      CSS is not HTML, last time I checked. Did something change?

    21. Re:Babel? by Anonymous Coward · · Score: 0

      Yes, something changed. Starting with xhtml 1.0, html is designed to be used with CSS.

    22. Re:Babel? by Cinematique · · Score: 1

      As opposed to HTML 4, which is incompatible with CSS, right? /sarcasm

  2. Sounds easy to me... by WestieDog · · Score: 3, Insightful

    What about .txt?

    1. Re:Sounds easy to me... by Anonymous Coward · · Score: 0

      Completely, utterly unsuitable. The fact that you got an "insightful" shows how little thwe moderators really know.

      Answer me this: how do you do a table of contents in .txt? Now how do you make sure everyone does it the same way? Now do you see how we've started inventing a standard e-book format?

      Also, how do you do illustrations in .txt?

    2. Re:Sounds easy to me... by flakac · · Score: 2, Insightful

      .txt is simply not sufficient to replace a book. A book contains graphics, charts, indexes, etc. and there's no reason these should not be exploited in any eBook format. Also, there's no reason not to extend the paradigm to allow for readers to make annotations in the book, just like you can in real paper-based books.

    3. Re:Sounds easy to me... by Enry · · Score: 1

      .txt has no markup capabilities. No way to signal chapter headings, etc.

      what about HTML? Compressed HTML at that.

    4. Re:Sounds easy to me... by jetmarc · · Score: 1

      > What about .txt?

      I don't think that you are serious about .txt - it is a good format to
      deliver textual information. But a good book contains more than just
      that. The layout, font and style makes reading easy or difficult,
      whatever publisher intended. Illustrations and other artwork (eg
      formulas) improve accessibility of the delivered information.

      A .txt hardly does this.

      Except for being a proprietary thing, I consider .PDF (not the ebook
      variant) a good format for books. It's not perfect, and its outstanding
      advantages over other formats seem to degrade and vanish with every
      new version of the format & reader. PDF 1.3 was a good thing (Acrobat 4.0)
      and I would really like to see PDF 1.3 or a carefully selected subset
      of PDF 1.3 being standardized for ebooks. In my dreams.. :(

      Marc

    5. Re:Sounds easy to me... by DrXym · · Score: 3, Insightful
      Have you ever tried to read something from Project Gutenberg in text format? It's horrible! Try downloading something like the King James Bible and go mad as you slog through in 80 character monospaced print. Furthermore, say goodbye to an pictures or diagrams, e.g. illustrations in Alice in Wonderland.


      A standardized rich text format is absolutely required, one which defines document structure so you get all the goodness like chapters, quotations, sidebars, footnotes, images etc., but doesn't impose how it should be laid for the most part, or the layout is specified by an accompanying style sheet.


      Something like docbook might be suitable, but some of its more gross or esoteric things would have to be pruned or moved into different levels of support for the sake of simplicity.

    6. Re:Sounds easy to me... by Lumpy · · Score: 1

      But a good book contains more than just
      that. The layout, font and style makes reading easy or difficult,
      whatever publisher intended. Illustrations and other artwork (eg
      formulas) improve accessibility of the delivered information.


      Funny.. the last 3 good books I read had none of this...

      Text books and manuals? yes.. a good book? nope.

      I suggest you go into the fiction and non fiction isles and pick up a few books and learn what is in the bulk of publications... Words... no pretty pictures and charts... but simple expressive words.

      and yes .txt is very useable for a E-book format. I can bookmark it and a table of contents is quite easy to impliment..

      --
      Do not look at laser with remaining good eye.
    7. Re:Sounds easy to me... by Alan+Partridge · · Score: 1

      "Also, how do you do illustrations in .txt?"

      ASCII art, of course! Are you kidding me?

      --
      That was classic intercourse!
    8. Re:Sounds easy to me... by msx2 · · Score: 0
      With ascii art!

      \\|||||//
      o)-._.-(o
      (__ ^ __)
      //|\U/|\\
      --
      -- Joao Barata (http://jb.home.sapo.pt) http://b-tools.hopto.org * http://ebk.home.sapo.pt
    9. Re:Sounds easy to me... by msx2 · · Score: 1

      You know what they say...
      "Great minds think alike" :)

      --
      -- Joao Barata (http://jb.home.sapo.pt) http://b-tools.hopto.org * http://ebk.home.sapo.pt
    10. Re:Sounds easy to me... by msx2 · · Score: 1

      But you can anottate in TXT, even without destroying the contents of the text (last link on my sig...) agreed that graphics and most complex charts are irreproducible in plain text, but then again that's why you have Rich Text Format...

      --
      -- Joao Barata (http://jb.home.sapo.pt) http://b-tools.hopto.org * http://ebk.home.sapo.pt
    11. Re:Sounds easy to me... by mwood · · Score: 1

      Sonny Jim, I've done graphics on a *keypunch*. ASCII art is a relatively simple problem.

    12. Re:Sounds easy to me... by Anonymous Coward · · Score: 0

      .txt won't be able to show formulae easily. Also the article mentions including snippets of audio and video into the book.

      Why you got modded insightfull I don't know.

    13. Re:Sounds easy to me... by lucretio · · Score: 1

      How about XML? HTML is for hypertext. I wouldn't consider a book hypertext. XML could be used to mark up a book using <chapter>, etc.

    14. Re:Sounds easy to me... by Happy+Monkey · · Score: 2, Insightful

      If I were choosing an edition of The Hobbit with or without Tolkien's illustrations, I'd choose the one with them. Likewise the Alice books and Tenniel's illustrations. A format that allows illustrations doesn't require them.

      However, there is certainly an important place for plain text ebooks - they are easy to convert to more complete formats. The efforts of Project Gutenberg aren't wasted.

      --
      __
      Do ya feel happy-go-lucky, punk?
    15. Re:Sounds easy to me... by gmhowell · · Score: 1

      While it sounds flippant, you are dead on. My problem is that there are no easy solutions for deaf-blind people. I'd like to get one of these for my wife, but it only takes text input. So I'm limited to Gutenberg and bookwarez.

      --
      Jesus was all right but his disciples were thick and ordinary. -John Lennon
    16. Re:Sounds easy to me... by The+Salamander · · Score: 1

      Try downloading something like the King James Bible and go mad as you slog through in 80 character monospaced print.

      I don't think your experience has anything to do with the print.

    17. Re:Sounds easy to me... by Tacky+the+Penguin · · Score: 1

      Have you ever tried to read something from Project Gutenberg in text format? It's horrible! Try downloading something like the King James Bible and go mad as you slog through in 80 character monospaced print. Furthermore, say goodbye to an pictures or diagrams, e.g. illustrations in Alice in Wonderland.

      The advantage of text format is that the user (you) can choose any font and text width desired. It is, truly, the universal standard.

      The only disadvantage is that the author can't use things like italics and bold print for emphasis. For that, HTML works fine.

    18. Re:Sounds easy to me... by DrXym · · Score: 1
      Sure you can choose any font, if you want your text to appear all ragged and uneven. And no you can't choose any width since Gutenberg is formatted and hyphenated to a set width.


      Furthermore, you also can forget your reader being able to figure out what is a chapter heading and what isn't, or who wrote the book, or generate an index. Any illustrations also fly out the window too, and if you're expecting Unicode, well too bad about that too.


      Any decent book format would preserve all these things.

  3. Well by The-Bus · · Score: 4, Funny

    If the software pirating industry can all agree on plain text "NFO" files with ASCII-painted flames, dragons eating your group's logo, and pot leaves surrounding shout-outs to your boys on efnet, I think the slightly more professional and law-abiding ebook industry can agree on a standard format.

    --

    Small potatoes make the steak look bigger.

    1. Re:Well by PerryMason · · Score: 2, Funny

      Yeah, and typical Microsoft went and broke the standard associating .nfo with System Info files in Win2k. Those guys just never stick to the standards...

      --
      "I'm tired of all this 'Aren't humanity great' bullshit. We're a virus with shoes" - Bill Hicks
    2. Re:Well by crisco · · Score: 1

      I think that just maintains the 'barrier to entry' to the warez scene. What with all the p2p and all these days you can't make it too easy to massively infringe on other's copyrights.

      --

      Bleh!

    3. Re:Well by ncc74656 · · Score: 1
      Yeah, and typical Microsoft went and broke the standard associating .nfo with System Info files in Win2k.

      Fortunately, Win2K added "Open With..." to the context menu. Add %systemroot%\system32\edit.com to the programs you use to open .nfo files and you're all set. (By using the old-school DOS editor instead of Notepad or Wordpad, you get compatibility with the IBM graphics used in many .nfos.)

      --
      20 January 2017: the End of an Error.
    4. Re:Well by rulethirty · · Score: 1

      Let's see... Microsoft has already made default associations for .inf and .nfo, there aren't many more choices left for our poor friends over in the software pirating inudustry considering they want to keep the standard 8.3 file naming convention... may i suggest the extension .pr8 or .el8 or .wrz?

    5. Re:Well by intermodal · · Score: 1

      and then trogdor come in the NIIIIIIIIIGHT!

      --
      In SOVIET RUSSIA... erm...NSA AMERICA, the Internet logs onto YOU!
  4. My ebook format by RenQuanta · · Score: 3, Informative

    Is Project Gutenberg and a Palm Pilot.

    1. Re:My ebook format by Anonymous Coward · · Score: 0
      Yes, although IMHO, a Newton works better than a palm, the screen is bigger, so you can use a bigger font, or have more txt on the screen at once. The newton's nicer for bedside reading, the palm works best to have a few books on one's person at all times. I've used both.

      What's with all the people complaining about "boo-hoo .txt doesn't do pictures"? What kind of books do you read? I've never had a problem with Gutenburg etexts, or even really bad OCR'ed etexts full errors, but then I guess I just have a tolerance.

  5. Better readers needed by Baumi · · Score: 5, Insightful

    I don't think any format will get Ebooks to catch on until we have reader hardware that makes reading those books at least as pleasant as reading a paper book.

    Here's hoping that all those e-paper efforts will produce something usable soon.

    1. Re:Better readers needed by FeloniousPunk · · Score: 2, Interesting

      I think the same thing as you about the dedicated reader devices, but I've started using the reader software that comes with my Pocket PC and now I actually prefer that to reading from a paperback. It's convenient since I usually have the thing with me most places I go, it's smaller than a paperback and you don't have to turn pages or worry about your bookmark falling out. I can navigate through the book pretty quickly with the directional pad, even faster than turning a page physically. And I can carry quite a few books with me, as most novels clock in at about 500 - 700 kb.
      I think it's a big waste to invest in a dedicated reader (that costs significantly more than my Pocket PC btw), but having that functionality in my PPC is just great.
      On the original subject, I think a universal format would be a good idea. Between Pocket PC and Palm, there are many PDA users and if publishers could reach a significant fraction of them, they'd probably get a good return on investment with eBooks. Having multiple formats (Palm, MS Reader and Adobe, probably more that I don't know of) complicates this though. I am a little frustrated since I've come across books I'd like to purchase but are for a different formate than what I use (and I don't want to run multiple eBook readers on my PPC, for various reasons). Having a universal format would be great.

      --
      I know this because Tyler knows this.
    2. Re:Better readers needed by jridley · · Score: 1

      I also like reading on a device; for me it's a Palm. It's not that I don't like paper better, but the Palm has other advantages that outweigh the crappiness of the display (my IIIxe's green display that is). Number one is I always have it. I get a lot more books read now that I have one to read if I'm 5 minutes early for a meeting, or have to wait for the dentist. Number two is I never have to remember a bookmark, the prog always remembers where I left off.

    3. Re:Better readers needed by skroz · · Score: 1

      I have an REB1100 and it's GREAT. Font scaling, bookmarks, dictionary, lots of storage... I have few complaints.

      The same cannot be said for the "Upgrade" model, the GEB1150. When Gemstar took over production from Rocket, then withdrew from manufacturing with RCA, they reworked the thing and made it useless. Gone are the ability to change font sizes beyond the default two loaded into the device, screen rotation, dictionary, the ability to load custom content, and the ability to backup content to a local PC. This "upgrade" broke everything that was useful about the REB1100. And it costs MORE.

      Fortunately, I was able to find a few REB1100s on ebay and keep one as a spare.

      --
      -- Minds are like parachutes... they work best when open.
    4. Re:Better readers needed by ClubStew · · Score: 1

      I don't like running multiple copies, either. Personally, I prefer MS Reader on my Pocket PC because Adobe eBook Reader takes way too long to open, uses too much memory, and takes up too much space (which really goes along with too much memory). I'd rather remove it.

      Even if the debates prove to be useful and a new format arises, I'm betting that Microsoft and Palm will probably integrate the new format in their eBook readers, but Adobe will probably do their own thing - they think they invented the "electronic document" and how dare anyone tell them how to do it!

    5. Re:Better readers needed by Epistax · · Score: 1

      I agree entirely. It would be awesome if a company would finally release a bunch of screens of different sizes and shapes. Each would take the same kind of memory card, and read books from it. If the tablet PC's were 1 cm thick, that'd be a good source.
      I would expect these to be extremely basic, so as to keep the cost low. Perhaps an on/off button, a flip page forward/backwords button, some sort of autogoto, and maybe a touch sensitive screen so you can write on it with any object (and it's saved to the memory).

    6. Re:Better readers needed by BFedRec · · Score: 2, Insightful

      I couldn't agree more. There is nothing that comes CLOSE to the pleasure of a REAL book. The smell of the pages of a familiar old book, the texture of the paper between your fingertips... these are things that it will be very hard to replace with a digital version. I'm certain that referance works and such will benefit greatly and be wonderful to have in a practical digital format... but even when the digital papers get to a functional point, I think there will still be a place for "real" books, if not I'll never sleep for the visions of Fahrenheit 451 going through my head.

      BFedRec

    7. Re:Better readers needed by zby · · Score: 2, Funny

      I wholehartedly agree! The readers are not good enough - they don't read carefully, skip important parts, so quickly forget what they have read. It's time to have better readers.

    8. Re:Better readers needed by TheRaven64 · · Score: 1
      I don't think any format will get Ebooks to catch on until we have reader hardware that makes reading those books at least as pleasant as reading a paper book.

      Don't forget all of the other uses for books. When not in use they make good and decorative wall insualtion, and improve the acoustic characteristics of a a room. Oh, and they smell better than electronics. And don't need electricity.

      I guess it depends on the type of book. For a lot of reference works I often prefer to use the PDF version, and print the occasionals chapter. For fiction nothing beats a real paper book.

      --
      I am TheRaven on Soylent News
  6. I dont know.... by AmoebafromSweden · · Score: 5, Interesting

    Why should i choose a format that have all possibilities to have DRM included in the future thus allowing only one read. And will require Electricity to read.

    This is especially true for for factbooks who are often used as reference and not to be read just one time.

    So far Ebooks cant beat the paper version in portability, convenience and ease of use.

    Paperbook still seems more favorable to me.

    1. Re:I dont know.... by RobotRunAmok · · Score: 3, Interesting

      Why should i choose a format that have all possibilities to have DRM included

      Because you might want to read an e-book of something NOT in the public domain, e.g, a current novel, and few authors or publishers are going to render their wares into a format that is going to end up on free P2P. There needs to be some way to ensure that money changes hands.

      You were planning on paying for the books you read, weren't you? Or is this all just an exercise in seeing how we can best Napsterize the publishing indutstry?

    2. Re:I dont know.... by Anonymous Coward · · Score: 2, Insightful

      You were planning on paying for the books you read, weren't you? Or is this all just an exercise in seeing how we can best Napsterize the publishing indutstry?

      I was.

      Because you might want to read an e-book of something NOT in the public domain, e.g, a current novel, and few authors or publishers are going to render their wares into a format that is going to end up on free P2P.

      .. because the publishing industry wants to sell me a non transferable license, not a book that can be lent, sold on etc. There are good reasons for being anti DRM which most people who aren't thick as fuck can see.

      -- ac

    3. Re:I dont know.... by RobotRunAmok · · Score: 0, Flamebait

      .. because the publishing industry wants to sell me a non transferable license, not a book that can be lent, sold on etc.

      You want to lend it out, sell it on e-bay? No prob. Buy the paper version, cuz paperbacks ain't going away anytime soon. You want to be tres cool and read a novel on your PDA/digital camera/linux console/Dick Tracy watch, well sure thing, Sex God, but you'll pay for the privelege. Kind of how people pay 4x the cost for a hardcover version of a book to read it "first." Or you can go support your local used bookseller and buy paperbacks for a fraction of their cover price. You want consumer choices? You got plenty.

      And don't get all "public good" and "social revolution" with me here, Bunky. You want to read the latest best-seller for free? Go to your local Public Library. Uncle Sam and the publishing industry worked out this whole "public good" thing way before either one of us were born.

    4. Re:I dont know.... by jeti · · Score: 2, Interesting

      > There needs to be some way to ensure that money changes hands.

      Yes. However, I want to be sure that I can still read the book
      after my original hardware broke or I replaced it with a newer
      model from another manufacturer. And even if the publisher and
      the producer of the reader have gone ot of business.

      When there is a standard for e-books that ensures I can keep
      reading em, I'm willing to pay.

    5. Re:I dont know.... by RobotRunAmok · · Score: 1

      When there is a standard for e-books that ensures I can keep
      reading em, I'm willing to pay.


      I completely agree with you. And until that degree of sophistication -- flexibility for the reader and security for the publisher -- is reached in the e-book software format, e-books of non-public-domain works will be scarce and multi-colored.

      Look, even here on this "geek" board, loud are the arguments made for the current "analog" standard (paper). You can bet that the publishing industry is not racing to embrace any seachanges in distribution if even the bleeding edge slashdotters are uncertain of any value to be found in the change.

    6. Re:I dont know.... by yerricde · · Score: 1

      You want to read the latest best-seller for free? Go to your local Public Library.

      If my local library refuses to carry a particular title, then what do I do?

      --
      Will I retire or break 10K?
    7. Re:I dont know.... by Sloppy · · Score: 1
      There needs to be some way to ensure that money changes hands.
      That is a social/legal problem, and cannot be addressed with technical measures.

      If a format that uses DRM is deployed, it will remain obscure, hard-to-use, and relatively unpopular until it is cracked. After it is cracked, then possibilities appear:

      • There can be competition among tools for reading/playing the content. This allows innovation.
      • There can be tools for conversion, migration, backup, etc. (Yes, some of these tools will be abused, just as a lead pipe can be abused by Colonel Mustard in the library. But there are ways to deal with that risk -- same as lead pipes.)
      and then these things will eventually cause purchasing the content to become a sane and safe investment. Then people will be able to start selling content and make real money.

      I propose skipping the industry-is-retarded-while-everyone-waits-for-the- crack step and go straight to the make-money step. Look at what you can do with DVDs nowdays. I hesitated to buy DVDs in the late 1990s, but nowdays I don't think twice: because I know I'll have the content forever, like I do with an audio CD.

      Or is this all just an exercise in seeing how we can best Napsterize the publishing indutstry?
      No, but if there are any technical barriers that make Napsterizing difficult, then utility value will be severely reduced (at least, so far no one has thought of a way for that not to be the case) and sales will reflect that. Remember: until we have strong AI and lawschools for the AIs to attend, a machine will never know the difference between good and bad.
      --
      As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
    8. Re:I dont know.... by Tacky+the+Penguin · · Score: 1

      Baen is doing very well (and making money) with its ebook sales. Rather than forcing the user to choose one format, several are offered (html, .pdf, etc.) None have DRM, and all can be freely copied.

      Buy Baen ebooks and prove to the industry that they don't have to be so agressive with their copy protection to make money.

    9. Re:I dont know.... by Tacky+the+Penguin · · Score: 1

      Baen books is offering ebooks without DRM, and is doing very well.

      In theory, offering copyable ebooks should lose them money. In practice, they are making money. Someone needs to update their theory.

    10. Re:I dont know.... by RobotRunAmok · · Score: 1

      I'm a big fan of Baen books, David Weber's stuff in particular. (In fact, lemme plug him here again.) I'm also a big fan of Harlan Ellison's work, and Mr. Ellison's perspective is a bit different from Mr. Baen's.

      One size does not fit all. The Rolling Stones and Harlan Ellison don't need or want their work freely dispersed to generate "a buzz." For a prolific but otherwise "B" genre author like Mr. Weber (and many of those under the Baen umbrella) the promotional interest the freebies generate for the back catalog outweigh whatever retail losses their might be as a result.

      Do you think for a moment that if Tom Clancy signed with Baen he would allow them to do with his stuff what they've done to Weber's? Now, why would that be?

      Someone needs to update their theory.

      Update your own theory to account for individual author's popularity, prolific-ness, back catalog, and target audience, and get back to us.

    11. Re:I dont know.... by RobotRunAmok · · Score: 1

      If my local library refuses to carry a particular title, then what do I do?

      Seriously? Talk to your local librarian and make a request. Very frequently the tome you seek may be stacked not at your local branch but will be available within the greater library system of which your local library is a part. It may take a little extra time to get it, but keep reminding yourself that you will be reading it for free.

  7. Like the word processing industry by sould · · Score: 4, Insightful
    Right now there's a plethora of essentially incompatible ebook formats, and this format 'babel' is hampering the growth of the ebook industry.


    Yup - just like there's a plethora of essentially incompatible word processing formats - hampering the growth of the office/word processing market.


    But the industry doesn't matter to one player - only their market share does.


    The only way to really win this sort of thing is to persuade all (or at least most) consumers to boycott products that deliberately break compatability with standards.


    But how likely is that to happen?

    1. Re:Like the word processing industry by Surak · · Score: 2, Funny

      Yup - just like there's a plethora of essentially incompatible word processing formats - hampering the growth of the office/word processing market.

      What plethora of formats? Everyone knows there's only the Word *.doc format!

    2. Re:Like the word processing industry by Zeriel · · Score: 2, Insightful

      What plethora of formats? Everyone knows there's only the Word *.doc format!

      Yes, yes...but WHICH Word *.doc format?

      (By my recollection there've been at least four slightly incompatible ones. (95, 97, 2k/XP, 2k3))

      --
      "America has done some terrible things. But I know that Americans don't cheer when innocents die." -Dave Barry
    3. Re:Like the word processing industry by Surak · · Score: 1

      For the humour impaired -- that was part of the joke.

    4. Re:Like the word processing industry by sjames · · Score: 1

      There are a few factors that make the difference. Word processing is ubiquitous in business and very common for personal use as well whereas ebook users are virtually non-existant by comparison.

      Most word procesor packages have a variety of formats they can read or write, some of them more or less standards. While they tend to screw up formatting somewhat when they do convert formats, it's better than nothing.

      A major point of ebooks is that you can have a convieniant handheld reader with all of your books on it and that you can select based on the features you want. Having a different format for each publisher and each reader means either restricting your convieniant reading to the subset of publishers who make books available in a format your reader has, or destroying the convieniance by carrying several readers around (and spending the bucks for all the various readers). The fact that having multiple readers is pointless duplication in the service of business decisions made by pointy headed drooling morons will anger the consumer and could halt widespread adoption of ebooks (in any format). Unlike some concepts in technology, the average person will be able to figure out that since they don't need a different TV for each station they watch, there is no reason to need a different ebook for each publisher either.

      Finally, consumers are VERY familiar with the paperback book. Even iliterate people recognize a paperback and know in general how they work. They know you can lend it, resell it, or give it away, or check it out of the library. It requires no batteries, and you don't have to figure out if it's compatible with your reading glasses before you buy. If there's a page or two you need to share, any old copier will duplicate those pages for you (possibly for a small fee but most just use the copier at work). They also know that it will last a lot longer than their reading glasses, and that new glasses won't mean they have to buy all of their books over again.

      In contrast, when word processors came out, the only thing they had to compete with was the combination of hunt and peck on an electric typewriter, the big clunky copier, and the fax machine that jams half the time and needs special paper.

    5. Re:Like the word processing industry by Zeriel · · Score: 0, Offtopic

      I suppose it'll shoot my karma to admit I was trying to be modded funny, too. =P

      --
      "America has done some terrible things. But I know that Americans don't cheer when innocents die." -Dave Barry
  8. Why propose a different standard? PDF!!! by adzoox · · Score: 4, Interesting
    Why do we have to propose a different standard? Just because it's a slightly different industry than computers, most eBooks are going to be READ on computers. Wouldn't PDF be perfect? Doesn't Adobe have an eBook PDF format? If I'm not mistaken a PDF can be locked and encrypted. This is also the same as DVD+/- Minus is the better standard, but companies with deeper pockets and greedier "proprietary minded" philosophy back Plus. Standards make sales PERIOD!

    I think this was the mistake of the iTunes Music Store. While not terrible (actually slightly better quality) AAC is not as universal a standard as Mp3 or even Ogg. There are WAYS to encrypt and secure those formats. Napster, just before its demise, had figured out how to secure MP3's that were downloaded from it's system.

    --
    Yell & scream & rant & rave... it's no use... you need a shaaaave ~ Bugs Bunny
    1. Re:Why propose a different standard? PDF!!! by Mwongozi · · Score: 2, Insightful

      AAC is not as universal a standard as [...] Ogg.

      Oh rubbish. AAC is used way more than Vorbis (which is what I assume you meant) is. Apple's target market was big enough to overtake Vorbis usage in a single day, I'd bet.

    2. Re:Why propose a different standard? PDF!!! by Anonymous Coward · · Score: 0

      Um... hate to say this but The iTunes Music store has been an amazing sucess.

    3. Re:Why propose a different standard? PDF!!! by junklight · · Score: 2, Informative

      You obviously didn't read the article.

      PDF (while a great standard) doesn't do reflow very well. So on a handheld - page size becomes a total pain in the arse.

    4. Re:Why propose a different standard? PDF!!! by Anonymous Coward · · Score: 1, Informative

      Then the writer didn't do his research - Adobe has an ebook compatible PDF format

    5. Re:Why propose a different standard? PDF!!! by Anonymous Coward · · Score: 1, Informative

      I don't think he was debating that - he was debating standards. The iTMS doesn't use a standard encoding format - there are also COMMON standards out there that would have been as eay to use (in my opinion) and I suppose the ogg vorbis format would have made a lot of /.'s happy too! But in the end, I suppose they didn't go with ogg because they knew Linux hacks would QUICKLY and easily break the encoding/encryption scheme.

    6. Re:Why propose a different standard? PDF!!! by JimDabell · · Score: 1

      PDF preserves layout. This is useless when an ebook has to be read on devices ranging from handhelds to 21" monitors.

      You are right though, leveraging existing work is always good. What's wrong with DocBook?

    7. Re:Why propose a different standard? PDF!!! by Tsu-na-mi · · Score: 4, Interesting

      PDF only meets half the goals IMHO.

      You really have 2 types of documents, and the chosen format needs to support them both well. PDF is well-suited to Magazines or other content with lots of graphics where the layout of the page is important. Such content could be standardized one or more standards of 'page size' for this kind of thing is chosen.

      The other type of content is more like a novel, where it's just a very plain free-flow of text. Here, it would be nice to have the device render layout, allow the user to up the font size, etc. Something along the lines of plain vanilla HTML 1.0 would fit well. PDF explicitly positions everything, so it would be bulkier and less flexible.

      As I see it, 2 formats are needed, one with set layout and positioning, and one for free-flowing text. PDF and a stripped-down HTML would seem to fit the bill nicely.

      --
      I've built up so much character I have an alter-ego
    8. Re:Why propose a different standard? PDF!!! by Alan+Partridge · · Score: 1

      "The iTMS doesn't use a standard encoding format"

      Eh? AAC is documented and licenseable - by standard do you in fact mean 'open' or 'free'?

      --
      That was classic intercourse!
    9. Re:Why propose a different standard? PDF!!! by gblues · · Score: 2, Informative
      PDF (while a great standard) doesn't do reflow very well. So on a handheld - page size becomes a total pain in the arse.


      Like hell it doesn't. Like many things to do with PDF, it all depends on what you use to create your PDF. You'll find that a PDF created from a page layout program (PageMaker, InDesign, FrameMaker) through Acrobat Distiller reflows a lot better than a PDF made from MS Works using some archaic version of Acrobat.

      Nathan
    10. Re:Why propose a different standard? PDF!!! by junklight · · Score: 1

      and so - PDF is not the ideal standard. The goal is to get an eBook standard that always flows nicely and produce nice readable books. PDF can do it in some cicumstances but thats not the same as it being one of the platforms strengths.

    11. Re:Why propose a different standard? PDF!!! by coandco · · Score: 1

      The other nice thing is that you don't even need to buy the full version of Adobe Acrobat to make perfectly good PDF files. There are several ways of setting up a virtual printer to turn anything you print into a PDF file. You can see one such method at http://pad.olsonnetwork.com. I use it all of the time! Clint Olson http://www.olsonnetwork.com

    12. Re:Why propose a different standard? PDF!!! by Anonymous Coward · · Score: 0

      If you've ever looked at the PocketPC version of AcroReader (which I use on m iPAQ occasionally), it has an algorithm to re-flow text to better fit the screen (and yes, you can change the type size by changing the zoom). PDF, like any other document format, doesn't *require* that the renderer follow it explicitly. Ever use GhostScript's Extract TXT function? Same idea: the original, unformatted copy is there, you just have to decide what to do with it (and the formatting instructions).

  9. Survival of the best-marketed, I guess by heretic108 · · Score: 5, Interesting

    I had a taste of incompatible e-Book formats when I got my first colour Palm.

    Sadly, there were better (open) formats using better compression and rendering, losing out to closed formats with big marketing push.

    The format that ultimately prevails will not necessarily be the best. It'll be the format pushed by those with the greatest marketing skills/budget, and the one which gives them the greatest control over how their works are used.

    It wouldn't surprise me if authors are already signing e-book distribution deals which forbid them from releasing in rival formats.

    One of these days, the masses will choose software and data formats according to quality and freedom.

    But something within me suspects that the Pope will convert to Islam, and the Jews will profess the divinity of Christ first.

    --
    -- In the beginning was the WORD, and the WORD was UNSIGNED, and the main(){} was without form and void...
  10. FictionBook XML by ironhide · · Score: 4, Informative

    There also is this e-book xml format:
    http://haali.cs.msu.ru/pocketpc/FictionBo ok_descri ption.html

    I use his excellent HaaliReader as a text reader on my pocketpc (fullscreen, landscape mode). There are also html2xml and word2xml tools on his site.

  11. The ONLY Universal EBook Format! by Anonymous Coward · · Score: 5, Informative

    The Project Gutenberg Etexts should so easily used that no one should ever have to care about how to use, read, quote and search them ...

    This has created a need to present these Project Gutenberg Etexts in "Plain Vanilla ASCII" as we have come to call it over the years.

    The reason for this is simple. . .it is the only text mode that is easy on both the eyes and the computer.

    However, this encourages others to improve our etexts in a variety of ways and to distribute them in a variety of the available media, as follows:

    Once an etext is created in Plain Vanilla ASCII, it is the foundation for as many editions as anyone could hope to do in the future. Anyone desiring an etext edition matching, or not matching, a particular paper edition can readily do the changes they like without having to prepare that whole book again. They can use the Project Gutenberg Etext as a foundation, and then build in any direction they like.

    Thus any complaints about how we do italics, bold, and the underscoring, or whether we should use this or that markup formula are sent back with encouragement to do it any ways any person wants it, and with the basic work already done, with our compliments.

    The same goes for media. We have had a long-standing work ethic of providing our etexts in any medium people wanted: Amiga, Apple, Atari. . .to IBM, to Mac, to TRS-80. . .

    However, now that our etexts are carried in so many BBS's, networks and other locations, it is easier to download the file in a manner that puts them in your format than we can make and mail a disk, so we don't really do that too much.

    The major point of all this is that years from now Project Gutenberg Etexts are still going to be viable, but program after program, and operating system after operating system are going to go the way of the dinosaur, as will all those pieces of hardware running them. Of course, this is valid for all Plain Vanilla ASCII etexts. . .not just those your access has allowed you to get from Project Gutenberg. The point is that a decade from now we probably won't have the same operating systems, or the same programs and therefore all the various kinds of etexts that are not Plain Vanilla ASCII will be obsolete. We need to have etexts in files a Plain Vanilla search/reader program can deal with; this is not to say there should never be any markup. . .just those forms of markup should be easily convertible into regular, Plain Vanilla ASCII files so their utility does not expire when programs to use them are no longer with is. Remember all the trouble with CONVERT programs to get files changed from old word processor programs into Plain Vanilla ASCII?

    Do you want to go through all that again with every book a whole world ever puts into etext?

    The value of Plain Vanilla ASCII is obvious. . .so is very much of the value of most of the various markup systems we have in the world. But until some real standards arrive-- we would be limiting our options a great deal if we do not keep copies of all etexts in Plain Vanilla ASCII as well.

    We don't have anything against markup. Not vice versa.

    Alice in Wonderland, the Bible, Shakespeare, the Koran and many others will be with us as long as civilization. . .an operating system, a program, a markup system. . .will not.

    This includes the many requests we have for compression in particular formats. There are only two formats we know of that are suitable for transfer to a wide general audience: Plain Vanilla ASCII (.txt files) and ZIPped files of them, (.zip files). Requests for other compression formats must be ignored as they are appropriate only for small portions of our target audience. However, (programmers take note: we will need help) we are planning to put some compression links on our files so they can be transmitted in any of an assortment compression formats on the fly. i.e. we should be able to generate any kind of file asked for, but we can keep only one copy of each etext on our servers. . .as the .Z compression format does in a similar manner today.

    1. Re:The ONLY Universal EBook Format! by PhilHibbs · · Score: 1

      Unleash the awesome power of plain text, as I always maintain.

    2. Re:The ONLY Universal EBook Format! by Anonymous Coward · · Score: 0

      WAP (or rather WML) only provides a limited formatting set, and look how widely successful WAP is today... not.

    3. Re:The ONLY Universal EBook Format! by Ben+Hutchings · · Score: 1

      Yeah, it's so great that the Gutenberg Project only uses plain text with no markup of metadata, indexes, chapter headings and so on. I'm sure everyone enjoys having to manually pick these things out when republishing them.

    4. Re:The ONLY Universal EBook Format! by Anonymous Coward · · Score: 0

      I am absolutely amazed that anyone could possibly be so dense. I read some books from Gutenberg. I couldn't tell you just how interesting the books became when to understand the text completely, I was told to look at an image. Which was displayed in the Gutenberg text as [Image Here].

      Personally as a software developer I'm extremely fond of ISO8859-1, however I don't think it is proper method of viewing books. I think that books should contain italics and bold and images. It makes them more enjoyable to read.

      I like the idea of a universal format of e-books, however I'd like to get past the issue of making an e-book with images and files but not wrapped. They should be able to exist in a single file with a proper extension. A zip file is nice and should be renamed mybook.ebk

    5. Re:The ONLY Universal EBook Format! by shockbeton · · Score: 4, Interesting

      I'm happy you brought Project Gutenberg into the discussion. PG is a wonderful resource. When the project was nascent in the 1970s and 80s reading ASCII on your TRS-80 probably seemed pretty neat. But now that the PG dream of preserving and distributing the printed word through digital technology has stagnated into a dogmatic cult with the goal of preserving ASCII it's time to reevaluate the meaning of Project Gutenberg.

      Those of us who are literate and computer savvy and have seen places other than the USA recognize the harm that reducing printed material to chunks of ASCII does. And far from mere loss of formatting or typographical embellishments much of the meaning of a text is destroyed when run through the chunky sieve of ASCII conversion. Most accented Roman characters cannot be rendered in ASCII. Non-Roman characters cannot be rendered in ASCII. Typographical features such as relative type size, style, and formatting are either lost entirely or reduced to the low-res rendering capabilities of monospaced ASCII. ASCII has no provision for rendering traditional methods of communicating typographically such as small caps, ligatures, distinction between hyphen, endash, and emdash, etc. despite the fact that virtually every printed text makes use of these features.

      Digital technology has progressed without our friends at Project Gutenberg. There is an alternative to ASCII which is now standard to all major computing platforms: Unicode. From the unicode.org website:

      Unicode provides a unique number for every character, no matter what the platform, no matter what the program, no matter what the language.

      Encoding the PG texts in Unicode would require no extra effort on the part of the PG volunteers (well, those who have moved on from their TRS-80s, anyway).

      Why not use technology that attempts to accomodate the typographical traditions inherent in your source material rather than reducing that material to fit an obsolete technology?

      And even if you still cling to your belief in the infinite beauty, timelessness, and universality of ASCII, please stop using linefeeds every 70 characters within paragraphs. WTF do you Project Gutenbergers imagine we read these texts on TRS-80s?

    6. Re:The ONLY Universal EBook Format! by Anonymous Coward · · Score: 0

      Those could easily be done in plain text as well, with a standard way of representing them.
      The problem is, they accept submissions from a variety of people, and they do not all present chapter headings/indexes the same.

    7. Re:The ONLY Universal EBook Format! by Anonymous Coward · · Score: 0

      I stripped those linefeeds easily enough with my PG perl regex oneliner :)

    8. Re:The ONLY Universal EBook Format! by dvdeug · · Score: 1

      But now that the PG dream of preserving and distributing the printed word through digital technology has stagnated into a dogmatic cult with the goal of preserving ASCII

      Obviously spoken by someone who hasn't checked out PG in the last few years. When possible, PG keeps an ASCII copy, yes. But PG also has Japanese and Chinese texts with no ASCII copy. Even for mthose that PG offers an ASCII text (including for some reason French texts), many of them have Latin-1, UTF-8 and/or HTML copies in the same directory.

      Encoding the PG texts in Unicode would require no extra effort on the part of the PG volunteers

      No extra effort? I don't know what world you live in, but Unicode is not the simplest thing for everyone in the world to handle. Considering the almost non-existant technical savy of some of the volunteers at Distributed Proofreaders, using Unicode would be a serious pain to explain and would drive away many of our proofers.

      please stop using linefeeds every 70 characters within paragraphs. WTF do you Project Gutenbergers imagine we read these texts on TRS-80s?

      WTF do you imagine we read these texts on TRS-80s? I read mine in an xterm using less. IMO, there's no point in producing a plain text version without linefeeds like this; if you expect to wrap the text, you might as well go ahead and produce a simple HTML text.

    9. Re:The ONLY Universal EBook Format! by Anonymous Coward · · Score: 0

      Actually, original TRS-80's only had a 64 character wide display, so it would suck even for that. ;)

  12. but...but... by corian · · Score: 4, Insightful

    Different readers, different platforms, and different applications have different requirements!

    Some uses want a format which is compact as possible. Some focus on readibility (switchable fots, etc.) Others -- facimile-style releases -- emphasize that the copy should as closely mimic the original work as possible. Formats can emphasize the syntactic structure of the text (sentences, paragraphs), or the structural qualities (line breaks, pages).

    Even in their paper forms, books have different formats for different uses. Libraries prefer hardcovers, with durable bindings. Travlers prefer paperbacks, with small and light pages. Collectors pay extra for special editions, with quality supplies. Some readers prefer large-print copies, abridgements, or books on tape (in a choice of cassette tape or compact disc!)

    Any format makes assumptions, and deletions. It's perfectly fine to have a multiplicity of formats. If its useable, and reasonably priced, people will buy it.

    For me, the major hindrance to e-books is the price. Since there is no associated cost of the materials (paper/cardboard), printing, physical transportation, stocking space, and delivery, e-books should be [i]cheaper[/i] than physical books. But many of them are priced the same, or even high (you can check this at Amazon.) what's up with that?

  13. SVG by Per+Wigren · · Score: 2, Interesting

    Why not just use SVG?

    --
    My other account has a 3-digit UID.
    1. Re:SVG by Surak · · Score: 1

      Scalable Vector Graphics? *confused*

    2. Re:SVG by Anonymous Coward · · Score: 0

      The ultimate in bloat! Imagine; an XML document describing the vector points and bezier curves required to draw each and every character in an entire eBook! If nothing else, it would push the price of multi Tb drives down.

  14. What ebooks really need to take off... by TallEmu · · Score: 3, Interesting

    ... is paper. Seriously.

    The nice thing about a book is that it doesn't have a power switch - it's actually relaxing to sit there and read it.

    If it were possible to obtain a high speed printer capable of printing out "e-books" in the same form-factor as a normal book (ie double sided pages, standard size, neatly bound) then I for one would pay for *lots* more books (and paper, and ink.)

    1. Re:What ebooks really need to take off... by Lumpy · · Score: 2, Insightful

      The nice thing about a book is that it doesn't have a power switch - it's actually relaxing to sit there and read it.

      no the nice thing about a book is that It cant be taken away from you, you can lend it to a friend, you can sell it at a garage sale or trade it in at a used book store.. all of which the Writers Guild DESPERATELY want to stop you from doing.

      a paper book gives you a ton of freedom that publishers and writers are massively pissed off about and want to take away.

      This is the real benefit. I can bury my books in a locker so that they cant be stolen or burned... my e-books can have their "license" revoked at and second for any reason or self destruct.

      --
      Do not look at laser with remaining good eye.
    2. Re:What ebooks really need to take off... by Cro+Magnon · · Score: 1
      The nice thing about a book is that it doesn't have a power switch - it's actually relaxing to sit there and read it.

      no the nice thing about a book is that It cant be taken away from you, you can lend it to a friend, you can sell it at a garage sale or trade it in at a used book store.. all of which the Writers Guild DESPERATELY want to stop you from doing.

      a paper book gives you a ton of freedom that publishers and writers are massively pissed off about and want to take away.

      This is the real benefit. I can bury my books in a locker so that they cant be stolen or burned... my e-books can have their "license" revoked at and second for any reason or self destruct.


      You're both right! I can real my REAL books during a power-outage, AND I don't have to worry about some greedy SOB playing games with me. Once I buy the books, they're MINE, and I can do anything I want with the
      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
  15. one.doc by pubjames · · Score: 2, Insightful

    I'm still amazed that the whole of the business world is happy to accept MSWord .doc as the standard to store virtually all of their documentation. I don't think the film industry would be happy standardizing on .avi or the music industry on .wav, so why doesn't the business word get it's act together and accept a better format than the crappy .doc?

    Take a look at this - 1dok.org - an open document format

    1. Re:one.doc by Surak · · Score: 1

      It was kind of a 'meme'. Back in the early days when people were switching from DOS to Windows there were only really two decent word processors that ran on Windows -- 'Word for Windows' from Microsoft, and Ami. Ami was cool but was largely incompatible with everything that came before it. Word for Windows was able to read WordPerfect 4.x/5.x for DOS documents (which is what everyone previously standardized on in the DOS world) and would provide help for WordPerfect users, who of course were all lost in this new world of GUI.

      Microsoft played it smart and catered to the business world. Unfortunately, WordPerfect for Windows came too late (mostly due to the fact that Microsoft used the 'synergy' between their apps group and their OS group that's been mentioned before in court) and Microsoft won.

    2. Re:one.doc by yancy · · Score: 1
      I don't think the film industry would be happy standardizing on .avi or the music industry on .wav, so why doesn't the business word get it's act together and accept a better format than the crappy .doc?

      Because the consumables of the business world are not internal documents. Phone company people don't lose sleep over the format of project charters. They lose sleep over DSL lines, losing customers to upstart rivals, etc.

      Film industry people care about video file formats because video is their consumable product. They probably use .doc internally too. [shrug]

      Yancy

      --
      "My license to make fun of everyone comes from knowing I'm the biggest joke of all."
  16. How about WAP? by expro · · Score: 2, Interesting

    Is Project Gutenberg and a Palm Pilot.

    I would like to put up a server to serve up Gutenberg, etc. a page or so at a time for low-end WAP phones, with simple indexing and serching capabilities. The simpler cell-phone is what I really always have in-hand with good connectivity when I would like to read. Palm Pilots never seem to have enough storage to keep whole books or widespread connectivity.

    Ha anyone done this? It should be popular and not too resource-intensive.

    1. Re:How about WAP? by expro · · Score: 1

      Palm Pilots never seem to have enough storage to keep whole books

      I meant to keep lots of whole books.

    2. Re:How about WAP? by FeloniousPunk · · Score: 1

      The Palms are a bit low on storage but other PDAs can store a lot more. I've got two novels on my PPC at the moment, Darwin's Radio by Greg Bear and Dreamcatcher by Stephen King and they are 442kb and 639kb respectively. I could carry about 10 or so in the built-in storage without crowding out other data I need, but if I would need to carry more, I could always put them on an SD media storage chip or CF card. They're cheap these days; a 64 MB SD chip is about $30. So that's about 30 novels right there.

      --
      I know this because Tyler knows this.
    3. Re:How about WAP? by SgtChaireBourne · · Score: 2, Interesting

      Sounds like a cool idea. I guess you might be able to use XLST to transform to the different WAP flavors.

      --
      Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.
    4. Re:How about WAP? by b0bby · · Score: 1

      Palm Pilots never seem to have enough storage to keep whole books
      Really? Even the 2mb models can hold a bunch of books - right now I have Treasure Island (reading to my daughter) & 100 years of Solitude on my palm, the two together take up only 750k. The longest book I ever read on my Palm, Cryptonomicon, was 1.5mb; the paper version is 800+ pages. I've found that I'm always able to have an assortment of interesting books on hand. My first gen 512k Palm was a bit tight, but even that would hold shorter books or I'd just load a few chapters at a time. I think that reading books is the main reason I still use my Palm regularly, though I'd keep it just as an address book.

    5. Re:How about WAP? by RichardX · · Score: 1

      Palm Pilots never seem to have enough storage to keep [lots of] whole books or widespread connectivity.

      You wot?
      Granted, my clie has 16 Mb, but even a bunch of games and a small library of books I still have over 7 Mb free.

      I strongly recommend any fellow palm owners to check out the superb Weasel Reader which uses ZTXT format (basically a zipped plaintext).. it's open source, efficient, and an all round lovely app. The kind of application you could take home to meet grandma.

      --
      Curiosity was framed. Ignorance killed the cat.
    6. Re:How about WAP? by alanh · · Score: 1
      THe following is a list of Palm devices that have some form of memory expansion:

      • Every one ever made by Sony
      • Every current Palm except the cheapest Zire
      • Previous Palms: m125, m500, m505
      • Every Handspring non-phone model (visors and treo 90)
      • every Handera 330 and TRGPro
      • AlphaSmart Dana
      • Every Acer model
      • etc....

        (check out the PalmEvolutionary Tree. of the 67 models listed there, about 22 don't have some form of expansion....)

        How much space do you need for lots of books? These days, a 256 MB SD card is less than $80, and that will hold a heck of a lot of ebook content... (Moby Dick was less than 700K on my Palm....)
      --
      - AlanH
    7. Re:How about WAP? by Tetsujin28 · · Score: 1


      The WAP idea sounds neat, but I doubt I'd use it much. Right now I have several novels -- including ULYSSES -- on the SD card in my Zaurus.

      By the way, the Zaurus SL-5500 is a *great* device for reading books. Confortable, portable, expandable, with a bright, clear screen a little wider than a newspaper column.

      --
      - - - -
      The real Tetsujin 28 is a giant robot.
  17. eBooks and DRM by FinnishFlash · · Score: 3, Interesting
    DRM is a hot-potato, and rightfully so...
    DRM Capability: Although end-users prefer not to purchase ebooks protected with DRM (Digital Rights Management), publishers are certainly interested in the DRM capability of the universal ebook format. Thus, the universal ebook format must allow inclusion of DRM protection technologies as needed.

    Take 2 minutes and read this article from RMS

    Right to Read
    --
    please proff read !
    1. Re:eBooks and DRM by Anonymous Coward · · Score: 0

      The thing about DRM is that it needs to be more like the real world.

      In the real world, if you want to copy a book you can. But it is incredibly impractical to do so. (Ever try to copy 200 pages on a photocopier?) You can still copy a single page for 'fair use' or 'fair dealing' (Canadian) if you need it.

      If somebody can develop a DRM system that does not make it impossible to copy something, but makes it more difficult to copy the entire work, I believe it will be acceptable and effective.

      Limit the ability to blatantly steal the work, but don't prevent the fair uses allowable by law.

  18. PDFs and html by Anonymous Coward · · Score: 5, Insightful

    You have to be careful. Half of you are saying "I won't use this until e-books are as pleasant as paper books" and half of you are saying "why not use the standards that are already there? Just make the device do everything."

    Don't you see these are at odds?

    To make e-books as pleasant as real books, you're going to want to make them thinner and thinner in profile. You're going to want to make them run on a single lithium cell battery or AAA. You're going to want to drop all of the interface but the forward, back, and bookmarking buttons. You're going to want the computing device to be as close to nothing as possible, so you can put weight into making the device indestructible like a real book. You want to go to the store, buy the title, and have it just work, or go to Amazon and *know* your desired title is published in that format. That's the ideal, in the near term. It isn't a device that will easily accomodate PDFs and HTML and a number of other standards.

    1. Re:PDFs and html by tuxedo-steve · · Score: 1
      ... so you can put weight into making the device indestructible like a real book.
      These are the words of a man who has never owned a dog.
      --
      - SMJ - (It's not just a name: it's a bad aftertaste.)
    2. Re:PDFs and html by Pxtl · · Score: 1

      Well, the funny thing they're complaining about is things like the poor zooming in PDF: it sounds like they want it to be presentation flexible. Which is HTML.

      Personally, I'd take an old, easy to render HTML standard (Netscape 2 era) - just the basics. links, tables, frames, text stuff, images, nothing else. No javascript, no ASP, no CSS, no whatever. Support standard image types (jpeg, gif, png) and nothing else. For sanity's sake, do not support animated GIF.

      Change the name - call it BkML - say its a fork from HTML, even if its just an old version. This will prevent people from loading in higher-level HTML into the thing. HTML files are named by page number, and the system is hardcoded for next/prev through page numbers. Multiple "books" are stored in a directory structure.

      Thus, you have something that's easy to hand code (but you'd better use a TeX compiler or something similar anyways, as consistent syntax sans CSS will be tricky). Its low on processor stress, and it can be renderd natively by a web browser.

      Sure, different ebook readers will render it differently. As they should. EBooks should not be presentation-dependant. You can use tables to control most of the formatting. The rest is up to the user - after all, a large panel-sized e-book is not the same as a watch-book reader. The user should not have to read by panning around, or by squinting for tiny type. If the user wants teh screen to be 20 characters wide and scroll down a lot, that should be there right. HTML is designed for scalable width. Deal with it.

    3. Re:PDFs and html by Anonymous Coward · · Score: 0

      To make e-books as pleasant as real books, you're going to want to make them thinner and thinner in profile. You're going to want to make them run on a single lithium cell battery or AAA. You're going to want to drop all of the interface but the forward, back, and bookmarking buttons. You're going to want the computing device to be as close to nothing as possible, so you can put weight into making the device indestructible like a real book. You want to go to the store, buy the title, and have it just work, or go to Amazon and *know* your desired title is published in that format. That's the ideal, in the near term. It isn't a device that will easily accomodate PDFs and HTML and a number of other standards.



      You've hit many of my points on the ideal ebook device right there. I can also add that maybe I'd prefer to store the e-books I own on the computer (HDD, CD's, whatever) and load them up in the reader whenever I want to actually read them. Going on vacation? Load up 5 novels to have with me. Give me a reader like the above, plus an USB wire and a diskette-size software to transfer to/from the ebook, and I'm sold.

    4. Re:PDFs and html by dabadab · · Score: 1

      I think you are missing some things.

      1. Processing power and memory is cheap (in money,space and power requirements and) and it's getting cheaper

      2. It does not takes too much to understand one more format if that's not overcomplicated.

      however

      3. It takes considerable processing power and memory to _render_ the characters, esp. if you want them to be rendered nicely (non-fixed length, antialiased and a lot of attributes that has silly names that only typographers understand ;) )

      --
      Real life is overrated.
    5. Re:PDFs and html by Fweeky · · Score: 1

      Uhm, CSS isn't exactly taxing and already has support for multiple media types, including handhelds. XML's easier to parse than bastardised SGML, more difficult to screw up, and able to encode much richer descriptions of whatever you encode in it. Why plan to make the same mistakes HTML made by bloating it with crap and the like?

      A custom XML doctype and basic CSS would work fine, and could even be rendered easily in current browsers, as well as handhelds. It compresses easily, degrades gracefully if you do it right, can be restyled to suit any environment you care to mention; including braile, audio, desktop and handheld, and can be converted to any other format you like with XSLT or $your_favourite_language.

      The metadata you can store using it can also be used to create eBook libraries and the like (similar to Music libraries in some current media players). An XML format could be perfect for users.

      It won't happen though. Industry's way too greedy to make it easy to read and copy stuff like this. If we do find them using an XML format, it'll be buried deep in some encrypted DRM enabled binary format containing the string 'Microsoft' and protected by DMCA 3 which will make it a capital crime to read it without a fully licensed reader and copy of the eBook, which will self-destruct immediately after you close the reader.

      *grumble*

    6. Re:PDFs and html by TheRaven64 · · Score: 1
      To make e-books as pleasant as real books, you're going to want to make them thinner and thinner in profile.

      Or print them...

      --
      I am TheRaven on Soylent News
  19. Exactly by YokuYakuYoukai · · Score: 2, Interesting

    The lack of an ebook standard has thusfar kept me from buying any hardware ebook reader. I would be happy to shell out the cash for one if i knew i could use it with all the books out there.

    1. Re:Exactly by lercio · · Score: 1

      True, that's why I use CSpotRun on my Palm. It's free (and good).

  20. Honest question... by mtrupe · · Score: 1

    Why not HTML?

    1. Re:Honest question... by zakezuke · · Score: 1

      Because that would be too practial, and would resolve issues about bold, italics, or underlining. Not to speak of the fact that illistrations can be imbeaded if nessicary, or ignored if the hardware doesn't support it.

      But I'm with you, provided that html tags are standardized for e-books. Keep them at a bear minium so lamer display devices and less of a chance of a really garish e-book comming out. Basic HTML makes alot of sence.

      Problem being is I think people want who want to sell e-books don't want their content to be pirated. Fair enough concern, and any universal standard in e-books would make it easier to pirate an e-book.

      --
      There is no sanctuary. There is no sanctuary. SHUT UP! There is no shut up. There is no shut up.
    2. Re:Honest question... by RichardX · · Score: 1

      Okay, before modding this as redundant, I know I already linked to it above, but-...

      Everyone seems to be in agreement that plaintext is a good thing on certain counts, mainly that it's a very simple standard, readable on just about anything, isn't dependant on the whims of any commercial entities, etc.

      And likewise, most seem to agree that the downside is lack of formatting, and so on.

      The suggestion above about using an old version of HTML makes a lot of sense to me - after all, who needs (or wants) animated gifs, cascading stylesheets, and popup adverts in an ebook?

      Could I suggest that someone with better programming skills than me take a look at WeaselReader for the Palm, a very nice open source ebook reader that essentially uses zipped .txt files (called ZTXT).. and consider making a similar reader for ZHTML? (perhaps this should be suggested to the weaselreader coder(s))

      Surely it makes sense?.. Plaintext covers the majority of people's needs.. adding just the options for font size, bold, italic, underline, l/r/c/full justification, images, tables, and probably bulletted lists... well, that would probably keep just about everyone happy, and it's really not a very big or demanding feature set.

      Links would be nice too, mainly for "anchor"* links (y'know, the # ones that link to a different part of the same html file) for
      chapters etc. (dunno the correct term, I'm no HTML guru)

      Anyways, it seems to me this would have all the benefits of plaintext, being open, simple, and useable on just about any system..

      Would retain the most essential formatting.. likely there would be some things not covered by this spec, you'll always get that, but I'd guess this would catch almost everything..

      Efficient. Yes, I know ZIP isn't as wonderful as RAR, etc, but it's a lot simpler, and can be used in many neat ways.. again I refer you to weaselreader, and the nifty way in which it doesn't need to uncompress the whole book at once. Also with the zipped thing you can have a SINGLE file.. very important. You don't want to have to distribute the text and a bunch of image files seperately.. that gets very messy.

      Just my 2p's worth, anyways

      --
      Curiosity was framed. Ignorance killed the cat.
    3. Re:Honest question... by TheRaven64 · · Score: 1
      Why not HTML?

      A number of reasons:

      1. HTML has lousy layout ability. Really.
      2. HTML does not properly support page-based documents, such as books.
      3. HTML layout has to be computed at run-time, which is hugely inefficient. A system like TeX, converting to PDF will make sure that all of your characters are nicely spaced at compilation time, so that you know exactly how the book will look to the end user. HTML looks different under any browser / window size / installed font set etc.
      There are probably other reasons too. My view would be the PDF with a fixed page size (8.5" x 11" seems popular amongst publishers) would be a better choice. That way any ebook reader could either have an 8.5" x 11" screen, or could have a pre-defined zoom setting, which would let people view a portion of the page at a time.

      Basically, the question is; would you rather that your eBook was typeset by a publisher, or a couple of megs of software (your browser)? Which do you think is going to be more readable?

      --
      I am TheRaven on Soylent News
  21. There's a lot to be said for plain text by gidds · · Score: 4, Insightful
    • It's universal. Everything supports it, from PDAs to supercomputers.
    • It's versatile. If properly formatted, it's reflowable on different screen sizes, fonts, layouts, &c. And it's perfect for other access methods.
    • It supports most characters you'd find in books. The de facto standard is the Windows Latin-1 encoding, which has all the punctuation as well as accented characters. (Yes, I know, I know. But it's not just on Windows -- both my Mac and my Psion use it, for example.)
    • It's editable. There are tons of tools already available, from spell-checkers in editors to complex analysis. I've written some of my own, for instance; one converts from American to British spelling, which is how I like to read my books.
    • It has conventions for /italics/, *bold*, _underlining_, &c. Yes, at first, these may look clumsy, but I actually prefer them in many ways, as they're more precise; for example, you can differentiate between *word* *by* *word* and *all at once* highlighting (see the Jargon File for the difference).
    • It's compact. Plain text files are smaller than HTML, PDF, RTF &c, sometimes by a lot; and when compressed in formats like PalmDOC (pdb) or TCR, they can be made even smaller and still usable directly.
    • It's future-proof. Plain text has been around for decades, and will be with us for many more, long after DRM keys have been lost and proprietary apps have died.

    Yes, of course some spiffy new format will have other advantages. But it's unlikely to gain quick acceptance. Plain text documents are everywhere, as are readers and other software. There are even online publishers selling text files. In fact, ASCII text is arguably the most successful electronic standard there is!

    --

    Ceterum censeo subscriptionem esse delendam.

    1. Re:There's a lot to be said for plain text by MrNemesis · · Score: 1

      For the most part I'd agree with you, but there's an awful lot of book that use (amongst other non-ASCII things) images.

      It's for things like this why I think the format should either be based on a PDF-ish format, or a halfway house with an XML based layout. But IMO plain old plain text is not a complete solution.

      --
      Moderation Total: -1 Troll, +3 Goat
    2. Re:There's a lot to be said for plain text by dekashizl · · Score: 1

      Of all the "just use text" posts, this is probably one of the best so far.

      *BUT* (you like my boldface?) this "just use text" concept is overly-simplistic and misses the point that books are more than text.

      It's kind of like the people in the mid 1990s who preached how great GOPHER was and that we don't need HTML/HTTP because gopher a) client runs on a terminal, b) it doesn't require a lot of memory, c) it is text-based so it compresses well, d) there are already great open source servers for it, etc...

      You can make all kinds of great arguments for why we should use a limiting technology, but the fact remains that it doesn't meet the needs of the problem it's trying to solve. So if it *is* a good solution, it's a good solution for someone else's problem, not for the format of eBooks.

    3. Re:There's a lot to be said for plain text by dabadab · · Score: 1

      "The de facto standard is the Windows Latin-1 encoding"

      There is no "Windows Latin-1" encoding, there's a Windows encoding - I think it's called codepage 1252 - that closly resembles ISO-8859-1 (or Latin-1). But, for me, it's not an option, since I live in Latin-2 land and there are people that use cyrillic, arabic, far-eastern or other alphabets, so your proposed solution would work only for America and the western parts of Europe.

      Plain text, however, usually means ASCII (as it is in the Gutenberg-project), in which case the only properly supported language is English.

      "It's compact. Plain text files are smaller than HTML, PDF, RTF &c, sometimes by a lot"

      A properly made HTML isn't any bigger than a plain text file (basic HTML tags are 3-5 chars long and you don't have to use them that often) but it has structure, has support for non-ASCII characters and even for pictures.

      --
      Real life is overrated.
    4. Re:There's a lot to be said for plain text by gidds · · Score: 1
      You'd be surprised how few mainstream books have images. (And even fewer rely on them; most can get by with a note such as '[runes]'.) I've many tens of MB of books and short stories, and only a handful needed images - mostly maps.

      And even in those cases where you'd want embedded images, a PDF-like format is not the way to go. It may look great on the right size of screen, and work wonderfully for printing (which was its aim), but as a general book format it's hamstrung by the fixed layout. Yes, there are PDF viewers even for handhelds (I've one on my Psion), but trying to read a book with one is an exercise in pain and eyestrain. Far better to use a format with soft layout, one which can reflow the text to fit the screen. An XML-like format is probably much better for this.

      As you say, plain text clearly isn't a complete format. However, IME it's complete enough for the vast majority of cases, and it has so many other benefits.

      While I'm on the subject, a mini-rant about HTML/XML: while it's great that newlines &c aren't significant, I find it extremely annoying that spaces aren't either (apart from separating words). Good typography has sentences separated by two spaces; in HTML this needs either a &nbsp; after every sentence, or a non-space blank character (which is what I do here, as Slashdot seems to strip out most & entitites...). Another problem that plain text doesn't have :)

      --

      Ceterum censeo subscriptionem esse delendam.

    5. Re:There's a lot to be said for plain text by gidds · · Score: 1
      There is no "Windows Latin-1" encoding, there's a Windows encoding - I think it's called codepage 1252 - that closely resembles ISO-8859-1 (or Latin-1)

      There are many Windows character encodings. The one I'm referring to is called 'Windows Latin-1' as well as 'CP1252'. It's effectively a superset of ISO Latin-1 (ISO 8859-1); the two are identical in the plain ASCII range 0-127, and also in the range 160-255 with accented characters &c. The only difference is in 128-159, where ISO Latin-1 has only control characters (effectively unused), and Windows Latin-1 has some useful punctuation (smart quotes, dashes, &c).

      It may have originated with Windows, and so leave a nasty taste in the mouth, but it's a well-defined standard, and it's useful; I'm surprised that the 8-bit Unicode range was based on ISO Latin-1 rather than Windows Latin-1.

      Ideally, of course, we'd all be using Unicode (which has all the Windows Latin-1 characters up in the 0x2000 'General Punctuation' range) - a neat solution to all the problems of different 8-bit encodings. But until that's widely supported, I'm sticking with Windows Latin-1.

      --

      Ceterum censeo subscriptionem esse delendam.

    6. Re:There's a lot to be said for plain text by gidds · · Score: 1
      Your use of the term `limiting technology' is interesting. I find it depends upon your PoV.

      Yes, plain text does limit what you can represent. (Though as I said above, I find that practically all of what I want fits within that limit.) However, a more complex format is also limiting: it limits what you can do with the file. You may gain the ability to show italics directly, but you lose the ability to knock up simple Perl scripts; you may gain embedded images, but you lose compatibility with handhelds; you may gain coloured text, but you lose the use of your standard editors and tools. And so on. Aren't those limits too?

      Yes, it would be great if a more powerful format was in use everywhere. But until then, I'll stick with a format I can use.

      --

      Ceterum censeo subscriptionem esse delendam.

    7. Re:There's a lot to be said for plain text by rdhyee · · Score: 1

      By plain text, do we want to mean unicode? One problem with ASCII is it's terrible for languages with non-Western script.

    8. Re:There's a lot to be said for plain text by gmhowell · · Score: 1

      Add one more to your list: deaf blind people can load them on one of these. It's a device to let deaf-blind people read 'portable books'. Unfortunately, it only takes plain text input. Limiting you to Gutenberg and bookwarez.

      --
      Jesus was all right but his disciples were thick and ordinary. -John Lennon
    9. Re:There's a lot to be said for plain text by gidds · · Score: 1
      That's exactly what I meant by `other access methods'.

      And there's a lot of bookwares out there if you know where to look. (Mainly Gnutella, though if anyone knows any other sources please let us know!) Plus quite a bit of legal stuff you can buy at places like Fictionwise. (They let you download in many formats; plain text isn't one, but it's trivial to convert PalmDOC to plain text.)

      --

      Ceterum censeo subscriptionem esse delendam.

    10. Re:There's a lot to be said for plain text by gmhowell · · Score: 1

      That's exactly what I meant by `other access methods'.

      My bad.

      And there's a lot of bookwares out there if you know where to look.

      irc.bookwarez.net

      (Or .com, I can't remember. It's bookmarked in mIRC)

      Sad thing is, I'm happy to spend money on the books. I tried contacting Stephen King (wife's favorite author) through website looking for braille books for purchase. No reply. Screw 'em. I'll buy the hardcopy for me, and assume that I'm allowed, via fair use, to get a txt copy online.

      --
      Jesus was all right but his disciples were thick and ordinary. -John Lennon
    11. Re:There's a lot to be said for plain text by Anonymous Coward · · Score: 0

      So you can easily run plain-text through a perl script. Big deal. I'll happily eat the limititations if it means I can have proper formatting, different fonts, images, etc.

      Don't get me wrong, I would like to see an open XML-based format as well, but until it comes around I'll stick with a format that gives me all the nicities of dead-tree on my screen. I don't see any point in going back to the stone-age just to have some added 'usability', particularly not when even screwy proprietary formats like PDF can still be converted to text with widely-available tools, and then 'used' to your heart's content. Guess what? You can't convert the other way.. more information = better in this case.

    12. Re:There's a lot to be said for plain text by 5KVGhost · · Score: 1

      Plain text is fine for, well, plain text. And it works pretty well as a least-common-denominator format for exchanging information between otherwise incompatible systems. But there's no way the industry or the public will be happy with the limitations it imposes on how their electronic books are presented.

      For one thing, straight ASCII has no capability to preserve the presentation of the original work, nor the flexibility to allow new electronic works to share the same richness that thousands of years of paper publishing have given us. These things do matter.

      Nor does plain text allow for proper handling of standard elements like footnotes, equations, diagrams, images, or even (aside from ugly hacks) simple text formatting. Why would we even consider going backwards?

      ASCII is popular because that's what Project Gutenberg has insisted on, but, frankly, I don't consider the texts you can download to Project Gutenberg to be accurate representations of the original work. The grunt work is done, but the details have been sacrificed for expediency, and details are important too.

      However, I really don't think we need to waste time specifying a fancy new format. Either PDF or nice clean HTML with appropriate layout and style sheets will probably work just fine.

  22. Inertia by Accidental+Hack · · Score: 1

    It's all about the killer app -- that's not out there -- to break the inertia of the market. Many of the responses so far have indicated little willingness to give up the print ... and why should they? Why should I? I work for a company on ebook related projects and even I just don't read ebooks. I prefer print and I have no reason to change.

    Maybe it's the target market that's the problem. Maybe mass-market consumers are the wrong people to convert first. Maybe it has to be the school/library market or the business market first. Wish I had some answers!

  23. MOD PARENT UP by ContemporaryInsanity · · Score: 1

    There's a hell of a lot to be said for simplicity.

  24. ebook expert? by tomstdenis · · Score: 1, Funny

    Which correspondence college offers that and how much does it cost?

    Ebook expert....

    Yo /. I'm a Desktop Folder Manangement Professional!

    I'm also a MSc in Network Ping Techniques. I can ping with one hand tied behind my back. My Masters thesis was whether gnip would work equally as well as a ping program. Turns out not. Stupid Command not found.

    Blah. Karma Killaz!

    --
    Someday, I'll have a real sig.
  25. Ogg, used as en encapsulating format, allows you to put ANYTHING (Divx,SVCD,MP3,MP4,WhatTheHell) and have it used as an ogg file.

    That's how men Fansub groups makes releases including a 4 subtitles choice with a nice XVID compressed video stream.

    Now, I don't say AAC don't do that. Just thatOgg is quite a universal standard.

    Also, the point is CHOICE. I want to choose the format I want and play it wherever I want, not Wherever I can..

    --
    It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
  26. Umm... SGML by HighOrbit · · Score: 1

    I thought SGML solved this problem years ago.. (except for the copyright-driven copy-protection schemes that seem to be in vogue with the profit-hounds) However, I don't think copy-protection is absolutely necessary. Publishers have made billions selling regular old paper books for years with no "copy-protection". Even the advent of easy copying with XeroX machines didn't kill the profit. What makes the same content in a new medium suddenly worthy of copy-protection ?

    1. Re:Umm... SGML by rpresser · · Score: 1

      I thought SGML solved this problem years ago..

      That is like saying that transistors solved the problem of radio transmission formats decades ago. SGML is far too flexible and powerful; I'm not even aware of any software that can render an arbitrary SGML document. (I'm sure such exists; please inform me. However, I'll bet that it is enormous and cumbersome to install and maintain.)

    2. Re:Umm... SGML by Brad2021hk · · Score: 1

      Have you ever tried to make a photo-copy of an entire book? I haven't. Because I know that it sucks before I start. Let's see, spend all day making copies of a book, page-by-page. At $0.10 a page. Then accidently mix up some of the pages. Or I can go to the book store and pay $25. Hmmm. Option #2 please. Now with an electronic eBook, I could just click onto some FTP site and download in 15 seconds. Not charge my credit card $25. Good for me. Bad for the publisher, which is bad for me later.

    3. Re:Umm... SGML by Anonymous Coward · · Score: 0
      What makes the same content in a new medium suddenly worthy of copy-protection ?

      Oh, I dunno.... zero cost bit copying maybe?

  27. I can see it now. This will be... by Dolemite_the_Wiz · · Score: 1

    ...Latex and Ghostscript revisited.

    Can we keep inventing more readers for specific uses?

    What's next? A new text reader for Man files that can only be read by one single reader and is hard to port to different text formats?

    Dolemite
    ______________

    --
    Save the World! Use a Quote!
  28. My proposed format by Alsee · · Score: 5, Funny

    I have developed a higly sophisticated format for storing books in a computer file.

    Each character of the book is to be enciphered to a byte. I reserve the first 32 codes (0-31) for various system function characters. The next 32 codes (32-63) encipher the space character, various punctuation marks, and numerals. The next 32 codes encipher the capital alphabet and a few more punctuation characters. With the simple use of 00111111 binary mask 'A' maps to 1, 'B' maps to 2, and 'Z' maps to 26. Quite clever if I say so myself! Naturally the next 32 codes encipher the lowercase letters in the same manner. Using the very same 00111111 bitmask you find 'a' mas to 1, 'b' maps to 2, and 'z' maps to 26! Ingenious, isn't it?

    To ensure compatibility with legacy computer systems values above 127 shall not be used.

    I call this encoding Advanced Storage Cypherment Input Ideal - or A.S.C.I.I. Any file utilizing this encipherment is a Tagged eXchange Template. These files may be identified by the use of a .TXT extention.

    -

    --
    - - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
    1. Re:My proposed format by kubrick · · Score: 2, Funny

      Quick! Rush to the patent office, or Amazon might beat you to it...

      --
      deus does not exist but if he does
    2. Re:My proposed format by RichardX · · Score: 1

      I reserve the first 32 codes (0-31) for various system function characters

      Heyheyhey!

      Wait a minute...

      Do we really need all 32 of those for ebook texts?.. Sure, I realise some like carriage return are important, but I can't even think what most of the others could be.. surely we could find at least 3 redundant ones to replace with bold toggle, italic toggle, and underline toggle?

      Yes, I realise this is absolute heresy, and I must be burned at the stake for my sins.. but.. it's a thought, isn't it?.. create a similar-but-different eBook oriented offshoot of ASCII..?

      --
      Curiosity was framed. Ignorance killed the cat.
    3. Re:My proposed format by yerricde · · Score: 1

      Yes, I realise this is absolute heresy, and I must be burned at the stake for my sins.. but.. it's a thought, isn't it?.. create a similar-but-different eBook oriented offshoot of ASCII..?

      Actually, there is. It's called Wiki markup.

      --
      Will I retire or break 10K?
    4. Re:My proposed format by don.g · · Score: 1

      That's terribly inefficient, though. I propose an alternate format, with five bits per symbol. It can have "shift" symbols to switch to character sets with upper/lowercase characters, numeric digits, etc.

      I name this amazing creation Binary Allocation Utilised to Define Optimal Text - Baudot.

      --
      Pretend that something especially witty is here. Thanks.
  29. images? by Extrymas · · Score: 1

    And how about images?.. Do you suggest ascii graphics ? ;)

  30. Can you say TeX & PS? by SkewlD00d · · Score: 2, Informative

    Everyone in academia uses LaTeX and PostScript, since PDF is silly and HTML doesn't have layout features.

    --
    The biggest trick the devil pulled was letting lawyers become politicians so they can write the laws.
    1. Re:Can you say TeX & PS? by bcrowell · · Score: 1
      since PDF is silly
      How about making a serious argument? What's wrong with PDF? Actually PDF is simply an improved version of PS. It's better because it's device-independent (the "P" in PDF is for "portable), and it's better because it's Turing-incomplete.

      Everyone in academia uses LaTeX and PostScript
      Care to document this?

      LaTeX is quite common in math, physics, and computer science. It is not at all common in most academic fields.

      Assuming you really meant math, physics, and computer science, it's not true that PS is more common in those fields than PDF. For instance, you can submit papers to the arxiv.org preprint server in LaTeX, and they'll then automatically generate both PDF and PS for people who want to read your paper.

      BTW, I hope you're not under the impression that it's impossible to make PDF files without proprietary software. Here's a counterexample, done with LaTeX.

    2. Re:Can you say TeX & PS? by TheRaven64 · · Score: 1
      since PDF is silly

      Silly in what way? When I create a document for printing, then it's ps, but for online reading, PDF is far superior. PDF has support for bookmarks (that nice navigation tree you see in the left of Acrobat Reader), hyperlinks (both to other parts of the document and to other (external) documents. With the dvipdf utility and the hyperref package bookmarks are generated from your table of contents entrys, making a PDF far easier to read on screen than a PS.

      --
      I am TheRaven on Soylent News
  31. Just as long as it's encrypted to prevent piracy by Anonymous Coward · · Score: 1, Funny

    I nominate ROT-13!

  32. Stating the obvious by dnoyeb · · Score: 1

    How long did it take the music industry to realize this would not work with CDs? not long at all, but they do seem to want to undo this universal compatability. But people get greedy and look for ways to force it to work. It will never work.

    And yes, html is more than enough. This is a book, not a website. Its about reading words, nothing more nothing less. if you start up with the pictures and sound, people will ignore you in favor of a movie or TV...

    I bet someone will propose flash :D

    1. Re:Stating the obvious by RichardX · · Score: 1

      I bet someone will propose flash :D

      Don't be ridiculous! Who the hell needs scripted rollover buttons and that kind of nonsense in an ebook?.. there's absolutely nothing wrong with uncompressed AVI....

      It'd make the "scanning" process a lot easier, too.. just videotape someone flicking through the book real quick, and play back frame by frame...

      --
      Curiosity was framed. Ignorance killed the cat.
  33. Obvious by noselasd · · Score: 1

    Docbook PLEASE.
    How the heck it's rendered I dont care. But docbook must be the obvious choice ?

    1. Re:Obvious by clonebarkins · · Score: 1
      How the heck it's rendered I dont care. But docbook must be the obvious choice ?

      docbook is good for technical documentation (hence the "doc" part of the name), but not for general purpose use. Believe it or not, there are a lot of publishers out there who do not publish computer manuals.

      --

      "The evil of the world is made possible by nothing but the sanction you give it." -- Ayn Rand

    2. Re:Obvious by noselasd · · Score: 1

      And WHY is it not suitable for other types of books. ? It looks fine to me for most types.

    3. Re:Obvious by TheRaven64 · · Score: 1

      I've looked at Docbook, and it seems a lot more computer-friendly than human-friendly. HTML, SGML and XML are not easy to type in, and interrupt the flow of thought. LaTeX, on the other hand is far easier to enter, so I think I'll stick with LaTeX for now.

      --
      I am TheRaven on Soylent News
  34. eBook != replacement of conventional books by splateagle · · Score: 1

    This is ever so slightly off-topic, but why is it that whenever eBooks are mentioned, there's a clamor of people shouting "the paper based book is better becase x y z"?

    The arival of almost every other new media since the invention of the printing press, has been heralded as marking the end of the printed word. This hasn't happened in the past and I expect the same will be true of the eBook when it matures.

    Historically new media have complimented rather than replaced existing ones. eBooks and Monograph literature both have strengths and weaknesses, and there's plenty of room for both to co-exist.

    Just to bring myself back on topic a little, professionally speaking (as a librarian) it would be helpful if the eBook industry were standardised to a single open format. I expect it's more likely that we'll see progressive waves of competing formats develop as the technology improves. Perhaps the Open eBook initiative could better expend its energies ensuring that all eBook formats allow for data to be exported and reformatted in some way? so that materials aren't lost as formats become obsolete...

    1. Re:eBook != replacement of conventional books by TheRaven64 · · Score: 1
      Whenever I buy a textbook it is for two reasons:
      1. To read, cover to cover (maybe skipping some bits) to give myself a gounding in the subject.
      2. To use as a reference.
      A paper book is ideal for the first, an eBook is ideal for the second (due to the search feature). If every paper textbook I bought came with a pdf copy, then I would be a happy person. Actually I am a happy person most of the time anyway, but I would be a more happy person. Although not that much more happy. It's not like I lie awake at night thinking 'Oh, I wish I had a PDF copy of this book'...
      --
      I am TheRaven on Soylent News
  35. why even have eBooks? by thomasmd · · Score: 1

    Can somebody tell me why eBooks are better than audio content? What can possibly be done with electronic text, that cannot be accomplished through audio content? I can understand that audio books are much more expensive to produce, but surely we are nearing the point where synthetic computer voices can "read" the original text, instead of having to employ human voice actors. That being the case, what's the use?

    1. Re:why even have eBooks? by clonebarkins · · Score: 1
      Can somebody tell me why eBooks are better than audio content? What can possibly be done with electronic text, that cannot be accomplished through audio content? I can understand that audio books are much more expensive to produce, but surely we are nearing the point where synthetic computer voices can "read" the original text, instead of having to employ human voice actors. That being the case, what's the use?

      Well, for one thing, so that a person can read the book for themselves (but that's too obvious so I won't fault you for not thinking of it).

      I listen to audio books all the time and I enjoy them. But in every case, where I have both read the book and listened to the audio book, I have been disappointed with how the reader has interpreted certain portions of it (through voice inflection, pauses, etc.). This is even the case when the author is the one reading. I'm guessing it's not any better, and possibly (probably) worse, with synthesized readers.

      It's the same thing as saying "If there's a movie, why read the book?" It's because some (actually, many) people actually enjoy reading.

      Furthermore, how do you think text-to-speech programs work? YOU HAVE TO HAVE THE TEXT IN AN ELECTRONIC FORMAT FIRST! If you have a standard, open format for reading ebooks, then it's much easier for people to make all sorts of other derivitive formats, such as for reading on a desktop, reading on a palmtop, or even converting to audio. An open format creates possibilities, not restricts them.

      As for how synthetic voices sound, there are some decent programs out there, but none of them are cheap. It's still a cost-benefit analysis -- yes, hiring someone to read (especially somebody good) is probably more expensive than getting the software, but you get better quality.

      --

      "The evil of the world is made possible by nothing but the sanction you give it." -- Ayn Rand

    2. Re:why even have eBooks? by neglige · · Score: 1

      why eBooks are better than audio content?

      Currently? Size. An audio book will fit on a CD, an ebook will fit on a floppy disk. And then there are people that understand content better by reading it instead of hearing it (visual type - of course there is also an audio type, for them audio book would be better).

      we are nearing the point where synthetic computer voices can "read" the original text

      I doubt that will happen in the next few years. Synthetic voices still have a long way to go, especially when you take into account that you listen to a "synthetic audio book" over a longer period of time. Small glitches in the pronounciation and the voice melody are in this case more obvious (and more annoying) than when the voice announces the time or reading an email. But I agree, it could happen over time

      And in the end there are always people who prefer a "real book". I admit I've been one of these, but the advantages of well produced ebooks (read it in the dark with the backlight on, a PDA is quickly stored in your pocket as opposed to some books, keep several titles on your PDA, save room on the shelf) made me change my mind.

      --
      My cats ate my karma. They also wrote this comment.
    3. Re:why even have eBooks? by RichardX · · Score: 1

      Can somebody tell me why eBooks are better than audio content?

      Well, personally, I enjoy things like the BBC's audiobooks.. but I enjoy them alongside the text versions of the same work, not as a replacement to. For a start, I tend to find them extremely slow - I read a hell of a lot faster than most people seem to talk, at least when they're reading books out loud.

      Also, it's a similar thing to the book/movie difference. With a book you visualise people/places, etc.. you also build a mental image of characters and how they talk, etc.

      As for text-to-speech.. as yet the only character I've found that those can do convincingly is Marvin from the Hitchhiker's Guide

      --
      Curiosity was framed. Ignorance killed the cat.
  36. www.wapnovel.com by evilandi · · Score: 3, Interesting

    Expro: I would like to put up a server to serve up Gutenberg, etc. a page or so at a time for low-end WAP phones

    I got bored last Christmas and did this.

    www.wapnovel.com (WAP or desktop)

    There's also an as yet unused discussion group at:

    http://groups.yahoo.com/group/wapnovel

    --
    Andrew Oakley - www.aoakley.com
  37. Ok let's get this straight by Unleashd · · Score: 1

    You need a little correction on your tower of babel story .... The story came about in Genesis when all people spoke the same language. The people had decided to build a temple to heaven (as in a temple that reached al the way to heaven). They were trying to bypass the way that God had set for them to enter heaven. They were trying to "become like/equal to God". God knew that with one language they could accomplish anything so he decided to confuse their languages. At that point God caused people to speak different languages and the construction on the temple was abandoned because people couldn't work together. God did this because he had a better plan for mankind ... not because of jealousy.

    As for the babelfish I can't comment as I haven't read "The Hitchikers Guide to the galaxy" but it sounds like a reference back to the confusion of the languages

    --
    We don't need no stinking sig!
    1. Re:Ok let's get this straight by aldousd666 · · Score: 1
      Well, God's intentions aside, I think I had the story straight, except, I said 'tower to god' meaning 'tower to reach god.' I was unlcear. I also said Nebekennezur instead of Nimrod the Hunter. In some movie (overdramatized I'm sure) I saw Nimrod even shot an arrow at the clouds. These is, I might add still enourmous historical controversy over whether or not the story happened at all, but if your a Bible fan (as I am even though I don't beleive any of it comes from any kind of god) you beleive what you want.

      The Douglas Adams story is undoubtedly poking fun at the bible story.

      --
      Speak for yourself.
  38. Suite of formats by SgtChaireBourne · · Score: 2, Interesting
    What plethora of formats? Everyone knows there's only the Word *.doc format!

    That needs to be modded Funny.

    By my reckoning, MS-Word has had more than 15 different formats in 9 years. I gave up MS-products for Lent a few years ago, but back in the day when my new laptop arrived with MS-Word95 (or whatever it was called), I had to go find MS-Word 6 and resave manually every last word document + metadata in RTF format in order to be able to read them in the new program.

    Too bad the data format is tied into specific applications. This is an old archival issue that is fortunately being dealt with by establishing open file formats and cross-platform applications (staroffice, openoffice, wordperfect, abiword).

    HTML caused the WWW, it will be interesting to see what happens with file formats for productivity suites.

    --
    Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.
    1. Re:Suite of formats by jafuser · · Score: 1

      HTML succeeded because it was so simple that every living thing with opposable thumbs could figure it out. Anyone could learn just from viewing the source of already existing content. The format is very forgiving if you make mistakes.

      People can handle a mostly-text format with occasional escape codes to make something bigger, in italics, or to start a new paragraph.

      However, I don't see anything like this happening with productivity suite file formats. Most of the data used to describe the documents created in these suites are too complicated for the average techno-neophyte to figure out.

      I expect that few-to-no other protocols will ever come along which will have as simple of a learning curve (and therefore as much success) as HTML.

      --
      Please consider making an automatic monthly recurring donation to the EFF
    2. Re:Suite of formats by zog+karndon · · Score: 1

      Or, I suppose, you could have installed the Word 6 reader that comes with Word 95 (and all subsequent versions). In fact, if I recall correctly, the Word 6 reader comes with every Microsoft product that edits formatted text (works, publisher, etc.).

      Nah. God forbid Linux users should actually learn something.

    3. Re:Suite of formats by SgtChaireBourne · · Score: 1
      It's bad form to feed the trolls, but manual, three step process fo opening and converting files with a reader or with a previous verion of the full blown editor sucks. There is no substitute for direct access to the file. Fortunately, I had under 200 files to deal with.

      Word 6 filters came months after Word 95. Other filters came late as well and I knew two that wrote their own for a much earlier version. Neither Word 6 for Windows nor Word 95 (subsequent versions) were an improvement over Word for Windows 2c. On the Macintosh, Word 6 was so bad that students were continuously re-installing Word 5. Eventually we had to work out a deal to keep Word 5 in the labs.

      But that's gripes about bad design and bad programs, the heart of the issue is file format. The hassles could have been avoided by keeping the same file format.

      --
      Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.
  39. Audio is the SLOW alternative by ArsSineArtificio · · Score: 1

    Can somebody tell me why eBooks are better than audio content? What can possibly be done with electronic text, that cannot be accomplished through audio content?

    I can read a page of a standard paperback book in about 30 seconds for fiction, or between 45 sec. and a minute for non-fiction.

    Having a voice read that to me instead would be slow and tiresome.

    ASA

    --
    All employees must wash hands before seeking equitable relief.
  40. A well-known ebook industry expert, Jon Noring by Digital11 · · Score: 1

    A well-known ebook industry expert, Jon Noring
    That sounds a little funky to me... If someone is a well known industry expert, would they really need the phrase "well-known industry expert" before their name?

    --
    I am a leaf on the wind. Watch how I soar.
    1. Re:A well-known ebook industry expert, Jon Noring by Anonymous Coward · · Score: 0

      This means that he is well known in the e-book industry, but that you (the reader) not being in the e-book industry, might not have heard of him, so they're telling you who he is, just in case you haven't heard of him, being that you don't work in the e-book industry.

  41. Its pre-chosen by t_allardyce · · Score: 2, Interesting

    Face it, the only reason that HTML/XML/LaTeX or whatever simple suitable format hasnt been chosen as standard (even though its so bloody obvious that it should) is because they dont have DRM.

    DRM Capability: Although end-users prefer not to purchase ebooks protected with DRM (Digital Rights Management), publishers are certainly interested in the DRM capability of the universal ebook format. Thus, the universal ebook format must allow inclusion of DRM protection technologies as needed.

    Its obvious that the author of that article has no idea about DRM, any slashdotter will tell them that DRM is pointless, if you can read it you can copy it, if one copy gets made a million get made etc.

    Even if the publishing companies decide otherwise, everyone else will probably just rip books into HTML or something. Im sure most companies such as Microsoft or Adobe would love to have invented an all-purpose, DRM equipped publishing medium - tough, HTML/plain-text etc. lives on :)

    --
    This comment does not represent the views or opinions of the user.
  42. One Size Does Not Fit All by Rick.C · · Score: 1
    There are different kinds of books. They have different requirements.

    Fiction - .txt or .rtf is fine. Many of the people who refuse to consider ebooks are referring to reading fiction titles. The technical aspects of an ebook interfere with their aesthetic enjoyment of the story. As my wife would say, "It's not what I'm used to." To make fiction viable as an ebook you'd have to get the device down to a bare minimun of size and complexity.

    Textbooks - having only one physical ebook with all your textbooks loaded would be very handy. Pictures and graphics are required. Some net-content linking might be helpful if you include wireless support. Students would tolerate a larger device and more complex operation than fiction readers.

    Reference books - sound and video would be appropriate, as well as linking to other content.

    A Slashdot post is not the place to do an exhaustive discussion of the subject and I'm no expert, so you can take this idea wherever you wish.
    --
    You were 80% angel, 10% demon. The rest was hard to explain. - Over The Rhine
    "Math in a song is good."-Linford
    1. Re:One Size Does Not Fit All by Anonymous Coward · · Score: 0

      Textbooks - having only one physical ebook with all your textbooks loaded would be very handy. Pictures and graphics are required. Some net-content linking might be helpful if you include wireless support. Students would tolerate a larger device and more complex operation than fiction readers.

      Uh, yeah. So, how does the student have the calc book, university physics book, CRC Handbook, and lab manual open on their desk at the same time, available for *instant* cross referencing?

      So long as you can change what book you are looking at without using anything but your eyes, the system sounds fine. If it takes any concious effort at all, it is a waste of time and should never be adopted.

      Oh, and how does one keep archival copies of your e-book class texts for use five, ten, twenty, fifty years later? I have a bookcase filled with my undergraduate and graduate texts. They are useful for reference and review for hundreds of years after publication.

      Perhaps business students could find this useful. Everyone else will reject it.

      Jim

    2. Re:One Size Does Not Fit All by Rick.C · · Score: 1
      Uh, yeah. So, how does the student have the calc book, university physics book, CRC Handbook, and lab manual open on their desk at the same time, available for *instant* cross referencing?

      I sit in front on one screen all day with many windows open - email, several technical manuals, two or three TN3270 sessions, UltraEdit (with several files open) and of course, Slashdot. I bounce back and forth without losing my train of thought. I don't see how this would be a problem for a mind that's much younger and more nimble than mine.

      Yeah, not being able to keep the ebooks for reference would be a show-stopper for college students, but for high school, who cares?

      I personally won't buy ebooks as long as DRM is involved, but if my high school texts had been issued as ebooks, that would have been great.
      --
      You were 80% angel, 10% demon. The rest was hard to explain. - Over The Rhine
      "Math in a song is good."-Linford
    3. Re:One Size Does Not Fit All by TheRaven64 · · Score: 1
      Fiction - .txt or .rtf is fine.

      Maybe, but only if you have some really powerful hardware for rendering. Look at a fiction book carefully. Look at the character and word spacing. The way the text flows makes it easy to read, and that can not easily be done yet in real-time. Especially with a portable device. A better choice would be to store it as a PDF or PS type format, where the text spacing is pre-calculated.

      --
      I am TheRaven on Soylent News
  43. Standard format by joe630 · · Score: 1

    The last time I remember a big group of people coming together to create a standard for electonic media, it was called DVD.

    Maybe if there is no standard, authors will pick the format that allows them the exact type of access control that they want instead of having a format thrust upon them.

  44. The solution? by jabbadabbadoo · · Score: 2, Insightful
    A new revision of PDF with small rendering capabilities. How hard can that be?

    Adobe, can you hear me? Business Opportunity Nocking.

    1. Re:The solution? by EnglishTim · · Score: 1

      That's 'knocking'.

      Cheers.

    2. Re:The solution? by TheRaven64 · · Score: 1
      A new revision of PDF with small rendering capabilities. How hard can that be?

      Not hard at all. It's called 'Tagged PDF'. There's a load of documentation for it on the adobe web site.

      --
      I am TheRaven on Soylent News
    3. Re:The solution? by jabbadabbadoo · · Score: 1

      No - it's a wordplay (nock means putting an arrow on the bowstring.)

    4. Re:The solution? by EnglishTim · · Score: 1

      Ah!

      Thanks - you learn a new thing every day.

    5. Re:The solution? by jabbadabbadoo · · Score: 1

      Thanks for the enlightment. Now I wanna be a PDF tagger.

  45. Another perspective... by borschski · · Score: 3, Interesting


    My wife has a small publishing/consulting company that has taken us 16 years -- and a lot of investment and pain -- to build. She works her butt off gathering the content which she then publishes as print products and CD-ROM "ebooks".

    She is devastated when she hears from someone that they've copied one of her color newsletters; made a "backup" of the CD-ROM ebook and someone else "happens to be reading it so I thought I'd call with a question"; and otherwise copies illegally (no...we don't have the funds to pursue them). She had an opportunity to publish a digital product in Asia and another in Latin America but these markets are notorious for buying *one* and suddenly hundreds or thousands appear (I could digress with a personal story when I was at a software company and saw this first-hand...but it's too long).

    PDF is the best standard right now. Platform support for everything out there virtually; security; but there is no meaningful method of DRM that would protect a small businessperson AND make it relatively easy to move ebooks from device-to-device (I know that I would hate to have to remember codes from dozens of publishers; be locked in to one machine for viewing; or other cumbersome methods).

    However, no protection = no incentive. I don't care if you're an recording artist seeing your music ripped off or someone like my wife struggling to grow a business. Why should my bride travel to Europe and domestically gathering content; pay correspondents and photographers; and publish a product in ebook format that is super-simple to copy and distribute?

    This is why I'm struggling so hard with the whole discussion about ebooks; copyright; DRM and fair use. So some how, some way, we've got to come up with a solution that offers some sort of universal ebook format that content producers can agree on and users can live with.

    My $.02....

    1. Re:Another perspective... by Rashkae · · Score: 2, Insightful

      Mass pirating in Asia asside, why not look at this from another direction?

      Instead of being sad when a person who "happens" to get a copy somehow phones with questions, I would be happy, thrilled even, to have a potential new customer who would probably otherwise never have heard of me or even considered buying my stuff!!!

      As an indenpendant publisher, your biggest obstacle is probably obscurity, not piracy. The occasional 'casual' copy exchanged from friend to friend is the best advertising your going to get without having to pay for it.

      As it happens, this topic gets lots of discussion on the baen web boards. Jim has about as much experience with e-publishing of fiction as anyone else, and has strong views about what works in the "real world" to make money.

    2. Re:Another perspective... by Svartalf · · Score: 2, Insightful

      Paper doesn't have "protection". Never did, really.
      CD's didn't have "protection" until recently- what they've come up with is useless.
      Cassette didn't have "protection". Neither did vinyl.

      Everybody did FINE without protection. Protection does NOT equate to incentive. Lack of protection does NOT equate to lack of incentive.

      How do you deal with piracy? Not by locking the stuff up. You deal with it by making it such that it is no longer profitable to do so- that is the REAL reason why piracy happens.

      --
      I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas
    3. Re:Another perspective... by metamatic · · Score: 1

      Well, let me give you another perspective.

      I haven't pirated any e-books, but the only e-books I've purchased are the O'Reilly Java reference library books. Know why? Because they're plain HTML. I can read them anywhere I need to, on any system I need to, I can search them, and I know I'll still be able to read them five years from now.

      The chances of me purchasing e-books in a proprietary copy-protected format are zero. Not low, zero. So it's your choice.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    4. Re:Another perspective... by Anonymous Coward · · Score: 0

      The problem is that it is easy for criminals to get around DRM, therefore it only hurts the legitimate users.

      Any person that wishes to copy your book can either break the encryption, or do screenshots, or countless other ways to duplicate the information.

      Probably the best way to handle it is to have a notice that the work is not to be redistributed, and give the users a way to purchase the book if they have come across it illegally.

      If you take a look into the paper equivalent, it is very popular for a person to buy a book and lend it out to friends. If the book is good, the friend may buy it. DRM usually doesn't have an easy way to let friends borrow.

  46. I agree. by MtViewGuy · · Score: 1

    I think the Adobe .PDF format is probably the best way to go for eBooks.

    I mean think about it: even the relatively low-powered CPU's used on PDA's have enough computing oomph to process and display .PDF files. And we're talking reading .PDF files, not creating one, which takes a lot more CPU processing power.

    1. Re:I agree. by wan23 · · Score: 1

      You apparently have never tried Acrobat Reader for PocketPC... ow, so painful :(

  47. Gutenbug limited to classics by peter303 · · Score: 1

    Its that damn Mickey Mouse copyright rule, now like 96 years since the authors death. So you wont see much Gutenberg seelctiosn from after the 1920s, unless the author has given permission.

  48. Why I don't e-book by whiskeypete · · Score: 2, Interesting

    It's all about the price.

    Shadow Puppets (hardcover) by Orson Scott Card is priced on Amazon for USD$18.15
    Electronic version USD$25.95 (M$Reader and Adobe)

    With the e-version the pubisher has no printing costs, no binding costs, virtualy no shipping costs, no warehousing fees, no sales clerks.

    Like most everyone, I prefer my books on paper, but there are times where an e-book version would be convenient. But I am not going to pay 4 times the paperback price for the experience.

  49. German GIFT vrs English GIFT by Anonymous Coward · · Score: 1, Interesting
    I've worked in the ebook world for 8+years it's not an easy thing to do. There are differing themes - and motivations. Publishers think in terms of paper - ebooks are not paper and you must think of content and structure and hidden meaning.

    Best example is in a German English Dictionary - if you choose the word "GIFT" and try to look it up, just what should you find? You might not like what you find.

    Then there is sytantical choices, ie: "I record the record" Which is a noun? Can you pretag, or not? What about the "People from the land of Er" - does the text to speach engine say "Urbidum? or is this the biblical land?

    These are *REAL* problems and no they are not solved, a trade publisher (novels, serials, magazines) has a different view then a reference (dictionary) publisher.

    If your goal is printed paper - postscript or PDF is fine, but to do syntatically correct search - it is horrible.

  50. Look at the Cellular markets by camconcay · · Score: 2, Interesting

    If the competing "companies" would take a look at Europe and how far past the US they are with cellular adoption, it would seem obvious that having one standard in the end benefits everyone. The US market is fragmented and the costs are much higher... .02 deposited

  51. The gaping flaw in their argument by Dagmar+d'Surreal · · Score: 4, Insightful

    "Right now there's a plethora of essentially incompatible ebook formats, and this format 'babel' is hampering the growth of the ebook industry."

    Bullshit.

    The problem is that the few people who actually still read books are not likely to be stupid people. On top of that, the people who are reading electronic formats of books are even less likely to be stupid people.

    However, it would take rather dim consumers indeed to not see a problem with paying the exact same cost for an eBook as one would in a brick and mortar bookstore for a paperback... and strangely when I go to these eBook sellers online, I see exactly that. "Oh joy! Instead of paying $7.95 for that paperback over an Barnes & Noble, I can pay just $7.95 to download an electronic copy in a format that I probably won't be able to read again in 10 years because the format and it's reader will have been declared obsolete!"

    The unwillingness of eBook publishers to see eBooks as something other than a way to increase sales profits by cutting out the middlemen of printing and shipping expenses is what is hampering eBook adoption.
  52. eBook experience by smaart · · Score: 1
    I bought one eBook in my life, and it will be the last. It was a pre-release of Scott Adams' God's Debris. The machine I installed it on needed to be rebuilt before I could read the eBook and I had nothing but trouble trying to get a second installation elsewhere. I eventually just gave up and kissed my 10 bucks goodbye.

    I later bought the book in plain analog bio-matter format and I will never trust an encrypted eBook that I can't back up ever again.

    BTW, the book was crap.

    1. Re:eBook experience by RichardX · · Score: 1

      I have to say, while I have a large number of eBooks, I haven't actually bought a single one of them, nor would I consider doing so any time soon.

      Before I get flamed as some kind of pirate scum, let me explain that they are all either public domain works from Project Gutenberg, or copyrighted works that I have bought in PBF (Physical Book Format ;)

      Personally I find this by far the simplest way. Generally I buy the real thing first, then download a text of it for my PDA for portability, etc, but I'll admit there have been times when I've downloaded an ebook first, and bought the physical copy after (usually the next day). And no, I don't feel any guilt over this.. it strikes me as an extremely sensible way to do it.

      I just download the text/pdf/html/rtf/lit/whatever (my preference being plain text generally), read the first few pages, just as I would if I were browsing in a bookshop to see if I like the book, and if it's a decent OCR/typeup... If I like it, and the quality is high enough, I'll keep it, read it, and be sure to buy the physical version next time I have a chance.

      Yes, it takes a little moral work, it's very tempting to think "Well, I already have it now, I don't REALLY have to buy it...", but if you want to see more books from that author, I'd suggest paying up.

      As an additional note to this, a while back I bought a paperback of Dirk Gently's Holistic Detective Agency by Douglas Adams (superb book, highly recommended ,but what do you expect? it's by Douglas Adams :).. anyways, I was quite happily reading my way through it until I got about two thirds of the way in, turned the page, and found.. an illegible mess of text.

      Basically, it looked like far too much ink had got on the printer or something.. where each letter should've been there was just a big misshapen black blob. This continued for about the next 15 pages or so. With great care it was just about (slowly) readable, but hardly ideal.. and where I live I couldn't really just pop back into town (35 minute drive) just to get a replacement, especially as the shops were shut at the time.. so instead, I just downloaded a .txt of it, and continued reading from there, then replaced the book next time I was in town. :)

      --
      Curiosity was framed. Ignorance killed the cat.
  53. It's not the format, it's the cost by Ratatoskr · · Score: 1

    I'm a big consumer of ebooks for my handheld. I've used several readers and several formats over the years, without becoming particularly attached (or annoyed) by any of them.

    What's keeping ebooks from taking off is cost. For contemporary titles, they charge as much for the e- version as the paper version. That's nuts. In a year or so, I'll probably have a completely different hardware (which may recognize the ebook format, but possibly not my authorization to use it). I just won't pay the same for electrons as I do for ink and wood pulp.

    So I get to read a lot of 19th Century novels :)

  54. Parent doesn't know what he is talking about... by danro · · Score: 2, Interesting

    The ultimate in bloat! Imagine; an XML document describing the vector points and bezier curves required to draw each and every character in an entire eBook! If nothing else, it would push the price of multi Tb drives down.

    You don't know squat about the SVG format, do you?
    If you took 2 minutes to check the SVG-specification you'd see that you are totaly wrong.
    Text in SVG is described as... wait for it... ordinary text. The characters are then mapped to glyphs in your SVG-viewer. Just like in MS Doc, or Adobe PDF.
    Just to show you the "ultimate bloat" I'll include an example. If you just want an ordinary plain vanilla book this is all it takes.

    <?xml version="1.0" standalone="no"?>
    <!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
    "http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd" >
    <svg width="25cm" height="35cm" viewBox="0 0 2500 3500" xmlns="http://www.w3.org/2000/svg" version="1.1">
    <text x="250" y="150" font-family="Verdana" font-size="55" fill="blue">
    Insert book text here...
    </text>
    </svg>

    --

    "First lesson," Jon said. "Stick them with the pointy end."
  55. EBCDIC by meadowsp · · Score: 1

    Pah, Ascii. I want them in EBCDIC.

  56. robot fuckhead by Anonymous Coward · · Score: 0

    You wonderfully narrow minded, arrogant fool.

  57. Plain text - the format of the future by Anonymous Coward · · Score: 0

    /Italics/ *Bold* _Underline_

    And I'm sure you are aware of the artwork capabilities if you browse Slashdot at -1. :)

    Anything I type that I think may have some value in the future, I always save in plain text. I feel sorry for those who 100-years from now will be trying to figure out how to convert FrontPage or MSWord "HTML" documents into a human-readable format, not to mention the DOC format and other proprietary nightmares.

  58. pdf{tex,latex} by Grendel+Drago · · Score: 2, Informative

    While everyone may use LaTeX, PDF has become more and more popular for web distribution of papers. PS works fine when you're just sending it to the printer, but because Adobe didn't include PS support in Acrobat, Windows users don't bother.

    But TeX/LaTeX has the advantage of being pretty much immutable, second only to plain TXT on that count. The standard hasn't changed since, what, 1982? Hopefully we'll be able to process the same documents with the same tools fifty years from now.

    I think the important distinction between, say, Word format and TeX is that TeX is a piece of systems programming---it performs a well-defined task in a well-defined matter, much like lex or yacc do. An attempt to add 'features' is nonsensical. (Though functionality can be extended through the use of, say, pdftex.)

    --grendel drago

    --
    Laws do not persuade just because they threaten. --Seneca
  59. I concur... do you concur? by Bpr · · Score: 0

    As i sit here on my ipaq i have no books & no good format for them. duh i don't want drm on a $40 50k file - but i want a friendly program - w/ features like autoscroll - w/ the universal format.

    --
    -- Whee
  60. On Beyond ASCII by Creosote · · Score: 3, Informative
    I understand the support in a lot of the comments here for the plain-vanilla ASCII Project Gutenberg approach to ebooks. Paradoxically, however, a simple ASCII conversion from print to digital form provides less assurance of future survivability and usability of your book than rendering it with the structured XML markup specified by the Open eBook standard (where well-formed XHTML is the least common denominator).

    Why? Well, an ASCII text version of a printed book is really more like an analog facsimile than is a version in XML that has been tagged for structural features. Leaving aside issues of non-English characters, illustrations, and unusual typography, ASCII does a relatively poor job of capturing all of the structural conventions that exist in printed books. Books have copyright pages, tables of contents, chapter titles, subtitles, bylines, epigraphs, block quotations, footnotes, running headers and footers, citation lists, etc. ASCII can provide rough format equivalents of some of these, very poor equivalents of others. With an appropriate XML tagset, however, it's a relatively simple matter to tag most of the structural features of a book and then use stylesheets for presentational rendering. That's the whole assumption of the Open eBook specification.

    Suppose you're in a world where all printed copies of Huckleberry Finn have been lost. You have two CD-ROMS that somehow you've managed to decode so that you can read the files and interpret their character sets. One of them contains the Project Gutenberg etext of the novel, an ASCII transcription. The other contains an XML encoding tagged according to a DTD from the Text Encoding Initiative, the current best standard for encoding literary (and many other) texts. It has all of the textual content of the PG version, as well as some that's missing (like the table of contents and the copyright page from the transcribed edition, which the PG version unaccountably omits). XML tags mark all the line and page breaks of the original. In addition, there are tags to mark quoted speech, unusual typography, words in foreign languages, and other significant features of the original. The CD-ROM contains the DTD used along with documentation on the tagset.

    In this imaginary scenario, even if all of the XML documentation were missing it would be pretty straightforward for 31st-century programmers to strip out the tags and recreate the ASCII transcription. But with the documentation, it's possible to reconstruct something much closer to the original than the plain-vanilla PG version allows. And suppose your 31st-century archaeologist found a trove of TEI-tagged books on CD: with all of the structural tagging and metadata about authorship, publication dates, etc., a 31st-century librarian will be able to plug all of the books into a cataloging system that allows sophisticated searching. If instead you had a trove of plain-ASCII books, the best you could do with the collection would be simple full-text searches.

    Leaving aside the sci-fi scenario, the reality is that our documents, over the next few decades, will move from format to format and be used for purposes that we can only guess at right now. Of course plain ASCII, or even proprietary formats, will be better than no documents at all. But the work involved in converting them will be a lot higher than if they are tagged in a well-documented, structured markup language.

    Incidentally, there's already at least one project underway to take Project Gutenberg texts and add minimal XHTML or XML markup to capture structure and make them more readable via stylesheets. The Open eBook specification is just a more sophisticated way of doing the same thing.

    1. Re:On Beyond ASCII by dvdeug · · Score: 1

      an ASCII text version of a printed book is really more like an analog facsimile than is a version in XML that has been tagged for structural features.
      [...]
      (like the table of contents and the copyright page from the transcribed edition, which the PG version unaccountably omits). XML tags mark all the line and page breaks of the original.


      You seem to contradict yourself - the format that preserves line and page breaks and the copyright page and other details that are only of interest to the scholar is more like an analog facsimile then one that extracts the meaning and ignores all the details of how it happened to be dumped on the page by the typesetter.

      I think you're confusing two different things here. Everyone at PG wants to preserve the contents of the original book - the sidenotes, footnotes, tables, etc. - and there are continual improvements going on in that area. More and more books are being posted as HTML, for example. But some of those details that were "unaccountably" lost are because they aren't part of the text; they're edition details of interest only to the scholar. Even most analog facsimiles drop the copyright page, and page breaks and line breaks are rarely preserved in reprints and new editions. In the opinion of most of the people at PG, it's not worth the extra work to preserve all those details that had nothing to do with the original author, and are only of interest to the scholar.

  61. libraries by rodentia · · Score: 1

    You may not have noticed that libraries are a target for the industry. They have never been satisfied with the status quo on the public good of lending libraries. The DRM/DMCA Catch 22 is bespoke for this purpose and you can bet your ass as soon as they get comfortable with the uptake of TCA and pals, they will pull print publication in a heartbeat. The only remaining value-adds for the publishing industry are metadata, catalogs and editorial. Libraries do the first two better, both more openly and more broadly. And simple software agents can do taste-based referral and demographic aggregation better than either libraries or publishers.

    love,

    Bunky

    --
    illegitimii non ingravare
    1. Re:libraries by RobotRunAmok · · Score: 1

      Dear Bunky,

      I'm not seeing it that way. The cost of print publication has dropped dramatically in the past ten years, but the retail price of books has not. It's similar to the vinyl-CD transition in the music business. I don't see the number of libraries growing (regrettably), so the print industry's maintenance costs for their library "accounts" would be reduced. Moreover, all the library expansion I have seen is in the area of computer access, not stack space. At this moment in time, and for the foreseeable future, libraries seem to most serve the unwired poor and elderly, and no amount of print industry or local governmental soft-shoe is going to pry the free hardbacks out of their hands any time soon.

      Yeah, yeah, I know, "Power to the People! Stick It to The Man! Corporations R D SuXXors!" and all that, but unless you're truly paranoid or a bitter unpublished author wannabe, it's kinda hard to paint the Book Industry with the same brush used for the Music Business.

    2. Re:libraries by rodentia · · Score: 1

      Dear Robot,

      You missing a few book-business idiocyncracies that complicate your analysis. Firstly, costs of print publication are not dropping dramatically. Process improvements have helped hold the line on costs, but up to 85% of print costs is the paper itself and paper as a commodity is highly volatile. 1995 saw a fourfold increase in paper costs for all printers and contributed mightily to the consolidation in the newspaper industry, for example. This necessitates maintaining high inventories relative to other manufacturing processes in order to hedge raw materials price swings.

      Another aspect of the publishing market you may not be aware of is an anacronistic feature of book distribution contracts termed a buy-back clause. What this means is that publishers commit to repurchasing unsold inventory from distributors. This habit has some quirky historical background and is very uncommon in other industries. It lingers both because distributors are loath to give up this safety net and because of the high cost of inventory maintenance for print publications. The upshot of this is that the line between production and marketing costs is very fluid for most publishers. High print runs signal a publishers commitment to a book and can be considered an important aspect of marketing, but it also builds in a cost overhang. Among other things this inefficiency contributes to the durability of book genres and big names in print. New authors and untried formulas are highly risky. This also contributes to some eggregious contractual terms for authors. Did you know that the vast majority of publication advances to authors are refundable? That is, if the book fails to sell sufficiently for the author's royalties to cover the advance, the difference is the author's liability. In fact, most new works of fiction fail this test and most fiction authors' first books are consequently their last.

      As to the cost basis for libraries I refer you this article

      I regret to say that the absurdities /. finds in the infrastructure of industrial music production and distribution were invented and refined by the book business for two centuries before the advent of vinyl. The book industry taught the music and film studios how to do the business of mass meme distribution and how to control their creative talent.

      --
      illegitimii non ingravare
  62. Research fields use PDF or HTML by pdjohe · · Score: 1

    Documents written by researchers are available online mostly in both PDF and HTML formats. For instance, if you search around medical journals at www.pubmed.com, most full texts that are available are first shown in HTML with a link to download the PDF if you wish.

    And I generally get the PDF if it is available - everything is embedded nicely into one file and it looks identical to the printed version.

    Thinking about it now, I know some science journals that have the full current journal available on the PDF format. I wonder why more magazines and newspapers don't try and do something like that?! I think I would subscribe to a nice PDF version identical to the printed version of some magazines I get.

  63. *Please* not fixed layout (eg PDF) by lpontiac · · Score: 1

    I recently tried reading an eBook on the Palm. It was presented similarly to a PDF - fixed pages with fixed text in fixed positions. This totally ignores the sort of advantages an electronic device has over paper!

    I can't shrink the font size to get more text on the screen, because I'm viewing one page at a time and it's always got the same text on it. Even worse, I loaded the same eBook in the desktop version of the reader, and I was still viewing the same amount of text at once, in a ludicrously large font.

    Fixed layout is bad enough when shoehorning Letter size PDFs onto A4 paper. Taking the same approach to eBooks which will be viewed on a wide variety of devices is just retarded.

  64. 80 x 24 column text that looks good in less please by A55M0NKEY · · Score: 1

    Either that or if Pictures Etc MUST be shown use HTML. PDF sux 'cause you can't cut and paste pics and text as easily and you need a special reader.

    --

    Eat at Joe's.

  65. It really isn't the formats... by Mac+Degger · · Score: 4, Insightful

    Though it might help, a universal format isn't what's hampering ebooks. It's price. I refuse to pay full price (and sometimes more!) for an etext of something I can get on paper; especially when I only get the etext.

    Halfprice, maybe even quarter price, compared to deadtree is what ebooksellers should be going for...but if I still have to pay fullprice and I don't even get my dead tree, I'll pay the same for something slightly more tangible.

    Now I would pay a couple of bucks (ie $2) more for a deadtree book which includes the etext.

    --
    -- Waht? Tehr's a preveiw buottn?
  66. This says it all, really. by arafel · · Score: 3, Insightful

    >Although end-users prefer not to purchase ebooks
    >protected with DRM (Digital Rights Management),
    >publishers are certainly interested in the DRM
    >capability of the universal ebook format.

    So although people would prefer not to buy books with this stuff, we're going to put it in there anyway. Whatever happened to listening to your customers?

  67. A standard will help, but.... by mwood · · Score: 1

    Commoditizing* the hardware with format standards can help, but the hardware needs to become a lot more capable too.

    I've yet to see an ebook viewer that can present (at a readable size) all at once even what would be one full page of text in a pbook; every pbook presents *two* at a time. You can do things with two full pages that would be ugly and unusable with only half a page in view.

    A pbook will function acceptably from candlelight to full sunshine. It will operate indefinitely without power. It will operate *underwater* if you must.

    I still haven't seen a bookmark scheme that works anywhere near as well as sticking my fingers between the pages.

    I can afford to have five books open concurrently and spread out around my workspace, which is a frequent need. Who's going to own five ebook viewers at the same time?

    -----------------

    * Note that word. That's why it won't happen anytime soon.

  68. You Geeks Missed the Best Part by Anonymous Coward · · Score: 0

    If you read the entire story above:
    Amy Hsieh writes "A well-known ebook industry expert, Jon Noring, recently wrote an interesting article for eBookWeb, formally calling upon the ebook industry to adopt a single universal ebook distribution format. . . On the other hand, Noring's proposal has also met with some skepticism elsewhere.
    and followed the links, especially the last link. And followed the links in that article to their final destination, [this is a lot easier if your browser supports tabbed browsing :-)] you will find that Mr. Noring had only digitized 2 e-books, but that those books are high class por^H^H^H erotica with pictures.

    BTW, the critics site has lot of texts, many of which can be read to young children in the presence of their grandparents, available for downloading in various formats for free as in beer.

  69. Dynamic fonts by yerricde · · Score: 2, Informative

    With CSS there's not a lot HTML can't do with layouts.

    No free, mature implementation of HTML and CSS can render a font not installed on the user's machine from outline data stored in the document. Mozilla has a bug on this open in bugzilla.mozilla.org (bug 52746), but it doesn't look like it's going anywhere. And no, "just replace with Helvetica, which is installed everywhere" is not an option because Helvetica for every non-Latin writing system is not installed on every reading device.

    --
    Will I retire or break 10K?
    1. Re:Dynamic fonts by arose · · Score: 1

      Books in MS Comic and now way to change it, that's inovation.

      --
      Analogies don't equal equalities, they are merely somewhat analogous.
    2. Re:Dynamic fonts by X_Caffeine · · Score: 1

      how's this for an option: "font-family:avant garde, helvetica, arial, sans-serif;" If the user has the designer's preferred font, great, it gets used. If not, it'll review the designer's remaining preferences, and if worse comes to worse, just pick whatever the reader's default sans-serif font is.

      --
      // I will show you fear in a handful of jellybeans.
    3. Re:Dynamic fonts by Anonymous Coward · · Score: 0
      That would only be a problem if you were displaying the book online. The ideal HTML-eBook standard would basically be a compressed, optionally encrypted archive with all images, fonts, etc. inside.

      You could probably even add Java programs, music, sound, and video, if size wasn't a concern.

  70. DRM by yerricde · · Score: 1

    Publishers of novels first published since 1923 are in general not willing to publish in a text format without a digital restrictions management wrapper.

    --
    Will I retire or break 10K?
  71. both PDF and HTML by bcrowell · · Score: 4, Insightful
    The article proposes two incompatible criteria: typographic richness (nice formatting) and adaptability (letting the end-user change font sizes, reflowing the text to display it on small and large screens, etc.). PDF is a perfectly good format for typographic richness, and HTML is a perfectly good format for adaptability. There is never going to be a single format that's good for both, because they're mutually exclusive.

    What I mean by that is that for many books with complicated layouts (including my own free books), it's simply not possible to reflow the text automatically. Consider an illustrated science textbook, which is the kind of work I do. There's a lot of hand-tweaking involved in getting everything laid out on the pages in the best possible way. And my books' layouts aren't even that complex compared to a lot of the big commercial textbooks out there. Some slashdotters may have used LaTeX to write academic papers, so they'll know how LaTeX tries hard to flow the text correctly, but ultimately it doesn't always do what you want, and either you or the publisher ends up doing more tweaking.

    The solution isn't that complicated: if a publisher wants a book to make an electronic book available in both a a typographically rich version and an adaptable version, they can create both a PDF version and an HTML version. Of course, this is really an answer to a question that the publishers never asked. Most publishers don't want open formats, because open formats won't allow them to continue to steal away the rights of end-users, such as the right of first sale.

    1. Re:both PDF and HTML by Grotus · · Score: 1

      What about XML and XSLT? With one stylesheet for publication and others for reading on various platforms you get best of both worlds. If you have id attributes on all your elements you can get as anal about your layout as you like in your publication stylesheet, and still have the content be fully adaptable unmodified by simply using a different stylesheet.

      The trick is having everyone using the same schema so that those adaptible stylesheets can be used for multiple documents. For publishers, the trick would be getting those pesky authors to use it.

      The Open eBook is one attempt at a standard, DocBook is another. DocBook is notably used by O'Reilly to produce paper and electronic versions of their books.

      --
      "From my cold, dead hands you damn, dirty apes!" - CH
    2. Re:both PDF and HTML by bcrowell · · Score: 1

      The real problem is software. There's good open-source support for creating and viewing both HTML and PDF. For people who are not technically inclined, there are also various proprietary GUI tools like DreamWeaver, Quark Express, PageMaker, Adobe Illustrator, etc. Also, consider SVG, which would presumably be part of any XML-based approach. SVG support still isn't available in binary releases of Mozilla, and the open-source tools for editing SVG (e.g., Sodipodi) aren't very mature and full-featured yet. And how about MathML? -- not yet really supported in Mozilla, either.

  72. Re:Just as long as it's encrypted to prevent pirac by N0decam · · Score: 1

    Double ROT-13 is much better.

  73. I do know by TFloore · · Score: 2, Interesting
    Look, even here on this "geek" board, loud are the arguments made for the current "analog" standard (paper).

    That's because a lot of /. geeks are avid sci-fi readers, and they know what a good interface for a book is. Reference books are good candidates for ebooks (in open formats) because you want to search in a reference book. But for pleasure reading, the primary user requirements seem to be:
    • No electricity OR
    • Very long battery life (and very long battery shelf life)
    • easy portability
    • very durable - I can drop a paperback and just brush the dirt off... I drop an ebook reader, do I have to buy a new one to replace the cracked screen?
    • sharable - I can loan my paperbacks to friends, I want to be able to do the same with an ebook (note that: "loan" not "copy")
    • long-term accessible - I have paperbacks from 1968 (cover price is about 60cents, I think) and hardbacks from about 1890 (family bibles, mostly), and you know, my eyes have no problems reading them... show me an ebook that will still be accessible in 35 years
    • Better resolution - paperbacks are printed at better than 300dpi (I heard 2400dpi once, but I don't really believe it with the quality of paper of recent paperbacks I've bought), and are easy to read and recognize for eyes. EBook screens are mostly in the 100ppi range, which is noticably worse.

    The main place ebooks beat paperbacks are in density, you can have many ebooks in a single reader, and in searchability and bookmarking. Many ebooks in a single reader doesn't help much if your battery is only good for 4 hours. Searchability doesn't help much either, for pleasure reading, but is very useful for reference books and school books. Bookmarking is good for starting up where you left off, but that is already a solved problem in paperbacks, it isn't a new capability.

    The problem is that, at this point in development, ebooks are a *worse* way to read just about anything people buy for pleasure reading.

    When the technology develops sufficiently to solve at least most, if not all, of the issues I noted above, I'll look more at ebooks... but until then, I'll stick with paperbacks, because they are *better*.
    You can bet that the publishing industry is not racing to embrace any seachanges in distribution if even the bleeding edge slashdotters are uncertain of any value to be found in the change.

    I'm not so sure about this. Book publishers like the idea of ebooks, because they can build in more limitations than are legal with current physical book publishing. EBooks kill "right of first sale" because they license the work, they don't sell it. EBooks remove the publisher's major gripe about "more than one person reads each copy of a book/magazine". Publishers like ebooks because they can severely limit readers' rights with them, and think they can do it legally, or at least with fewer court hassles.

    It's a bad deal for readers/consumers, but hey, that's part of why book publishers like it. It changes the power relationship in the publisher's favor.

    Hmm... on reading through this... this has *got* to be -1, Redundant. At least, I hope it is. Karma to burn... :)
    --
    This is my sig. There are many like it but this one is... Oops. Frank, I've got your sig again! Where's mine?
  74. TXT by happyhangone · · Score: 1

    Can anyone on /. see that we need a standard format for ebooks other than txt? Hey... out there are pleople trying to make a living writting, and releasing their work on a txt doesn't cut it (well maybe if you want to be the bitch of kazaa, edonkey, or every p2p out there)... Maybe there is an automatic hate for DRM here (ms style) but look at the sucess of DRM on openning markets of music that wouldn't have the chance to get known (Apple and his future arragement with small labels)... DRM is not bad per se... is the implementation of it (look at itunes for an example...)

  75. HTML? by turgid · · Score: 1

    What's wrong with plain ASCII?

    1. Re:HTML? by jc42 · · Score: 2, Informative

      Plain ascii text isn't too bad, if you only write in English, and you don't want your text to look nice on tiny PDA screens.

      But sensibly-done "plain" HTML is generally better. For an example, look at Baen.com. In the upper right is a "free" link that points to a bunch of sci-fi works that are online. You can get them in several formats. The HTML is a good choice in most cases, because it's not overly fancy, but produces good rendering in just about any HTML-capable window on any size screen.

      So, even if you have a big screen, you can load the text into a narrow window along one side of your screen, and read it while you're waiting for a compile or a test run.

      Of course, there's the inevitable problem of junk HTML produced by such things as Microsoft's various editors and word processors. But this isn't HTML's fault; it's the fault of the idiots who foisted such software on unsuspecting customers. And even then, most HTML renderers will display it sensible, so the only real problem it causes is the long download time for all the spurious junk that clutters up the text.

      (Baen.com also had a thoughtful and entertaining essay on why they give out a lot of their books for free. It's an interesting summary of the impact of the Internet from an author's and a publisher's perspective.)

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    2. Re:HTML? by turgid · · Score: 1

      I see what you mean. People look at me funny when I write nice, simple HTML in a text editor. Not only that, but someone told me that nowadays it must be XML, not HTML. I'm all for progress, but I also believe in horses for courses.

  76. ZIP vs. RAR by Ayanami+Rei · · Score: 1

    they both suck.

    Why doesn't anyone use BZIP on windows?

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  77. Unicode? by EnglishTim · · Score: 1

    Text? You mean ASCII? Sure, that might work for English, but what about other languages? You'd need it to be unicode.

    Also, I'm sorry to point this out, but normal books don't just contain text. Many books contain illustrations or diagrams as well. At the very least, most books will contain italics somewhere.

  78. Three issues by kalidasa · · Score: 1

    1. As the article notes, we need a format that is typographically rich. That means Unicode and style instructions. Maybe CSS3 or XSL:FO could do this, I haven't looked at the details of either closely enough. But plain text is NOT sufficient.

    2. As the article notes, we need a format that is adaptable. Adaptable means that the style information will adapt to the reading environment: both hardware (PDA, ebook reader, desktop monitor, projected on a large screen for an audience, audio, braille, etc.) and wetware (people who are blind, people who are colorblind, people who need large text, etc.). In order to be able to properly apply the styles, you would therefore need semantic markup (meaning an *ML, like DocBook or better TEI) and the ability to create multiple style sheets, or apply user style sheets.

    3. As the article mentions, DRM is a necessity (we may not think so, but the authors and publishers do). It seems to me that an open PKC could be used: you exchange keys with the publisher, and you can decrypt the document on screen, but the software doesn't allow you to export unless the publisher's key permits it. But this would probably be pretty hackable, wouldn't it?

    1. Re:Three issues by Anonymous Coward · · Score: 0

      No, DRM is not a necessity.

      If the publishers keep insisting that it is, then perhaps we should remind them that copyrights on eBooks are an option that citizens currently grant, and not a necessity.

    2. Re:Three issues by kalidasa · · Score: 1

      If the publishers keep insisting that it is, then perhaps we should remind them that copyrights on eBooks are an option that citizens currently grant, and not a necessity.

      And the authors will point out that their work is something they provide for consideration, and the right to read it is not inalienable.

  79. price by bcrowell · · Score: 1
    For me, the major hindrance to e-books is the price. Since there is no associated cost of the materials (paper/cardboard), printing, physical transportation, stocking space, and delivery, e-books should be [i]cheaper[/i] than physical books. But many of them are priced the same, or even high (you can check this at Amazon.) what's up with that?
    Paper, printing, and binding are typically only a very small percentage of the retail price of a book. For black-and-white upper-division college science textbooks, ther paper, printing, and binding cost is typically about $10, while the book retails for about $100-$150. Printing is very cheap once you get past the initial setup costs. A paperback bestseller at $7 probably cost less than a dollar to produce, because of economies of scale.

    I think what's holding back the adoption of e-books isn't price, it's quality. A book printed on paper generally provides a much better value for the reader, especially because reading long texts off of a low-resolution computer screen is so unpleasant.

  80. Latex by LotusMan · · Score: 1

    Why not use Latex? It's mostly plain text with command to format it.

    --
    -- Quidquid latine dictum sit altum viditur
  81. eBooks have had their chance and failed, why?... by Anonymous Coward · · Score: 1, Insightful

    Well there are a number of reasons why eBooks have failed.

    First and foremost is technology. Not just the software but hardware too. You simply cannot beat a stack of dead tree pulp squashed flat for durability and portability. People simply don't want to buy anything that disappears when their hard disk crashes!

    Secondly, DRM. Apple are starting to get the DRM idea right by allowing any number of CD burns and copies on a certain number of machines and iPods. Unfortunately, no book seller wants to agree to such liberal terms (with the exception of O'Reilly. Again, if your hard disk goes west you have to reapply for a license, yuk! In general, people regard books as property once they're bought. The IT idea of licensing simply doesn't wash with the masses.

    Thirdly, and back to technology again, hardware and standards. Without a single, simple easy to use and very very low energy consuming platform, eBooks simply won't take over the world. TFT isn't good enough and the sheer delicate nature of any electronic gadgetry makes it a no no right from the start. The current standards and level of DRM handcuff-ness is just a joke at the moment. There is no single compelling standard and all tie users too much to one copy or perhaps two if they're lucky... forget it!

    I worked for a big bookseller in the UK and we took a long hard look at eBooks (at great cost I may add!) and simply decided that the technology wasn't there yet and that the DRM implications make it prohibitive to implement in any sustainable manner. Perhaps Amazon and B&N can make it work for them, but I suspect that their sales are somewhat less than they would like.

    Having said all this, I still think there is a chance.
    Flexible displays that can take a pounding in a students back pack would be a good start. Make them stupidly energy efficient, add high capacity static memory (no hard disks!), single, simple, DRM liberal format and bring the price into the sub $100 range. Then we might see something happen. I'm not holding my breath though!

  82. It only makes sense by steveha · · Score: 1

    I would love to see a universal format. It needs to be rich enough to fully express a book: pictures, maps, small caps, everything. It should be compressed so the book can fit better on a PDA. It should be able to read books without putting a huge load on the relatively weak CPU of a PDA (this probably rules out PDF). And it needs to degrade gracefully: if a PDA chooses not to implement part of the standard, or if a new version of the standard comes out, the document reader software should be able to skip over the unknown parts of the document.

    The best part about a standard format is that you would be able to choose your reader, where today you may not be able to. I love reading Baen ebooks (I have bought many more Baen ebooks than actual paperbacks in the past year) but I hate the MobiPocket reader that I must use for Baen ebooks. But Baen has few choices: simple DOC format isn't rich enough for their purposes, and if they use something else (such as iSilo) then users are just locked in to a different standard.

    (Baen does offer mulitple formats, including RTF. I ought to see if iSilo offers a converter that can deal with RTF.)

    steveha

    --
    lf(1): it's like ls(1) but sorts filenames by extension, tersely
  83. What about HTML? by Anonymous Coward · · Score: 0

    Why invent a new format at all? What's the matter with HTML?

  84. MooV vs. Ogg? by yerricde · · Score: 1

    Ogg, used as en encapsulating format, allows you to put ANYTHING (Divx,SVCD,MP3,MP4,WhatTheHell) and have it used as an ogg file.

    So does MooV (*.mov), the QuickTime container format that underlies MPEG-4.

    That's how men Fansub groups makes releases including a 4 subtitles choice with a nice XVID compressed video stream.

    MooV does that too.

    Just thatOgg is quite a universal standard.

    The Ogg container may be a "widely used format with a public specification", which is just fine by me, but some people reserve "standards" to refer to formats defined by a publication of ISO, IEC, IEEE, ECMA, ANSI, DIN, or some other organization recognized by national governments as a standard-setting body.

    Has anybody worked with both Ogg and QuickTime? Which is more capable?

    --
    Will I retire or break 10K?
  85. Might as well use XHTML by yerricde · · Score: 1

    /Italics/ *Bold* _Underline_

    How does one represent pathnames such as /opt/kde/ if /foo/ emphasizes? (Use and you might as well use XHTML.)

    How does one represent mathematical formulas such as a*b*c? (Use MathML and you might as well use XHTML.)

    I feel sorry for those who 100-years from now will be trying to figure out how to convert FrontPage or MSWord "HTML" documents into a human-readable format

    HTML Tidy works for me.

    As long as English is readable, the XML, XSLT, and CSS specifications are readable. As long as relevant specifications are readable, documents written to those specifications are readable.

    --
    Will I retire or break 10K?
  86. And for unknown languages? by yerricde · · Score: 1

    if worse comes to worse, just pick whatever the reader's default sans-serif font is.

    Read what I wrote: "Helvetica for every non-Latin writing system is not installed on every reading device." What if the reader's default sans-serif font has no glyphs for a given language? What if the reader's operating system has no support for a given language?

    --
    Will I retire or break 10K?
  87. Publishing industry is dying by solprovider · · Score: 1

    I am sorry that your wife chose a business that had a limited lifespan. 16 years ago was 1987, so it was not obvious at the time that it was limited, but she needs to find a new focus of the business is to survive.

    "Publishers" had 3 purposes:
    1. Searching: Finding interesting material.
    2. Editing: Fixing the material to conform to stndards.
    3. Distribution: Spreading the material geographically.

    1. Searching for good material is still difficult, but only because most creators do not post their material on the web. Someone will make much money by creating a website where new material can be rated by users. Amazon and others have a good start, but they only rate materials after the materials have been selected by publishers. How can they rate my book when I have not made it available?

    If authors release their works on the web, publishers could make money by providing rating systems and headlining a book per month. Can the publishers make money with this type of service? Will they pass some of the money to the authors? Can they make their service so much better than amateur websites that people will pay for it?

    2. Editing is still a very useful function, but it is underrated by everyone. Spell-checkers and Grammar-checkers reduce much of the work. Style-checkers are more important.

    A friend just self-published a book. He wrote it, then had it printed by a "publisher". I do not know what tools were used, but I asked about editing and he said that the publisher did none. My first criticism was that he used passive voice for the first two chapters and switched to active voice for the rest. Since most people decide whether a book is worth reading during the first few chapters, this could have a major impact on its popularity. A human editor should have caught the issue.

    I recently read a book titled "The Mushroom Man". It is in the New Books section of my library, so I assume it followed the normal publication process for books. The entire story is told from the third-person perspective, which keeps the reader from getting involved with the characters. It also has the problem that the main plot is a children's story, but the main subplot is very adult-oriented, with concepts and language that will keep it out of the children's section. This is the author's first book. If the author had advice from an experienced editor, the entire book could have been rewritten from the perspective of one or two characters. The adult subplot should have been dropped. These two concepts would have made the book much more readable, and given it a clearly focused audience.

    Another example was "Snow Falling on Cedar" or something like that. It had a good story, but the sections concerning sex and war were the extremely boring. How did that get past an editor? I think the author is famous for his work in other areas, so maybe he was able to bully the editor. Usually war and sex add exceitement to anything; here it was painful.

    Publishers could assist authors in fixing books, but who pays? Will they work with an author for free, or will they charge the author? I doubt most beginning authors have the finances to pay for the service.

    3. Distibution has been changed by the internet. The internet has reduced the cost of distribution for anything that can be transmitted as bits to almost nothing. In a few decades, publishers will not be able to make any money from distribution. Yes, this includes music and video as well as written words. Publishers may be able to provide central places for downloads, but they cannot prevent others from duplicating their work. I have not even found the questions that could make efforts at distribution profitable.

    I am asking more questions than I am providing information. All industries based on providing content that can be reduced to bits are in danger, and need to transform to survive. Some have already disappeared.

    A friend worked for AMEX business travel. Her job was to find the bes

    --
    I spend my life entertaining my brain.
  88. Baen is selling E-books now - no DRM by Anonymous Coward · · Score: 0

    Baen publishes E-Book in various forms - even plain HTML. Cheaper than paper books. You even get them earlier than the bookstores.

    http://baen.com

    http://www.webscription.net

  89. Has there ever been a format... by Nucleon500 · · Score: 1

    Has there ever been a format that hasn't been quickly cracked and put on P2P? That it will appear on P2P isn't a good reason to use DRM, because it will anyway. Hint: fair use == less money, thus DRM.

  90. There is a universal e-book format by windowpain · · Score: 1

    It's called SGML. Been around a while. Whatever they come up with, it should be based on SGML.

    --
    Insert witty sig here.
  91. He seems to have missed something... by DivideByZero · · Score: 1

    Many publishers and self-published authors require their published content to be distributed with DRM protection. It is certainly possible to build into the native OEBPS Publication wrapper a DRM protection system. Microsoft LIT, as previously mentioned, is an excellent example proving this assertion since LIT is a DRM-protected wrapper of essentially an OEBPS Publication. No more need be said on this.

    Hasen't anybody told him LIT has been cracked? Can't he do a web search?

    I can't imagine his 'XML searchability' and multi-device interface (How would a program tell the difference between a serial braille printer and a 286 running a term program, for example?) are going to make the document text any more secure to being lifted, honestly.

    Untill he figures this out, it seems like the article can be basically summed up as 'You should publish everything in XML, and call it by my nifty new name.'

    1. Re:He seems to have missed something... by Anonymous Coward · · Score: 0

      I reread his article, and he does mention that LIT has been cracked. Reread the paragraph starting with "Two notable examples..."

  92. Good point by 2901 · · Score: 1

    This is such a key point. EBooks are expensive gadgets. They have to do something that paper books don't. Reflowing the text, for different screens sizes and font sizes isn't a killer app, but leave it out and you've just thrown away a major feature.

    PDF is such a nineteenth century format. I find it infuriating.

  93. Standards by Merovign · · Score: 1



    1. Read replies to this post.

    2. Think for a second.

    3. Get used to Babel.

    Any idea that depends on universal agreement is born doomed to failure.

    It would be nice to have an eBook standard. It might happen, but I expected the DVD standards war to be over two years ago and I was foolish to do so.

    I guess the best we can hope for is an active community writing conversion software. Which of course means ripoffs. I don't know that it's a winnable battle. I hope so.