Is Free Software Ready For E-publishing?
johanneswilm writes "Over more than 3 years I have been writing my PhD thesis on the politics of Nicaragua. Being the most professional system for PDF generation, I went with LaTeX, and, to make the text accessible for the editors, I used the LyX editor. Now that the publication date comes near, I found I had to spend considerable time creating a script to convert the manuscript to formats such as Epub as none of the available tools were quite ready to do it automatically. Is LaTeX only good for writers in the natural sciences? Is the open source community boycotting ebook formats, as Richard Stallman has proposed? Are there better tools to do the same?"
Being the most professional system for PDF generation, I went with LaTeX
Now that the publication date comes near, I found I had to spend considerable time creating a script to convert the manuscript to formats such as Epub
It sure sounds the like most professional system!
The truth is, if you want your job done, you look at the merits of every possible program without considering if it's open source or not. There are good software like Apache that are mostly good for web hosting (unless you have certain requirements). Then there is lots of shit. The same is true for proprietary software tho. But if you want to get something real done, it's just stupid to limit yourself to only open source OR proprietary software. Pick the best tool for the job.
Google+ vs. Facebook, and why Google+ will fail
The best way to go seems LaTeX->HTML->ePUB. I guess many of your problems do not come from LaTeX itself, but from the fact that the LaTeX code that LyX outputs is... well... not meant for human editing and for further work. (haven't worked with LyX in a while, though -- maybe the quality of the TeX it produces has considerably improved in the meantime).
My first program:
Hell Segmentation fault
calibre is a free and open source e-book library management application that can convert to and from most of ebook formats. And does a pretty good job at it.
http://calibre-ebook.com/
LaTeX -> pdf then convert here: http://www.2epub.com/
1. Realise no scripts exist for problem
2. Write scripts
3. Release scripts as open source
4. Don't post pointless problem on slashdot
5. ???
7. PROFIT!
(We don't talk about point. 6)
- http://www.milkme.co.uk
considered what LyX/LaTeX were actually designed for. LaTeX is an interface to the TeX *typesetting* program, which does exactly that: it sets type on a page for printing (on paper) purposes. It just so happens that TeX can "print" to digital formats like PDF and DVI, which also just happen to support some "ePublishing" features like indices, search and table of contents. TeX/LaTeX/LyX are themselves NOT ePublishing platforms, they are means of applying visual styles to text. Since ePublishing itself doesn't even make sense as a term (is a website with information considered "ePublishing"? I think so -- so perhaps you should have authored your paper on a web CMS) when you consider the technical intricacies of everything involved, I think you'd have been better off investigating all "publishing" options before even starting to author your paper in a complex system. If it was me: I would have started to write text in a plain text file until I had decided the best route to go. If graphics and equations were required, I would have moved to a generic HTML + css method.
Work with Calibre if you need to convert formats.
epub is basically compressed html with some indexes, I think its even .zip format.
Calibre can handle most of this stuff for you if you can get it into an acceptable format from tex.
Why not just use Calibre to convert from pdf? It is very easy to use and supports all kinds of formats. See: http://calibre-ebook.com/
Stallman complains about DRM and a lack of anonymity with eBooks. It seems to me that this story relates very closely to legally acquired music. While it is still difficult to legally acquire digital music anonymously, it is easy to get it without DRM. I suspect books will follow this same path if consumers value it as a feature. In practice there is in fact little anonymity in the purchase of real books as everyone wants you to swipe your "club" card and use your debit card to make the purchase but his point is well taken. The option to buy an unpopular book in secret is nice.
With time and interest from consumers we will have DRM free books.
Anonymity is dead and gone and I didn't even get an invitation to the funeral. We should all mourn it's passing.
I went to arxiv.org and picked a dozen or so papers from the "new" list and clicked their other format links. They're available in pdf, ps, and dvi formats. This is hardly a complete analysis since I don't have any access to the "real" journals, but I have to wonder how many journals and universities are demanding papers in ebook format.
Open Source generally scratches itches. You may be one of the first people with the itch of converting theses to ebook formats.
If I have been able to see further than others, it is because I bought a pair of binoculars.
"Is the open source community boycotting ebook formats?"
Hardly. Calibre is an excelent converter, library manager and it's compatible with most of the readers out there for syncing. You could try converting from pdf to e-pub with it, although PDF is a lousy input format.
I've found pandoc (here: http://johnmacfarlane.net/pandoc/) to be very useful for generating PDF/ePub/LaTeX/etc from Markdown formatted text files.
Others have told me that the financial gain of publishing an academic book may be up to 700 USD. In comparison to current Scandinavian wages that really means very little, so I don’t think that earning another 700 USD should be a motive to restrict the access to one’s thoughts.
First of all I would like to commend you and thank you for this sentiment.
Is the open source community boycotting ebook formats, as Richard Stallman has proposed?
I don't understand, Stallman decries e-book formats that aren't open. There are many open e-book formats--including ePub. Granted, there are tools out there that allow you (to varying degrees of success like Calibre) to crack and convert to these formats but why bother? As you can see in that table, most everyone supports PDF. You are misunderstanding Stallman's gripe. It's not that we are boycotting e-books, it's that e-book makers are trying to carve out their own proprietary section of the electronic market, reader and creators included. So let them take their ball and play elsewhere. As you noted in your blog, this isn't the only problem:
Most ebook-readers out there so not implement the Epub-standard perfectly. That means that although one has an Epub that follows all the standards, one can be quite sure that it will not display properly on all the readers. Kovid Goyal, the creator of the Calibre ebook management software has done a good job in creating conversion scripts that create Epubs for all the different readers. Unfortunately they do this by breaking compatibility with the standard, and many distribution sites will only check whether your Epub complies to the standards and not whether the book will actually look good in the reader.
Most readers handle PDF, I would just stick to the output of LaTeX. I might suggest that your expectations are misdirected at the open source community and might be better directed at the makers of readers that apparently force you to break standards. It's the IE6 conundrum all over again.
Stallman didn't suggest boycotting ebook formats, just the DRM associated with them (big surprise there). The problem you are experiencing is that sometimes it's difficult to go from one open standard to another. The tools are lacking in maturity and I'm guessing that since my Android phone can easily display PDFs for me that there's not a lot of people demanding this ePub support that apparently needs multiple flavors for each device (and Calibre helps you with this). The tools exist but they'll only get you so far and I think the really special stuff that LaTeX does well is what you'll find yourself needing to fine tune in the end product. Look at how long it's taken LaTeX to get that beautiful and I think you'll discover that making a magical cure-all converter to ${random format} can be a non-trivial task.
If you start a kickstarter and get your university to donate hosting to making an open free market for any academic papers in any open format, I'd definitely throw in $20 (I've spent about $200 on kickstarter in the past two years). Either that or maybe throw your lot in with arxiv and work with them to fund more format support?
My work here is dung.
The solution to your problems is Pandoc which can convert LaTeX to EPUB if you like. Now, it will probably take some fiddling on your part with the output but it very much smooths the process.
You should try Asciidoc or docbook directly. I don't know if LaTeX has enough information to be faithfully converted to epub. But Asciidoc can reuse the LaTeX notations for a number of things.
I think it's the questions of right tool for the job, docbook is very widely used and is designed for working with books, and asciidoc and simular tools are for the non masochistic of us that prefer to edit text files and not raw xml. If you like gui there are plenty of guies for docbook, including LyX.
While he states "We must reject e-books until they respect our freedom." He also outlines 7 things amazon's e-books do that violate this freedom. Fortunately epub is the most widely accepted e-book format and it has none of these 7.
RMS isn't against e-books. He's against amazon's approach to e-books.
Yes, well Stallman is quite the crackpot, so anything he says should be questioned as to ulterior purposes.
Tesla was a genius. Edison however was a overrated hack who liked to torture puppies.
Is there some reason MS Word or OpenOffice + stylesheets aren't up to the task? It sounds like you might be overcomplicating things.
LaTeX is the problem, not the FOSS movement. It's horribly obsolete and makes almost everything harder than it needs to be. LaTeX and TeX are the wrong answer for everything, natural science included, unless you have to set large amounts of mathematics.
I don't know if it meets your needs, but did you look into SGML and all the related facilities? That suite seems to enjoy good support, especially in Debian-based distros.
But if you want to get something real done, it's just stupid to limit yourself to only open source OR proprietary software. Pick the best tool for the job.
Be careful: sometimes, especially in cases of works under a "copyleft" or "share-alike" license, a work's copyright license limits which tools for the job are lawful. For example, some licenses require works to be made available in an editable format that isn't Java-trapped.* See, for example, sentences containing "Transparent" in the GNU Free Documentation License and sentences containing "technological" in CC BY-SA. You can use proprietary tools yourself, but you also have to make sure that the work can be edited with free tools.
*Term's original is historical, prior to IcedTea.
"Over more than most of the timely use of my time of 3 years I ..."
I heard you liked to describe descriptions so I put an adjective next to your adjective to describe your description
If graphics and equations were required, I would have moved to a generic HTML + css method.
Most web browsers that I've seen are based on the model of rendering a web page to a scroll that is 960px wide by infinitely tall. But in the real world of print, the codex has replaced the scroll. The paged media module in CSS3 is still only a Working Draft. So which web browser would you recommend that has thorough support for MathML and for CSS paged media?
I don't think anonymity is dead, I think its quite alive and kicking.
What did die was the expectation that others should protect your anonymity for you.
Googling pdf to epub turns up this: http://lifehacker.com/5509965/how-can-i-convert-pdfs-and-other-ebooks-to-the-epub-format It talks about an open source program called Calibre which can apparently convert pdfs to epub (and many others). So presumably pdflatex followed by this would give exactly what you needed.
http://compsoc.man.ac.uk/~shep/
I have helped to create a site for scientists to post their articles on the web. One of the problems is that academics tend to love their tools and do not want to switch, often because they have relatively elaborate workflows and practices, which can cope with their lives; whether this involves writing lots of maths, spending lots of time offline travelling, collaboration or whatever.
We got around this just using Wordpress. Many of the tools out there can already communicate with a blog: this includes Word which, like it or not, is the main tool that scientists use. Others have mentioned things such as asciidoc (which I use). It's okay for short articles, but for a thesis, I would want to use latex. The support for editing in asciidoc is just not as advanced, particulary if you want to do crossreferences, citations, graphs and so on.
There is currently not a good latex -> HTML solution -- in the end, I used PlasTeX to create a tool unimaginatively called latextowordpress. Not perfect, but it works okay in most cases.
http://www.russet.org.uk/blog/2010/08/latex-to-wordpress/
Once you are in wordpress epub and PDF fall out for free, as there are standard plugins for generating these. Personally, I don't do so; I have not found any substantive advantage over HTML, but they are there if you want them.
The process of publishing in this way is not entirely slick, but the results are quite nice. See http://knowledgeblog.org/ and subdomains for examples. And even if the process could be improved, my experience suggests that it is easier than using a commercial publisher. In many cases, it is even less error-prone, as you can see the final published form as you are going, without human intervention in the way.
First semantic markup refers to enhancing the text by providing information about its meaning. HTML (CSS) and LaTEX specify the layout of the text regardless of its meaning. Secondly the open source packages are a little behind the curve regarding ePub and support for MathML in browsers. Having a coherent tool that publishes to PDF, HTML , ePub and supports equations well would be a great boon to scientific and technical publishing. BTW it also needs to be scriptable. That said if people know of something I don't please fess up I do scientific publishing and you would save me considerable time if solutions already exist.
Sorry, I resoundingly disagree.
Looking at your brand new user name, some members would call your post a broad shill for all proprietary closed programs.
The entire point of Open Source is that it can be moved to new innovative uses. Open Source will be slightly-to-much harder to use in many cases! But that is not the point of Open Source! The point is that a valid computing experience can be made out of open components. Yes, someone will have locked down the "1-click" version of a feature with a patent. So it takes you three clicks. Three clicks is way better than spending an hour kludging it.
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
Richard Stallman is one of the geniuses of our time. All he wants is to share software with his friends. and to have his friends share it back.
johanneswilm also wants to share. Its clear by his wanting to use FOSS in his process.
Perhaps you need to rethink who is the crackpot here? The answer is clear here.
You should listen to him, then you need to judge for yourself if what he is saying is Insightful or just Dribble.
RMS is very intelligent, but an Utopian idealist, who looks at situations more academically then practically, has a strong distrust of opposing ideal, so he is closed minded.
If you just follow RMS then you are not thinking for yourself. RMS has some good points and great ideas, and other just mindless rants to rage against "The Man".
Being that RMS calls for a boycott of something doesn't mean one should follow it.
If something is so important that you feel the need to post it on the internet... It probably isn't that important.
There is an open free market for Scientific publishing called PLoS http://www.plos.org./ PDF's suck on eReaders mainly due to the fact the text does not reflow for different size readers. The reason eReaders don't support ePub as well as they should is because most eReaders are not sold for profit but to hook you into the distributor's DRM'd products ala Amazon. It is not their priority. Converts just suck. Enough said.
My version of e-publishing was, "write the thesis in LaTeX, output in PDF via pdfLaTeX, and upload the PDF to Google Books." Instant global accessibility for anyone that wants it (well, instant after the processing period) -- certainly a heck of a lot better than any exposure my University can offer, although I gave them the PDF too, and they supposedly make it available somewhere. It's not EPUB, sure, and I would convert it to other formats if I felt that the effort was worth it, but maximising availability was more important to me than making it convenient for small form-factor e-book readers. I considered EPUB, but I feel that PDF is good enough, particularly given the effort that went into making it look nice in its published dimensions.
If I were going to write another book, however, I'd finish my half-baked "writer's mark-up language" project first. It's a markup language designed to be writer-friendly, medium-agnostic, and readily translated into other forms like HTML and LaTeX for actual rendering. I don't have any immediate plans to write another book, though: writing the thesis has taken the edge off my enthusiasm for the subject for now.
proof, n. A demonstration that a conclusion is implied by certain premises and axioms.
What more do you want? PDF works well as an e-document.
While it's true that ebooks present the possibility of digital restrictions management, Smashwords, a ebooks distributor site, doesn't use DRM, AFAIK.
I'm not a lawyer, but I play one on the Internet. Blog
http://johnmacfarlane.net/pandoc/
Unless you do some really wacky latex stuff, pandoc works great
Do not look at laser with remaining good eye.
It is based on plain text and allows cross linking, references, equations. http://orgmode.org/
I really like TeX and LaTeX (But don't google on the latter without some additional modifying keywords...) and used to maintain my resume in it. Turns out most contracting companies don't want a static document they can't modify, so I ended up dropping the whole thing into a big E-Lisp data structure which I serialize into eieo objects and then emit to some other markup language. I wrote emitters for HTML and Plain Text, but really you can do anything. I have it on my to-do list to rewrite it in another object oriented language one of these days, and possibly change the storage format to XML. It wouldn't be hard to get it in XML format -- I'd just have to change my HTML markup emitter a little bit.
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
All you need is proficiency in latex and that's it. There is no shortcut. Be patient and become expert. The more expert you are the more fast and productive you will be. Hifi tools make you lazy.
Living in a Winblows world with all the brainwashed money and fanboys behind it, its hard to fight mass ignorance. Enter FastPencil.com... http://www.fastpencil.com/company/media http://www.fastpencil.com/company/learning_center#TB_inline?height=648&width=800&inlineId=importing-pdf
LaTeX is a formatter: it does an excellent job of typesetting and can produce publication-quality PDF (my company uses it all the time for this).
But it is not a reusable file format. The only processor that really groks it is TeX, and that can only output PDF or DVI. If you want multiple formats of output under program control, you need to use a output-agnostic file format like XML, from which you can generate LaTeX and any other kind of appropriate source code to create PDF and other kinds of output (eg HTML, XHTML, and whatever else you need).
I very strongly suggest you use TeX4ht to convert what you have into an intermediate format of XML, and then postprocess it and clean it up to create a master copy in a well-known and robust XML format like DocBook or TEI, and start from there. There are then plenty of good tools that will let you create multiple outputs, including eBooks, web sites, topic maps, even Braille and voice output...and of course LaTeX, to regenerate your original, if needed.
<plug>I am chairing the session on XML and Publishing at this year's XML SummerSchool in Oxford next month. Come and find out all about it.</plug>
This would only be relevant if we were discussing templates and packages to be embedded as part of the document
In the case of documents under the GFDL, the copyright license requires that those who distribute copies of the document also make copies available in a "Transparent" form, one editable using free software. So those who make derivative works have to make derivative works available in a "Transparent" form. Or perhaps I misunderstood "Transparent" in the GFDL; what am I missing
Even grandpa can do it!
CutePDF writer (www.cutepdf.com) Just print from your application.
Calibre to convert PDF to any other format (calibre-ebook.com).
I killed da wabbit -Elmer Fudd
I wanted to have an article readable on my kobo on the go. That was not so easy but not so difficult as well. epub is mainly compressed xml (or xhtml can not remember).
The procedure to obtain the proper html goes through compiling the latex with pdflatex and bibtext so as to have a proper pdf AND intermediate latex files. Extract the bibliography information from the intermediate file. Regenerate the bibliography in html format using bibtex2html. Play with head and tail to cut header and footer.
Then, generate the main document in html using a latex2html tool. I think one of the best one is hevea. It leaves some header and footer that can be once again removed with head and tail. I did not had chapters since it was an article, but if you do, you probably want to cut your latex source in independent chapters to have one html document per chapter.
Now you have everything you need in an html format, converting to epub is a few sed away to go from html to whatever fooML format epub is composed of. Finally write some TOC (or extract it from the latex intermediate files) and compress the whole thing.
Congratulations, you have an epub.
That's a pain in the ass to have to do that, but that is not very difficult as well. I'd love to see a tool do that automatically though.
Cash is hard to use why? I walk into a store. I pick out a CD, a DVD and a book. I proceed to checkout and pull out a fifty. I get change and leave.
Wherein lies the difficulty?
I write with Scrivener so my entire tool chain isn't open source but I wrote XHTML2EPUB to create an EPUB from my project.
Instead of thinking about the toolchain per se it goes from MMD -> XHTML -> EPUB with working directory for tweaking the CSS and whatnot.
The store has 87 CCTV cameras, all linked to face recognition software.
Sent from my ASR33 using ASCII
Or you could build yourself a PHP tool that does it, as I did here. (Disclaimer: intended for my personal use, so basically no user-friendliness at all.)
Creating ePub's is surprisingly easy from a programing perspective.
FanFictionRecs.net
I wrote PhD thesis using Open Document Format (ODF) and it worked out very well. I used OpenOffice.org, but I expect that LibreOffice would work at least as well. You can translate to many other formats (e.g., with "Save As" or external tools).
As with any big writing effort, one key is to separate formatting from content. You should FIRST set up a template with that does all the formatting, including all the paragraph types you'll need and the right format for them. Then write your document, selecting the paragraph types for each paragraph as appropriate. Do NOT embed formatting commands in the document itself - paragraphs should NOT have font settings, etc., but instead these should be controlled by the paragraph's paragraph type. In my case, I created an OpenDocument template for George Mason University (GMU), and gave it to GMU so others could share it. If you create a template, please share it with others.
- David A. Wheeler (see my Secure Programming HOWTO)
Calibre [http://calibre-ebook.com/] is open source and doesn't a decent job of converting between various e-book formats. I use the PDF to EPUB conversion frequently for my nook.
LyX already supports output to XHTML. Merely generating XHTML output and renaming the file extension to epub has a good chance of working.
Some things might have to be tweaked, however, and that would require changes to the LyX source code.
PDF is great for a lot of things, but not for readers. Sure you can read your PDF on your Android, but you better be prepared to zoom like a mad-man.
If I am reading a paper on the geo-political state of Belize, why would I need to be zooming so much? Just let the text be reflowed to my device. This is why epub and similar are the way to go, because they allow a great deal more portability. Why does the author of the paper care if I maintain his margins or his font? He doesn't.
I get so many different types of information each day via PDF when it really should be in another format that I am sick of it. If you are sending me a document that is about to be published and I am proof-reading the layout, then send me a PDF. If you are sending me an architectural drawing then send me an Autocad DWG!!!
Pretty much only the Kindle does not support ePub. Everything else pretty much supports ePub well.
Just a Tuna in the Sea of Life
and worked, over several years, as an editor in technology publishing, then as an editor in history publishing, then as an editor in academic publishing (journals in the social sciences). In all cases, we worked in Word and sent Word files to the production team/department/contractor. Their processes upon receipt of Word files varied, but the fact is that MS Word is a major standard in many areas of publishing, certainly in mass-market books.
STOP . AMERICA . NOW
Look, just because one uses GPL tools, it doesn't mean that any document created by such tools is automatically GFDL.
I know that. But all derivative works of a GFDL work are GFDL and therefore must be made available as a Transparent copy. I apologize for not making it clearer in the first place that I was referring to GFDL works.
One has to CHOOSE to use GFDL for their documents.
And if I'm tasked with converting a document for which someone chose to use GFDL, I have to abide by the GFDL. This means, as I understand it, that the toolchain must support starting with a Transparent copy.
Hey everybody, most of the comments seem to assume that given I write about the politics of Nicaragua, I surely cannot have solved this. That is incorrect, indeed I did wrote a monster spaghetti code script that does what I set out to do. It works with graphs made with pgfplots, etc. . I just wondered why I needed to do something that I saw as a fairly basic task. You can find it at: http://www.johanneswilm.org/download/compile
You are suffering from a fundamental misunderstanding of what an ebook is. An ebook is not any old digital file that contains text. For a digital file to be an ebook, it has to be reflowable. That means that a computer has to be able to relayout its text and images intelligently at different screen sizes, automatically. In order for a computer to do that, with a level of understanding, short of actually understanding the text, the text in the digital file has to be marked up semantically. That is paragraphs, headings, caption, etc have to identified as such and there must be no drawing commands like, "move to this position and draw this character".
PDF is a page layout format (I'm ignoring PDF tagging which is a bolted on after thought). It is designed to represent text and images at a *fixed* page/screen size. LaTeX is also a page layout format, which is why you use LaTeX to generate PDF, but you cannot use it to generate HTML (without ugly hacks that dont work for an arbitrary LaTeX document).
So if you want to publish in both PDF and an actual ebook format, you need to start with a semantic format like XML/markdown/etc. Then use tools to convert that semantic format to ebooks and to PDF. There will never be a tool (opensource or closed) that can convert a PDF that has non trivial non-formatting into a decent ebook automatically.
I've owned several generations of eBook readers (MS Reader/Acrobat on a Casio EM-500, Franklin EB-911, eb1100, Sony PRS500, PRS300, PRS600, and a first gen Kindle DX). Almost all of them notionally supported PDF. Out of all of those, only the Kindle DX does a decent job of displaying PDF documents. In fact, I had a first gen Nook that I didn't keep for more than 2 days because PDF support was so crappy.
[Styling for printing] doesn't matter to most people, that's an edge case.
But it's an edge case important to the recommendation that Anonymous Coward made: preparing a document that will eventually be published in a paged medium using "a generic HTML + css method".
Practically every time you ask for "recommendations"
I'll confess that I ask for recommendations to avoid "just Google it". I've found Google useful for learning more about something that I already know exists, not so much for discovering that something exists (gets tripped up on synonyms) or for finding what is the best practice (no indicator of a source's reliability).
it's for an edge case that simply isn't important to most people.
Part of what makes us individuals is that our respective situations each have different edge cases. One size does not fit all.
If you want to be a zealot on open source issues go ahead
You're right: The comparison of CSS layout engines at Wikipedia claims that IE and Opera are ahead of Firefox and Chrome in support for paged media, though I have no idea how up-to-date that is.
I guess that's one way to go about it. I must have misremembered the extent of "Transparent" in GFDL, having confused it with "Corresponding Source" from the GPL. It appears that a "Transparent" copy need not be the ultimate source document that is processed into the copy that is sold, just something machine-readable that preserves the semantics of the original source document. It is a form for making modifications rather than the preferred form for making modifications
"And LaTeX does not handle UTF-8 very well at all." Use XeLaTeX, it comes with the TeXLive distro.
I use DocBook XML, and convert it to XeLaTex (alternatively, to plain LaTeX) using dblatex. But if you don't like XML tags, I suppose you won't like DocBook. (There are programs that effectively hide the tags, like XMLMind.)
Markdown or one of it's derivatives like Multi-Markdown is much easier than LaTex for any writing without formulas. Gruber's Perl script will turn the text into clean HTML. There are several sources that provide the needed XTML templates to turn the HTML into ePub or mobi formats. Add a little CSS and you're golden.
Do not wait for anyone else to do a work that does not benefit them. If you want someone else to give you a professional solution where you only need to ask for features to be considered go for a commercial solution. Thats how it works.
>>Most readers handle PDF, I would just stick to the output of LaTeX
I assume you don't own a e-ink reader. PDF support tends to be poor to nice-but-better-it-have-been-formatted-to-be-printed-in-a-postcard. And most of user preferences cease to work for most PDFs out there, or break badly. When I have to read a PDF, most of times, i read it in my laptop. Which sucks. I would never do the same with a EPUB, a FB2 or any other sane e-book format. It is a printing format, describes presentation in paper; any real ebook format describes content.
PDF is a printing format. Don't let Adobe and their DRM-happy friends convince you it's a web or ebook format. It is not.
There are dozens of converters out there, I would personally recommend LaTeXML (http://dlmf.nist.gov/LaTeXML/) and if that fails try TeX4HT (http://www.tug.org/applications/tex4ht/mn.html)
I think people have already done the work for a full TeX->ePub translation with LaTeXML, but you should drop a line on their mailing list to make sure.
because sheepskin is expensive. and saran wrap is risky.
I have not produced any large books, but the above worked very well for a 40 page book. Create the content and basic text formatting in Word/Text/LibreOffice/etc copy and paste into arranged text areas in InDesign.
It is based on plain text and allows cross linking, references, equations. http://orgmode.org/
Let's see:
Org-mode is like a Swiss army knife. People use it for Getting Things Done (GTD), as a Day Planner, as a Notebook, for Web and PDF Authoring, and much more.
Now why isn't dissertation and thesis there in that list? I wonder. Hmmm. Let me think now....
Microsoft Write then Wordpad for all my assignments. Those are free
No.
Microsoft Write is not free. Just because it comes with the operating system you paid for does not make it free.
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.