California Joins Open Document Bandwagon

Minnesota also by SEWilco · 2007-03-01 03:33 · Score: 4, Informative

Minnesota also is considering open documents.

Re:Minnesota also by bigtomrodney · 2007-03-01 03:35 · Score: 5, Insightful

From TFS
'all documents, including, but not limited to, text, spreadsheets, and presentations, produced by any state agency shall be created, exchanged, and preserved in an open extensible markup language-based, XML-based file format.' The one thing that stands out to me is specifying that it is XML-based. I have no problem with that, the two competing formats are indeed XML based but shouldn't the mandate of the state simply to specify the Open and Cross-Platform aspects?
What happens for instance if tomorrow all of us wonderful Slashdot readers co-developed a magical format that not only was open and cross platform but inexplicably worked with all currently available office suites without modification... ...but it wasn't XML. Does that disqualify it? I would rather see the politics of this issue left with the politicians, the tech issues left with the techies. I've seen other combinations go horribly wrong DMCA

--
I never get used to these constant resurrections
Re:Minnesota also by BlightShadow · 2007-03-01 03:35 · Score: 1

It just makes me feel warm and fuzzy to know that this is happening... now lets see how the government sways which open format is the most widely used.
Re:Minnesota also by Applekid · 2007-03-01 03:42 · Score: 1

What happens for instance if tomorrow all of us wonderful Slashdot readers co-developed a magical format that not only was open and cross platform but inexplicably worked with all currently available office suites without modification... ...but it wasn't XML

Easy. That law would be amended to this new wonderful format... in 2017.

--
More Twoson than Cupertino
Re:Minnesota also by foniksonik · 2007-03-01 04:17 · Score: 4, Interesting

If it is an open XML based format then doing a conversion to whatever new format arises should be trivial (maybe not fast, but fairly easy with XSLT). SO better to put it into XML now and worry about what better format may arise later.

This is good news... why be negative about it?

--
A fool throws a stone into a well and a thousand sages can not remove it.
Re:Minnesota also by Anonymous Coward · 2007-03-01 04:28 · Score: 1, Informative

It's also worth noting that despite all the talk a couple of years ago, Massachusetts isn't.

Take a peak at the upcoming Tax Fairness Bill - but only if you use Microsoft Word, as no other version is provided.

Massachusetts was going to implement an ODF-only policy at the start of 2007. I guess not!
Re:Minnesota also by daeg · 2007-03-01 04:34 · Score: 3, Insightful

If, and when, such a format comes into play, and has a large enough subset of tools available for it, then the laws can be revised. With all documents already in XML, converting to the new format should be nearly painless, and more likely, both formats could be used.

The tech needs to be spelled out clearly in the law, otherwise vendors like Microsoft will be able to say their format qualifies and lobby until enough tech-clueless legislators agree to it.
Re:Minnesota also by jedidiah · 2007-03-01 04:58 · Score: 1

It's somewhat good news. It's certainly a step in the right direction. However these laws still remain flawed and we should certainly strive to have them improved before they become "the law of the land".

The law should make clear the intent without interfering too much in actually carrying out the goal. Since it is a law, every little bit of cruft can have remarkably far reaching effects in terms of unintended consequences.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:Minnesota also by endianx · 2007-03-01 05:03 · Score: 1

Yeah that was my first thought as well. Internet technology moves notoriously fast, and government moves notoriously slow. Let us not try and make the two work together.
Re:Minnesota also by Strudelkugel · 2007-03-01 06:20 · Score: 1

The tech needs to be spelled out clearly in the law
Which regulatory agency will now be responsible for document formats? Are word processing docs, spreadsheets, slide presentations, and simple desktop databases now to be subjected to government regulation? If so, Congress needs to get in the act to establish consistent regulation for the nation. I would guess these regulations will be managed by the FTC, but I am not sure. It will be interesting to see how it works out.
Given that web browsers are more commonly used than office applications, the same regulatory agency should start mandating browser standards as well. The public interest must be preserved. Anything widely used can not be trusted to private enterprise and choices of the average consumer.
(Which will predominate? +1 Insightful? -1 Troll or -1 Overrated?)

--
Imagine how much harder physics would be if electrons had feelings! -Feynman, maybe
Re:Minnesota also by DarkVader · 2007-03-01 06:49 · Score: 1

So the fix for this is to simply prohibit the government from using internet technology?

Keep in mind, these laws don't say anything about what the public should use, they only specify open formats for the government to use.
Re:Minnesota also by Anonymous Coward · 2007-03-01 06:51 · Score: 0

So, if they create a CAD file of blueprints of their new building to send to bidding contractors, it'll have to be XML?
Or if they save a vidoe of their ribbon cutting ceremony, it'll be in XML format?
Re:Minnesota also by DarkVader · 2007-03-01 06:56 · Score: 1

I realize that you were attempting sarcasm, but it fails because the points you are suggesting (regulation of document formats) actually needs to happen for formats that government agencies should use.

We're not talking about individual choice here - we're talking about government agency internal use, which is something that should not be subject to the whims of a few private individuals or a few corporations.

Everything the government does should be in open and accessible formats.
Re:Minnesota also by daeg · 2007-03-01 06:56 · Score: 1

Everything is subject to government regulation when it is used directly by the government. The people have an interest in maintaining the government through themselves through community involvement (free beer) instead of through companies. Companies prove left and right that they are only in it for the profit and nothing else.

And yes, if web browsers were used to produce thousands of publicly-owned documents per year and created them in a locked, proprietary format that was built to attempt to force million-dollar upgrades every 4 years, I would be screaming at the government to switch platforms, too.
Re:Minnesota also by endianx · 2007-03-01 07:00 · Score: 1

Not from "using" technology, but from enacting legislation based on technology that is likely to change. Saying that they have to use an open standard is fine. Specifying what kind of encoding should be used seems unnecessary.
Re:Minnesota also by john_lewmanny · 2007-03-01 07:33 · Score: 1

Vendors like Microsoft are welcome to develop file formats that are "interoperable among diverse internal and external platforms and applications", and if they Fully publish it and make it available royalty-free, we should be nothing against adopting it as a standard if it's "implemented by multiple vendors". The law is supposed to be against vendor lock in, not against Microsoft itself.
Re:Minnesota also by Strudelkugel · 2007-03-01 08:17 · Score: 1

Everything the government does should be in open and accessible formats
Which implies the government can not buy or use anything that has a proprietary element. Correct me if I am wrong, but I don't know of any published specifications for Oracle, DB2 or SQL Server file formats. Therefore the government should not use any of these products.
we're talking about government agency internal use, which is something that should not be subject to the whims of a few private individuals or a few corporations
The reality is that thousands of corporations and millions of users have decided to use Microsoft Office. Many will say Microsoft abused it's market power to make this happen. I don't believe it. Way back when I thought Borland Sprint was a decent word processor, and Wordstar before that. Then I started using Word on the Mac, then Windows. I never liked Wordperfect, Quattro Pro or 1-2-3, so I used more of the Office suite. I did like Lotus Improv better than Excel, but the macros were broken. Paradox and dBASE were not as good as Access, so I used Access instead.
No doubt lots of people will rant about how Wordperfect, 1-2-3 or whatever were better than the equivalent Microsoft product, but for me they were not. I am glad I had a choice as to what to use. The great danger I see in legislating document formats is the risk that it will ultimately undermine choice and innovation. The argument for open documents is compelling as well, but I don't think it supersedes the risk of government intervention as suggested by the parent.

--
Imagine how much harder physics would be if electrons had feelings! -Feynman, maybe
Re:Minnesota also by hey · 2007-03-01 10:08 · Score: 1

Should be "trivial".
Do you want to write that XSLT?
I think "possible" is more like it.
Re:Minnesota also by StikyPad · 2007-03-01 11:26 · Score: 1

I know we're getting lazy, but that's mentioned in the second sentence of the summary.

--
https://www.eff.org/https-everywhere
Re:Minnesota also by SanityInAnarchy · 2007-03-01 12:14 · Score: 1

If it is an open Binary based format than doing a conversion to whatever new format arises should be trivial (maybe not fast, but fairly easy with C). SO better put it into binary now and worry about what better format may arise later.

</sarcasm>

You obviously haven't been following Microsoft's "Open XML" initiative, which shows you just how closed and proprietary an XML spec can be. I dare you to write an XSLT for that to prove me wrong -- I imagine it'll be about as complex as the binary .doc import filters that OpenOffice, AbiWord, and KWord already use.

XML is good and useful, but it does not magically make it "open". Unfortunately, you and these state governments may easily be duped into thinking that "Open XML" is the way to go, because it's XML and claims to be open.

--
Don't thank God, thank a doctor!
Re:Minnesota also by Sam+Ritchie · 2007-03-01 12:24 · Score: 1

No, you can't do a conversion later without first changing THE LAW. Not to mention the fact that no-one will be designing such a format as govt departments will not be permitted to use it. This is pretty wrong-headed IMO - it's like 1960's legislators mandating the use of the 8 bit computers for interoperability.

--
This sig is false.
Re:Minnesota also by Anonymous Coward · 2007-03-01 14:56 · Score: 0
These laws are for government document. Most government have requirement that are difficult to understand and even more difficult to achieve. For example, you may have the requirement to keep a document in perpetuity (forever). Since, perpetuity means more than thousands of years, you need a format that will last a very very long time. This was a problem with paper document because the ink fainted or the paper lost it quality. In this case, the document was basically recopied. Now, think electronic, 200 years from now.
- Do you want to have someone start converting all Wordperfect document (all version), Microsoft Works, Microsoft Word (V1.0, V2.0, V6.0, 97, 2000, XP, OOXML 2006, OOXML 2009, OOXML 2011, MS-OOXML-EXTENTED-2014, etc...), OpenOffice, and more.
- Or, have all document from 2007 to 2207 created in a format that is compatible or easilly converted to the Open Specification 2207.
It is about time that governement agency think about it. Furthermore, it make a lot of sens that Microsoft Proprietary format (or any proprietary format for this mather) are in the departement of NON PERMITTED.
Re:Minnesota also by SEWilco · 2007-03-01 17:16 · Score: 1

The second sentence of the summary did not have the Minnesota links which were in my linked entry. It seemed redundant to copy them here.
Re:Minnesota also by Anonymous Coward · 2007-03-01 17:48 · Score: 0

Microsoft has proven time and time again that they will twist, wiggle, threaten, beg, and or bribe to maintain their monopoly on anything.

That clause is to make sure MS doesn't try to pull a fast one by putting out something that is described as open but is closed, is kinda xml-ish but really binary, and appears to be documented but really isn't.

Besides, being against vendor lock in is almost by definition being against Microsoft.
Re:Minnesota also by Anonymous Coward · 2007-03-01 18:37 · Score: 0

This XML should be nipped in the bud, before it becomes entrenched. And the only way were going to convince these numbnut politicians is to show them this future new format NOW! Leme do a quick number crunch on whats needed.

Obviously, It will need to be extensible, to future proof it from the start. And it will need a syntax, no.. a language.. Yes, a language! One that can be used to markup almost any document format we can think of.

An Extensible Language for Markups. I think we should call it.. ELM! God damn, I'm good!

WAIT!! God Fucking Damnit! Someone already has elm.org. Shit! Maybe it'll expire soon?..

Ahh, NO! Jesus H Christ, I just can't get a break here!

Alright, Ya know? Just forget it! Just.. just forget I ever brought this up. Frigin domain squatters ruin everything.

What does XML have to do with it? by beavis88 · 2007-03-01 03:35 · Score: 3, Insightful

As long as the format meets criteria 1-4, I don't see why it's necessary to specify that it must be XML-based. Keep it simple, and all that...

Re:What does XML have to do with it? by gEvil+(beta) · 2007-03-01 04:00 · Score: 4, Insightful

There's only one reason, and that's because the higher-ups think that EVERYTHING should be XML based. Of course, they have no idea what this actually means. They just know that it needs to have XML in it, because that's what the other guys are doing.

This reminds me of my boss, who keeps saying that we need to publish things in XML, but can't give me any reason why we should. Then again, two years ago I kept on hearing about how our company needed a blog, again with no justification as to how it would help us. Thankfully, that passed. Eventually, the XML thing will, too. Of course, this isn't meant to belittle the things out there that actually can benefit from utilizing an XML format.

--
This guy's the limit!
Re:What does XML have to do with it? by foniksonik · 2007-03-01 04:14 · Score: 1

XML is just an acronym for Extensible Markup Language. SGML and HTML are subsets along with many other MLs out there. What is important is the Doctype Definition. That is what specifies how the markup is implemented. XML could be anything you want it to be as proven by MSXML.

That being said, XML or a well-defined subset, is a great choice for text documents. It's not the best choice for lots of other structured data but for text that is meant to be human readable in it's raw format, it's great.

--
A fool throws a stone into a well and a thousand sages can not remove it.
Re:What does XML have to do with it? by itlurksbeneath · 2007-03-01 04:33 · Score: 1

Nice try, but XML and HTML are subsets of SGML. See the abstract in the spec document.

--
Have you ever considered piracy? You'd make a wonderful Dread Pirate Roberts.
Re:What does XML have to do with it? by T-Ranger · 2007-03-01 04:34 · Score: 1

If you think that both SGML and HTML are subsets of XML, its time to go back to set theory class.
Re:What does XML have to do with it? by beavis88 · 2007-03-01 04:35 · Score: 1

"Great choice", yes, no arguments there; "The one single choice as laid down by legislation", no.
Re:What does XML have to do with it? by Eccles · 2007-03-01 09:40 · Score: 1

Wouldn't it being XML allow for easy extension for markup or other purposes, and relatively easy differencing between file versions? I've never looked at the internals of PDF, but I'm guessing it's not particularly amenable to either.

--
Ooh, a sarcasm detector. Oh, that's a real useful invention.

Wtf? Why "XML-based", specifically? by Lord+Bitman · 2007-03-01 03:36 · Score: 4, Insightful

Why not just require the format to be in ANY published standard format? "XML" by itself is meaningless, "extensible" is a loaded term (and a very bad idea when trying to write a way to keep things compatible). Why do lawmakers always have to over-specify things until the purpose of the law is lost?

--
-- 'The' Lord and Master Bitman On High, Master Of All

Re:Wtf? Why "XML-based", specifically? by beavis88 · 2007-03-01 03:39 · Score: 3, Insightful

Why do lawmakers always have to over-specify things until the purpose of the law is lost?

Job security. If they wrote clear, concise, and sharply targeted bills, we wouldn't need to keep electing a fresh crop to fix the mess left by the last one.
Re:Wtf? Why "XML-based", specifically? by TedSandwich · 2007-03-01 03:54 · Score: 1

I agree. PDF is an published format, are they not going to use that because it isn't XML?

--
"Two things are infinite: the universe and human stupidity; and I'm not sure about the universe." -- Albert Einstein
Re:Wtf? Why "XML-based", specifically? by Anonymous Coward · 2007-03-01 04:01 · Score: 0

It's called the, "reasonable person" standard. The law has to be unequivocable or else it will be equivocated into oblivion at the taxpayer expense to prevent confusing the theoretical "average reasonable person" with its vagueries. Either that or the lawmakers in question are just unaware of the existence of other standard published formats.
Re:Wtf? Why "XML-based", specifically? by drix · 2007-03-01 05:40 · Score: 1

All this carping about the use of XML is really asinine. I mean hey, it's Slashdot, this site is nothing without a million of you, but still... yours is the third post I have read appeals to this mystical competing format, as if the law is some form of lock-in. Can you even name another "published standard format" that fits the bill? I doubt it, because there really aren't any. Nothing else approaches XML in terms of widespread adoption and mature, pre-existing libraries for every platform in existence. Remember, XML is a container format. We're not talking about how to structure the actual document, which CA wisely left an open question. XML was built for exactly this type of application, it works, everyone uses it, it's free and open, and I have a hard time seeing it becoming obsolete for a long time. So why are we complaining again?

--

I think there is a world market for maybe five personal web logs.
Re:Wtf? Why "XML-based", specifically? by Anonymous Coward · 2007-03-01 06:30 · Score: 0

mod up please!
Re:Wtf? Why "XML-based", specifically? by Lord+Bitman · 2007-03-01 07:32 · Score: 1

Because of the formats that /don't exist yet/.
If you think anything a government office writes down about "how they're going to do things" won't lock them in long after it makes sense, you've never worked with any government data, probably have never worked with any legacy data of any sort.

--
-- 'The' Lord and Master Bitman On High, Master Of All
Re:Wtf? Why "XML-based", specifically? by drix · 2007-03-01 08:18 · Score: 1

Theheh... god I just love this site sometimes... the ol' /. syllogism. "I disagree with you. People who disagree with me are stupid. Therefore, you are stupid." Actually, I as a researcher in the social sciences, I work with tons of government data. Gobs and gobs of it. Much of it from the 70s, 80s and 90s, so I guess that might qualify as legacy too. And you know what? I would give my left arm if they had been all done in in well-formed XML, as opposed to fixed-column tape archives with no typing, often times not even a column layout to go by. Boy, I drool at the mere thought of it. And let me tell you, anytime a government office actually bothers to write down how it's going to do things, that, my friend, is a victory in and of itself. One wonders if you've actually worked with and of these data.

So, can you suggest any tangible way in which XML is a bad choice? Or are you just going to stick with that handwave about our grandchildrens' formats? And before you answer, I want you to dwell for a bit on the fact that XML was specifically created to handle this sort of problem. It supports Unicode, so there are no issues with I18N. It's machine and human readable, so there will never be a case of someone being confronted with a mystery bit stream in 30 years' time, a situation I myself have personally experienced. It's self-documenting, which is almost worth the price of admission all by itself. It's very easy to validate.

I'm not saying XML is the be-all, end-all of formats. It's pretty heavy for mobile apps, for example. But the whole issue of closed document standards that weren't interoperable was the original impetus for the creation of the standard. Having read most of the posts in this story, I still have yet to see a coherent argument for why we shouldn't be using it for what it was intended for.

--

I think there is a world market for maybe five personal web logs.
Re:Wtf? Why "XML-based", specifically? by Anonymous Coward · 2007-03-01 08:59 · Score: 0

Why not just require the format to be in ANY published standard format? "XML" by itself is meaningless, "extensible" is a loaded term...
That's odd, because I occasionally peruse a book-long definition of XML that seems quite meaningful in very specific ways to both myself and others as we mess around with XML derivatives. And I've been involved in the evolution of DTDs, etc, where the language's extensibility was a very concrete, deterministic thing, not at all "loaded" in any sense of that usage that I am aware of.
Maybe those of us who've been working in XML are just deluding ourselves, and all the stuff we've been building is just a pile of crap that only works by coincidence.
PUTTING SNIDENESS aside, the development of SGML and XML is the first ever widely adopted effort to construct a universal language for human and machine usage. So far, it is proving to be a mostly successful effort. Enough so that it is now evident that any new and improved replacement we might come up with will have to provide an effective method of conversion of the existing library of XML and its relatives.
Re:Wtf? Why "XML-based", specifically? by fritsd · 2007-03-01 12:21 · Score: 1

You can't edit PDF. So in a dialogue with your government (say, chamber of commerce form to fill in), it's much much easier if you can download the form from the organization, fill it in, cryptographically sign it, and e-mail it back (either encrypted with the organization's public key or in the clear).
It's not rocket science, but it seems it's difficult to understand for some organizations (I'm looking at you, Dutch KvK!).

--
To be, or not to be: isn't that quite logical, Slashdot Beta?
Re:Wtf? Why "XML-based", specifically? by Lord+Bitman · 2007-03-02 11:48 · Score: 1

If you're looking for reasons to not use XML, look elsewhere. I am not saying that. I'm sure somebody is, but that's no what my post is about.
Use XML. XML is quite decent for small things which would be considered "documents" instead of a loose use of the term "database'. Which is what this law is about. There is nothing wrong with XML for that.

There /is/ something wrong with codifying "Use XML" to mean "use open standards".

I don't know if you've just been ignoring my arguments because you found them harder to attack than the made-up one of "xml is a bad choice", or just honestly didn't catch them (forgive me for assuming you had no idea what you were talking about simply because you seemed to be replying to a post with no knowledge of what /it/ was talking about), but hopefully this reply will clear that up.

--
-- 'The' Lord and Master Bitman On High, Master Of All
Re:Wtf? Why "XML-based", specifically? by Lord+Bitman · 2007-03-02 11:52 · Score: 1

I don't know which side of yourself you're on.
Are you saying [falsely, and as a negative] that you can't download a PDF form, fill it out, save it, sign it, and e-mail it back, making it non-suitable for such tasks
or are you saying [falsely, and as a positive] that you can't alter a PDF other than the form elements specifically noted to be alterable, making it suitable for such tasks?

--
-- 'The' Lord and Master Bitman On High, Master Of All
Re:Wtf? Why "XML-based", specifically? by fritsd · 2007-03-03 03:23 · Score: 1

I'm saying the first option, that I can't download a PDF form, and *fill it out*, and send the altered PDF back.
I know that there is a procedure using pdftk to generate an FDF file, but that's so user-unfriendly that I don't think a company would ask its clients to use it :-)
I've only ever been able to download a PDF form, print it out, fill it out on paper, and send it back via snail-mail.
So, please enlighten me why PDF is suddenly not an "output-only" format, and which programs can be used to fill one in?

--
To be, or not to be: isn't that quite logical, Slashdot Beta?
Re:Wtf? Why "XML-based", specifically? by Lord+Bitman · 2007-03-03 05:04 · Score: 1

Adobe Acrobat (the full version, not the free one) allows you to not only fill out the fields (as is possible with the free version), but also click "save" to have the field values be saved.

"There are no good programs which have anything to do with PDF" is a valid statement, but it's not a limitation of the PDF format.

--
-- 'The' Lord and Master Bitman On High, Master Of All

Dominoes by kensai · 2007-03-01 03:36 · Score: 1

The dominoes are beginning to fall.

Re:Dominoes by RichMan · 2007-03-01 03:44 · Score: 2, Funny

> The dominoes are beginning to fall.

Are they being hit by flying chairs, perchance?
Re:Dominoes by Anonymous Coward · 2007-03-01 04:06 · Score: 0

...like a house of cards. Checkmate!
Re:Dominoes by Anonymous Coward · 2007-03-01 04:50 · Score: 0

*YAWN*. Been hearing that noise for 10 years+ guys. Come back when they've actually fallen.

Text in XML? by The+Monster · 2007-03-01 03:37 · Score: 2, Insightful

...text, spreadsheets, and presentations, produced by any state agency shall be created, exchanged, and preserved in an open extensible markup language-based, XML-based file format.

Why would text need to be in an XML-based format, when it can be in a... text format? If you have a text document that doesn't require any formatting, just make it text!

--

[100% ISO 646 Compliant]
SVM, ERGO MONSTRO.

Re:Text in XML? by McNihil · 2007-03-01 03:54 · Score: 1

Maybe it is to make sure that the format is not a binary one. One can read XML without special filters if push comes to shove. It should add provisions to how documents should be compressed though. Gnuzip maybe?
Re:Text in XML? by hey! · 2007-03-01 04:03 · Score: 4, Interesting

Well, obviously it doesn't need to be xml, but XML does have one nice self-documentation property that plain text lacks: the character encoding.

If you've looked at project gutenberg texts, you can see why this is a problem. Not a huge problem, but a problem. When a source text has a non-ascii character in it, they have to put some sequence of ascii characters which will suggest what the glyph is supposed to be. This doesn't really preserve the information in the source document, nor does it make the document easy to read.

So, you could have a trivial text XML format that has only one defined tag. It's still useful:
<xml version="1.0"? encoding="us-ascii">
<text>
This is my text. It has no wacky glyphs so ASCII is fine.
</text>

vs.

<?xml version="1.0"? encoding="utf8">
<text>
This is my text. It has wacky glyphs therefôre ascii sücks for it!
</text>

--
Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
Re:Text in XML? by gstoddart · 2007-03-01 07:01 · Score: 1

Why would text need to be in an XML-based format, when it can be in a... text format? If you have a text document that doesn't require any formatting, just make it text!

Well, XML also offers the ability to have a physical structure to your document, the ability to define new structures (ie, schema), and it's hopefully available in a vendor neutral format.

If you needed to have structured objects in a vendor-neutral format, you might need to come up with your own, or go with a binary structure, or something else.

One (possibly small) reason for using XML in this context is specifically because the technology to have a hierarchical structure comes for free. XML also provides you with a native expectation of support for Unicode.

XML isn't the be-all and end-all, but, when you're trying to settle on an open document format, XML goes a long way to helping you do it with established technologies. I can't think of another document format which would have as much too support as exists for XML.

Cheers

--
Lost at C:>. Found at C.
Re:Text in XML? by Chandon+Seldon · 2007-03-01 08:57 · Score: 1

Or you could just define text as "utf8 text" and you'll largely be fine.

--
-- The act of censorship is always worse than whatever is being censored. Always.
Re:Text in XML? by hey! · 2007-03-01 10:08 · Score: 1

Or you could just define text as "utf8 text" and you'll largely be fine.

How?

How are you going to "define it" as UTF-8?

In the file name?

Bad idea. The first time a user redefines the document from importantHistoricalDoc.UTF8.txt to ABMTREATY.TXT, you've lost your metadata. For that matter, you shouldn't assume that data always resides in files. It may be a "database" "record". It may be an chunk floating in some kind of futuristic data soup we haven't even conceived of yet.

In some kind of OS metadata record (e.g. mime type on your web server)? That stuff tends to get lost when you copy data between operating systems.

In the file? OK, how do you make sure the processing program that reads the docuemnt in fifty years understands your peculiar way of separating data from metadata. Suppose it sees this:
UTF8

TREATY BETWEEN THE UNITED STATES OF AMERICA AND THE UNION OF SOVIET SOCIALIST REPUBLICS ON THE LIMITATION OF ANTI-BALLISTIC MISSILE SYSTEMS

How does it know that UTF8 isn't part of the text? Maybe it's some kind of treaty organization, or the login account of the person who typed the document.

Well, OK, let's get everybody in the world to agree that every text file should start with the encoding, followed by CRLFCRLF. Riiight.

So
METADATA STARTS HERE
TITLE: TREATY BETWEEN THE UNITED STATES OF AMERICA AND THE UNION OF SOVIET SOCIALIST REPUBLICS ON THE LIMITATION OF ANTI-BALLISTIC MISSILE SYSTEMS
AUTHOR: State Deparment of the United States, Foreign Ministory of the Union of Soviet Socialist Republics
Signed at Moscow May 26, 1972
Ratification advised by U.S. Senate August 3, 1972
Ratified by U.S. President September 30, 1972
Proclaimed by U.S. President October 3, 1972
Instruments of ratification exchanged October 3, 1972
Entered into force October 3, 1972
Character Encoding: UTF-8 (variable length Unicode encoding upwardly compatible from US ASCII; see RFC 3629)

METADATA ENDS HERE DOCUMENT STARTS BELOW THIS LINE

In the Treaty on the Limitation of Anti-Ballistic Missile Systems the United States and the Soviet Union agree that each may have only two ABM deployment areas,1 so restricted and so located that they cannot provide a nationwide ABM defense or become the basis for developing one. Each country thus leaves unchallenged the penetration capability of the others retaliatory missile forces. ...

Yep. This is not bad. Sure, it smacks the future user in the face with a lot of data before he gets to what he wants. Programs may from time to time trip up on searchs, turning this up because they don't know where the metadata starts and the text begins, because it's something you pulled out of your ass (e.g. somebody searching for "UTF-8" will find lots of documents with metadata "encoded" this way"). But overall it's not that bad.

Now, how do you make sure that all your documents have this data? Well, you have to define it as a file format, design your programs to save with this data and parsers to ensure that it is correct.

When you're done, you have all the complexity of the parts of XML you need, but none of the standardization.

Nobody hates XML-mania as much as I do. XML to carry data in an on-wire protocol? Madness. XML to implement a simple key/value configuration file? Ridiculous overkill. XML to create object descriptions that are translated into a programming language (gad, I've seen this and it made me wretch). Retarded.

XML to structure a text centric document? Excellent solution.

--
Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
Re:Text in XML? by Chandon+Seldon · 2007-03-01 12:00 · Score: 1

No. You don't do any of that crap.
You simply store all of your text as UTF8, and assume that all text you get from anyone else is UTF8 unless someone tells you otherwise. It's not like people are going to be using EBCDIC. UTF8 is reasonably standard at this point - if you see a text file, it's a safe assumption that it's UTF8.

--
-- The act of censorship is always worse than whatever is being censored. Always.
Re:Text in XML? by SanityInAnarchy · 2007-03-01 12:19 · Score: 1

How are you going to "define it" as UTF-8?

By it being a text document. Generally speaking, text today is either ASCII or UTF-8, and UTF-8 is a superset of ASCII. So just read everything as UTF-8, and add useless XML crap if you really, really need to support something strange like UTF-16.

--
Don't thank God, thank a doctor!
Re:Text in XML? by CandyMan · 2007-03-01 12:37 · Score: 1

Except that most of Europe uses ISO-8859-something (or ISO-8859-1something), and many, many people still encode their documents with windows codepage 1250 (number pulled out of ass, mibht be a different one) or something similar.

I agree with you that UTF-8 sucks the less of any format for non-ascii latin characters, with the added benefit that it also does non-latin, but specifying it somewhere is really helpful. If only the ISO Latin-1something people and the windows codepage something people were equally nice about it.

--
http://barrapunto.com/ - News for nerds, en español
Re:Text in XML? by ChrisFedak · 2007-03-02 09:26 · Score: 1

That runs into regional issues. Japan, for instance, is not happy with the assumptions made by unicode, and have their own preferred encoding. Agreeing on a standard doctype specification (at the very least) resolves many of these sorts of issues.
Re:Text in XML? by Chandon+Seldon · 2007-03-02 09:57 · Score: 1

Agreeing on a standard doctype specification (at the very least) resolves many of these sorts of issues.

Well... except for the fact that it makes storing flat text files impossible. And it requires programmers to handle character sets everywhere (which history shows doesn't happen). After having to deal with a bunch of these character set issues, I have to say that assuming a UTF8 default is a lot more likely to actually work than trying to get character set specified everywhere.
There's an argument for UTF16 - it reduces the storage size of Chinese, Japanese, Korean, etc text by a third. A character set like Shift_JIS just isn't a very good deal - foreign letters do occur, and it's useful to be able to represent them in text.

--
-- The act of censorship is always worse than whatever is being censored. Always.
Re:Text in XML? by Deven · 2007-03-02 12:15 · Score: 1

In the file? OK, how do you make sure the processing program that reads the docuemnt in fifty years understands your peculiar way of separating data from metadata.

If you really feel the need to make the data self-identifying as UTF-8, you can use the Byte Order Mark for that purpose.

--

Deven
"Simple things should be simple, and complex things should be possible." - Alan Kay

You must not have got the memo by grahamsz · 2007-03-01 03:40 · Score: 1

XML is the future. It's the perfect format for any kind of data.

The draw of a markup langauge for documents is that you can print out the raw file and even a lay person can read it just by ignoring the markup tags. Even without knowing anything about xml, I could inspect the file format and write an XML to Text converter in about 1 line of perl.

Re:You must not have got the memo by maxume · 2007-03-01 03:50 · Score: 3, Funny

I can write a text to text converter in about *0 lines* of perl.

--
Nerd rage is the funniest rage.
Re:You must not have got the memo by UncleTogie · 2007-03-01 03:57 · Score: 1

...."Name that tune!"

{note to self: I *gotta* stop watching GSN.}

--
Don't tell me to get a life. I'm a gamer; I have LOTS of lives!
Re:You must not have got the memo by Anonymous Coward · 2007-03-01 03:58 · Score: 3, Funny

I can write a text to text converter in about *0 lines* of perl.

Which truly is the perfect amount of Perl to ever write.
Re:You must not have got the memo by Coryoth · 2007-03-01 04:51 · Score: 1

I can write a text to text converter in about *0 lines* of perl. Well sure, but you're assuming an encoding standard for the text. If the output text has to be EBCDIC I doubt you'll manage it in zero lines. Given the way things are going, assuming XML as a markup for structured files is no worse than assuming ASCII as the encoding for text files. It doesn't have to be XML, but then a text file doesn't have to be ASCII.

--
Craft Beer Programming T-shirts
Re:You must not have got the memo by zippthorne · 2007-03-01 05:55 · Score: 1

Can't you write any Perl program in only one line?

--
Can you be Even More Awesome?!

<xml>we should do this too</xml> by wwwillem · 2007-03-01 03:41 · Score: 4, Funny

<user="wwwillem">
<subject>we should do this too</subject>
<content>
What is good for government documents is also good for Slashdot posts. :-)
</content>
</xml>

--
Browsers shouldn't have a back button!! It's all about going forward...

Anything can parse XML. by Anonymous Coward · 2007-03-01 03:43 · Score: 1, Insightful

Anything from .Net to Perl can already parse XML.

What type of formats will qualify? by tttonyyy · 2007-03-01 03:44 · Score: 4, Funny

Format is irrelevant - since these documents will contain legal-speak, they'll be unreadable anyway. ;)

--
biopowered.co.uk - catalytically cracking triglycerides for home automotive use since 2008. Just say no to big oil!

Re:What type of formats will qualify? by OldeTimeGeek · 2007-03-01 04:25 · Score: 1

This isn't just the Legislature, it's all of the State Agencies.
Which is why it's largely useless.
Putting aside people's feelings about Microsoft or ODF, realistically, it ain't gonna happen.
California State agencies, up until now, haven't had to follow any standard document format. Although most use Microsoft Word, the California CMAS master contract that State Agencies use to buy their software has Word, WordPerfect and other word processing packages on it. Which means that agencies can pick and choose what they think that they need. Now the Legislature wants to set one - and you can bet that there's no money in the bill to do it. If the bill is passed, agencies will need to convert already existing processes and applications to use the new format. With no new staff nor new money to hire someone else to do it.
Yeah, it'll happen.
Re:What type of formats will qualify? by 99BottlesOfBeerInMyF · 2007-03-01 05:33 · Score: 1

Although most use Microsoft Word, the California CMAS master contract that State Agencies use to buy their software has Word, WordPerfect and other word processing packages on it. Which means that agencies can pick and choose what they think that they need. Now the Legislature wants to set one..
They are proposing to set one format, not one word processor. Agencies can still choose among software packages, just with noncompliant packages removed from the bidding. I think there are a dozen products now with ODF support.

If the bill is passed, agencies will need to convert already existing processes and applications to use the new format. With no new staff nor new money to hire someone else to do it.
There are now free software solutions that read both MSWord and ODF files, including Google's Web based offering. When departments start budgeting for next year they will have to take these rules into account and it will probably end up saving them money from their budgets within a few years.
Re:What type of formats will qualify? by mysticgoat · 2007-03-01 09:27 · Score: 1

There are now free software solutions that read both MSWord and ODF files
Better than that, AIR, there was news on slashdot yesterday that Sun has produced a free plug-in for MS Office 2003 that allows it to read and write ODF. So companies that are using MS Office now can simply stay with it, and meet any government ODF requirements.
I doubt that MS will develop an ODF plugin: that would be out of character. I also doubt that anyone is going to rush to make an ODF plugin for MS Office 2007: there wouldn't seem to be any motivation to do that.

XML panacea by Gothmolly · 2007-03-01 03:44 · Score: 4, Funny

N00b: Hey we have this data representation problem, we'll use XML!
Greybeard: Son, now you have two problems.

--
I want to delete my account but Slashdot doesn't allow it.

Re:XML panacea by mrchaotica · 2007-03-01 04:33 · Score: 1

I keep seeing that quote posted, but what exactly are those two problems?

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:XML panacea by greginnj · 2007-03-01 04:45 · Score: 1

The joke whooshing over your head is that the two problems are : the N00b's original problem, plus the second new problem of having to deal with XML.

--
Read the best of all of Slash: seenonslash.com
Re:XML panacea by mrchaotica · 2007-03-01 04:53 · Score: 1

You're right that it went (and still is) whooshing over my head, because I don't understand why "having to deal with XML" is a problem.

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:XML panacea by ratboy666 · 2007-03-01 06:02 · Score: 2, Informative

From a greybeard

XML Problem Checklist:

- Ensure the library functions are flexible enough for the application, and do not consume too many resources. This can be an issue for smaller systems, and may be an issue for larger ones as well.

- Ensure that the XML implementation is usable on a security basis. Should not have buffer overruns - parsing problems etc.

- Ensure that the XML works with ANOTHER XML implementation. Adds testing (its not just "someone elses problem").

- Application data structures are influenced by XML (eg. XML does not represent bitfields/bitsets directly). Mappings must be reasonable.

- XML should be 'vetted so that the format doesn't look like it was laid out by 'noobs. Stuff like the Microsoft specification for XML WORD shouldn't be generated.

- Because the only reason that XML formats are being used is for interoperability, a separate set of interoperability documents must be maintained. At some point, someone will ask the question "You use XML, I want to stick the data into ... That should be easy (because of XML). Help me."?

--
Just another "Cubible(sic) Joe" 2 17 3061
Re:XML panacea by greginnj · 2007-03-01 07:04 · Score: 1

I agree, on XML, but your original post was that you 'keep seeeing' that quote, and was wondering what it meant. The original used regular expressions as the issue rather than XML. The phrase itself seems to have become a meme.

Your original post implied (to me at least) that you were missing the point of the meme itself (if I'm misunderestimating you, apologies), especially since I've never seen it applied to XML before. (Agreed that the validity of the meme for both XML and regexps is at least debatable.) I understand it as saying that any problem complex enough to require mulling over before you even decide on a solution vector of regexps|XML|neural networks|genetic algorithms|laser-toting killbots is likely to have problems with the implementation of the solution as well...

--
Read the best of all of Slash: seenonslash.com
Re:XML panacea by nasch · 2007-03-01 07:22 · Score: 1

Other than (maybe) the last one, don't all of those considerations apply to any data representation? For example, it's not OK to have buffer overruns and parsing problems if you're using some non-XML format.
Re:XML panacea by Anonymous Coward · 2007-03-01 07:39 · Score: 0

Other than (maybe) the last one, don't all of those considerations apply to any data representation? For example, it's not OK to have buffer overruns and parsing problems if you're using some non-XML format.

One point is that XML forces you to always deal with these issues.

For example, IP manages to be portable when sending an unsigned 32-bit value over the net by using a fixed size, fixed format representation that by definition can't overflow or be parsed incorrectly. A textual representation of a numeric value doesn't have the same properties.
Re:XML panacea by Anonymous Coward · 2007-03-01 07:44 · Score: 0

Parent "greybeard" must speak from experience, because he knows so well that a custom created file format suffers none of these problems, and are in fact the panacea of interoperability, security, and stability.

Why is the victim silent? by 140Mandak262Jamuna · 2007-03-01 03:44 · Score: 2, Insightful

Well why corporate America is so silent on this issue? After all almost all the revenue of MSFT comes from corporations. If the palying field becomes level and multiple vendors compete to serve them while being fully inter-operatble, it will be the corporate America that will benefit most. Technically they are the victim of the monopolistic deeds of MSFT. Still they remain silent, and the Govenment, after protecting the citizens from their own stupidity (seat belts, airbags, spacing between crib railings) now comes to rescue corporate America?

If Government intervention is what it takes to force a level playing field, I will accept it. But still I would prefer it if market forces create a level playing field instead of government mandates.

--
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact

Re:Why is the victim silent? by Stewie241 · 2007-03-01 04:02 · Score: 1

This isn't a government mandate per se... the government isn't mandating corporations and businesses to interact with each other using an open format. It is only saying that if the state produces a document, it has to be in an open format.

This is the same as say Walmart saying that all of its documents had to be produced in an open format - this is an internal policy more than a government mandate.
Re:Why is the victim silent? by 99BottlesOfBeerInMyF · 2007-03-01 04:59 · Score: 1

If the palying[sic] field becomes level and multiple vendors compete to serve them while being fully inter-operatble, it will be the corporate America that will benefit most. Technically they are the victim of the monopolistic deeds of MSFT. Still they remain silent, and the Govenment, after protecting the citizens from their own stupidity (seat belts, airbags, spacing between crib railings) now comes to rescue corporate America?
Corporate america is not a single entity that acts in a unified fashion. It is a horde of competing companies. Many of those companies probably know that it is in the best interests of corporate america as a whole to switch to open standards, but that does not make it in the best interests of any one company to switch until the others have done so. Aside from that, they can only ask for action through the government.

If Government intervention is what it takes to force a level playing field, I will accept it. But still I would prefer it if market forces create a level playing field instead of government mandates.
It is important to note, that the government is not mandating open standards for corporate america, only for the government themselves who can afford to act in more long term ways. As for "market forces" the point of a monopoly is significant influence in a market to bypass market forces. You can never rely upon market forces to correct a monopoly.
Re:Why is the victim silent? by aero6dof · 2007-03-01 05:23 · Score: 1

Well why corporate America is so silent on this issue?

Because our market based system isn't as good as delivering efficiency as proponents make it out to be. Making money != delivering value to society. Many times the corpoarat profit motive does deliver social value, and often more efficiently then other comparable systems, but its not equivalent and has many disconnects. (see U.S. heathcare for example)

Re:we should do this too by wwwillem · 2007-03-01 03:47 · Score: 1

We should set up a little competition: who can type a first post in xml. Remember, you have to type in all these < and > (which I now had to type as "<" to show the ampersand ... this becomes a recursive nightmare :-).

--
Browsers shouldn't have a back button!! It's all about going forward...

Criteria n3 by DrYak · 2007-03-01 03:57 · Score: 4, Funny

As long as the format meets criteria 1-4

In other news, Microsoft is quickly subsidizing 3 small companies to write quick and meaningless stupid plug-ins using OOXML as input, just to pretend that their format is "Implemented by multiple vendors" and on "diverse (...) platforms" (ie.: Windows 98, Windows ME, Windows 2000, Windows XP *and* Windows Vista)...

--
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]

Re:Criteria n3 by Coryoth · 2007-03-01 04:45 · Score: 2, Interesting

In other news, Microsoft is quickly subsidizing 3 small companies to write quick and meaningless stupid plug-ins using OOXML as input, just to pretend that their format is "Implemented by multiple vendors" and on "diverse (...) platforms" (ie.: Windows 98, Windows ME, Windows 2000, Windows XP *and* Windows Vista)... I think MS may actually have already squeezed through this particular hole. The reality is that other vendors are going to have to be able to read and write OOXML at a basic level for compatability. They'll never fully implement the standard that MS has written because they can't possibly implement the behaviour of the <SpaceLikeWord95> tag and all the others like it, but they'll have something. If you demand a "full" implementation from multiple vendors you're just digging yourself a different hole: nothing will qualify. Honestly, try reading and writing .odf files in a few different products (OpenOffice.org, KOffice, AbiWord, Writely, etc.) and you'll find the formatting gets noticeably messed around. Fully implementing ODF might be more feasible than fully implementing OOXML, but that doesn't mean it will actually happen. It seems MS has managed an effective pre-emptive strike against such legislation - they've created a "standard" that is just open enough to qualify, but sufficiently obscured that no one else can actually claim full compatability - the kicker being that full compatability is unlikely with any sufficiently complicated standard (as standard for office documents are bound to be) so you can't specify that as a criterion.

--
Craft Beer Programming T-shirts
Re:Criteria n3 by kennygraham · 2007-03-01 05:24 · Score: 2, Insightful

That will meet 1-3, but
(4) Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.
Microsoft still isn't an open industry organization, they're one company. I think #4 is the most important part.
Re:Criteria n3 by man_of_mr_e · 2007-03-01 06:14 · Score: 2, Insightful

I think you make a disingenuous argument. By definition, no office suite can fully implement any other office suites interoperability unless suite b is a complete superset of suite a's features, regardless of document type.

By way of example, let's take something like KOffice. It seems unlikely that KOffice is a complete superset of OpenOffice, therefore even if both OOo and KOffice implement ODF, KOffice can never be completely interoperable with OOo (at least OOo -> Koffice). Further, if KOffice implements any features that OOo doesn't have, then the same is true in reverse.

This argument is a red herring. No, no office suite can implement every feature of OOXML, because OOXML is, by definition, a direct map of Office functionality, but even if Office used ODF, those same features would have to be represented in ODF somehow, and those same suites would still not be able to implement them.

--
If you need web hosting, you could do worse than here
Re:Criteria n3 by Coryoth · 2007-03-01 06:45 · Score: 1

I think you make a disingenuous argument. Well, I'm making the deliberately disingenuous argument that I expect Microsoft to make. The simple answer is that not fully implementing the standard is, as you say, pretty much inevitable. That means that the partial implementations of OOXML provided by OpenOffice.org and others can be considered as on the same lavel as the various implementations of ODF - at least as far as the wording of the legislation is concerned. That means, from the point of view of the legislation, it can be argued that ODF and OOXML are equivalent in terms of interoperability and multiple implementations. Sure its a disingenuous argument that ignores the spirit of the legislation, but then when in the past has MS been unwilling to make disingenuous arguments and hew to the letter rather than the spirit of the law?

--
Craft Beer Programming T-shirts

Called my rep by inverselimit · 2007-03-01 03:58 · Score: 3, Informative

Just called my CA Assembly rep to ask them to support the bill. Look yours up here.
It may not be perfect, but is a move in the right direction.

Re:Called my rep by inverselimit · 2007-03-01 04:08 · Score: 2, Informative

There is also an online form for comment here.

Re:we should do this too by saskboy · 2007-03-01 03:59 · Score: 1

I guess "plain" text seemed too plain for the legislators. Better to make a law with a hard coded computer language, than a flexible variable.

--
Saskboy's blog is good. 9 out of 10 dentists agree.

Does MS's format qualify? by danpsmith · 2007-03-01 04:01 · Score: 2, Insightful

MS made a format that fits the very definition of what they said will be required in this bill. Is this bill just going to lead to government organizations upgrading to the new Office? Technically, all of these things apply even if the implementation of the "standard" will later be forked by MS with their extend and extinguish model. In short, does this really mean truly open formats will get a boost? Or that MS's new format will seem like the solution to a problem they have practically invented?

--
Judges and senates have been bought for gold; Esteem and love were never to be sold.

Re:Does MS's format qualify? by Dan+Ost · 2007-03-01 04:38 · Score: 1

Microsofts format fails the 4th criterion:

(4) Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.

--

*sigh* back to work...
Re:Does MS's format qualify? by Ngwenya · 2007-03-01 05:02 · Score: 1

Doesn't the requirement state that multiple implementations must exist? As far as I know, only Office fully supports OOXML [though I did hear about someone else attempting an implementation, but I could be wrong].

If there is such a requirement, then MS's offering would not qualify. However, if they selected ODF, and Office supported ODF properly, then the government offices might very well upgrade to the next Office. That would be a matter for their local IT decision makers.

But the software used must satisfy the business requirements; that's not just for office documents, it should be a general law of software and hardware procurement.

--Ng
Re:Does MS's format qualify? by a1mint · 2007-03-01 06:39 · Score: 0

I've read that Microsoft's "open format" is unbelievably convoluted and can only be used by MS software.
Re:Does MS's format qualify? by Anonymous Coward · 2007-03-01 19:01 · Score: 0

Your first question: Does Microsoft qualify? No. Microsoft did not make anything that will fit the definition specified in this bill. They want multiple vendors (saying microsoft has multiple platforms doesn't count, microsoft is one vendor. One. One isn't many. ODF is available on multiple vendors, OOXML is not. Secondly, they want an something thats an international standard. ODF is ISO26300. Microsofts OOXML has no ISO number, because it failed to pass the standard. So Microsoft fails in several areas to meed the specs for this bill, and their format fails to meet international standards approval because it fails to meet several requirements for approval. Worse, the ODF spec went through 4 major versions over 4 years with hundreds of end users and was submitted to ISO as a lean 300 pages. Microsoft raced quick and hard with their spec, 4 months, 1 version, no user feedback, and the spec they submitted to ISO is over 5000 pages long. ISO rejected it.

Woohoo, ODF soon to be here by guruevi · 2007-03-01 04:02 · Score: 2, Interesting

I think the only document format that would qualify is ODF (by OASIS). It's the only well known document format, based on XML and extensible, open and implemented by different vendors and office suites.

--
Custom electronics and digital signage for your business: www.evcircuits.com

Re:Woohoo, ODF soon to be here by Lord+Bitman · 2007-03-01 07:43 · Score: 1

Yes, specifying "XML" is an obvious (and pointless) bias towards using ODF.

--
-- 'The' Lord and Master Bitman On High, Master Of All

American legislation 101 by bornbitter · 2007-03-01 04:02 · Score: 2, Interesting

"Why do lawmakers always have to over-specify things until the purpose of the law is lost?"

This is over-simplified, but here goes... American laws are made in sub-committees of committees of the legislative body. The committees are packed with 'specialized' delegates, i.e. someone with a political stake or in the pocket of a special interest group, (like Microsoft, OSDL, or Green Peace). ...of course that is the federal process, and the states vary in their organization, but it is mostly the same. It all depends on how the states have drawn-up the rules for their specific legislature.
Keeping that in mind, every law has to 'pass' through the upper committee after the sub-committee, before passing in the full-legislative body. The extra wordiness is to satisfy the other 'specialized' delegates' demands.
To put it simply; They HAVE to make it ridiculously wordy or it will never become a law. There is just too much money involved. This means that all Microsoft, or anyone else, has to do is 'buy' an influential delegate in the sub-committee, or the chair of the committee in order to kill this bill before it is even voted on in the full legislature.

--
"Our Constitution was made only for a moral and religious people. It is wholly inadequate to govern any other" -John Ada

Further, all brackets, braces and slashes... by Anonymous Coward · 2007-03-01 04:03 · Score: 0, Troll

...must be free-range, smoke-free, and grown under organic conditions in a carbon-neutral environment and driven to their respective lead-free file folders in ethanol-fueled hybrid vehicles.

You do need a blog (mildly OT) by Mateo_LeFou · 2007-03-01 04:05 · Score: 2, Insightful

Your company needs a blog, but (and this is critical) it won't work if it's part of your corporate strategy of appearing-to-look-really-hip. It works if one of your employees creates it on her own initiative, and the strategists leaves her alone.

--
My turnips listen for the soft cry of your love

Re:You do need a blog (mildly OT) by gEvil+(beta) · 2007-03-01 07:28 · Score: 1

You have absolutely no idea what it is that my company does, yet you are going to tell me that we need a blog. Please tell me exactly why my company needs a blog since you seem to know so much about what we do.

--
This guy's the limit!
Re:You do need a blog (mildly OT) by Mateo_LeFou · 2007-03-01 09:09 · Score: 1

Alright there's a nonzero chance that you don't need a blog. But I think companies in general ought to wake up to the fact that markets are conversations and it's obvious to the "consumer" when the other party in the conversation is not an actual human being, but a marketing/PR committee.

I meant no offense

--
My turnips listen for the soft cry of your love

Could be worse... by Penguinisto · 2007-03-01 04:08 · Score: 1

...they could've demanded LaTeX instead.

(/me ducks and runzlakhell...)

(though /me wonders... why the hell not ps? Guess it doesn't have all those neat little bracketed thingies in it that say "tech!" to the average politician)

/P

--
Quo usque tandem abutere, Nimbus, patientia nostra?

Re:Could be worse... by Coryoth · 2007-03-01 04:56 · Score: 1

...they could've demanded LaTeX instead. Actually TeX does have something going for it - there are multiple implementations from different vendors that fully implement the standard and will all produce identical output from a given file. That's more than you can say for any of the XML document formats currently around - even ODF produces different output in different implementations (try opening the same file OpenOffice.org, KWord, and Abiword)! Of course much of that is due to the fact that TeX has been around for so long without any significant changes and, given enough time, XML formats will likely settle toward the same level of quality from different implementations. Still, TeX's consistency is impressive.

--
Craft Beer Programming T-shirts

Microsoft's open XML format: by hoggoth · 2007-03-01 04:10 · Score: 5, Insightful

Just specifying XML doesn't mean much, really:

<document>
Description of MS Open Format
<![CDATA[
37642364 78346478 23465789 34657834 65783465 78934653 47895634 78563478 65347856
56347825 63478256 34786578 34567893 45678934 65783456 78465783 46578346 57834567
34895723 48957348 90578934 75890347 58934758 93475892 ... more binary crap...
]]>
</document>

--
- For the complete works of Shakespeare: cat /dev/random (may take some time)

Re:Microsoft's open XML format: by Coryoth · 2007-03-01 04:32 · Score: 2, Insightful

I think the kicker for Microsoft will be the
(1) Interoperable among diverse internal and external platforms and applications; and
(3) Implemented by multiple vendors; clauses, though it will of course depend on exactly how those are interpreted. It is unlikely that anyone other than Microsoft will fully implement OOXML, so "multiple vendors" rules it out under a strict interpretation. If people want to get around it, however, you can go with a loose interpretation and point to all the vendors who will, out of necessity, provide a basic implementation of OOXML for the sake of compatability and importing documents. Likewise with the interoperable among platforms - with no MS Office for anything other than Windows and Mac, OOXML doesn't qualify as interoperable among diverse platforms. On the other hand you can go back to a loose interpretation and claim that OpenOffice.org "has an implementation of OOXML" (because they will have to have something for compatability's sake) and hence OOXML works on a diverse range of platforms (everything that OpenOffice.org runs on).

--
Craft Beer Programming T-shirts
Re:Microsoft's open XML format: by skubeedooo · 2007-03-01 04:50 · Score: 2, Informative
Just specifying XML doesn't mean much, really
Which is why the bill doesn't just specify XML. FTFA,

...the department in its evaluation of open, XML-based file formats shall consider all of the following features:
1. Interoperable among diverse internal and external platforms and applications.
2. Fully published and available royalty-free.
3. Implemented by multiple vendors.
4. Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.
Re:Microsoft's open XML format: by hoggoth · 2007-03-01 06:09 · Score: 1

Well, my point was that Microsoft can and will do what they can to make their 'open standard' as proprietary as possible. They can hire/bribe third-party companies to write (crappy) implementations to meet some of those requirements. I think specifying XML in the requirements is a useless, even foolish, thing to do. If XML is the best way to meet the interoperable requirements then it will be used, otherwise something else will. Why are politicians making technical implementation decisions? And this isn't because I despise XML. Really, it's not.

--
- For the complete works of Shakespeare: cat /dev/random (may take some time)
Re:Microsoft's open XML format: by atamyrat · 2007-03-01 06:41 · Score: 2, Interesting

http://worsethanfailure.com/Articles/XML_vs_CSV__0 x3a__The_Choice_is_Obvious.aspx

More importantly, what *documents* qualify? by mrchaotica · 2007-03-01 04:10 · Score: 1

As in, do these laws also include stuff like CAD drawings, which currently get stored in Autodesk's proprietary format? That would certainly make me extremely happy, since AutoCAD's monopoly on the CAD industry is as bad or worse than Microsoft's monopoly on office applications.

--

"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

Re:More importantly, what *documents* qualify? by Eccles · 2007-03-01 04:39 · Score: 2, Informative

I believe the IFC (Industry Foundation Classes) format, intended for cross-app CAD communication, is XML-based, and major CAD developers are supporting or working on support for it.

Death to DWG/DXF.

--
Ooh, a sarcasm detector. Oh, that's a real useful invention.
Re:More importantly, what *documents* qualify? by Eccles · 2007-03-01 09:37 · Score: 1

My belief is incorrect. IFC is not XML, aecXML is. According to the wikipedia write-up, "aecXML is a specific XML mark-up language which uses industry foundation classes to create a vendor-neutral means to access data generated by Building Information Modeling", so IFC is used as a component of an XML format, but isn't one itself.

--
Ooh, a sarcasm detector. Oh, that's a real useful invention.

Computer processability by DrYak · 2007-03-01 04:11 · Score: 1

Some documents need to be further processed.

For a computer, flat plain vanilla text is meaning less, it's just a long stream of ASCII (or UTF-8 if you need accented letters or more alphabets) letters.
For a given flat text, you can't easily extract titles and build a table of content for example, because the titles aren't specifically taged as such.
Therefore you need some tagged kind of format to be able to further process the documents. You can't do it with plain Text (nor ready to print formats like PS or PDF).

But, on the other hand, you don't need to restrict to XML only. Whether the format should use XML, HTML+CSS, CSV (for tabular data), SVG (for graphical data), LaTeX, RTF, YAML, Binary ML, C-like structures (like POVRay and similar), specially designed for format, or whatever else.
XML may have some advantages (widely available parsing libraries, technologies like XSLT for easy translations between standarts, etc...) but that doesn't mean XML should be enforced. Any markup format should do the job, as long as it's well documented, implemented on several architectures/platforms/softwares *including FLOSS* and patent free (or with patents that specifically allow FLOSS implementations).

--
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]

Re:Computer processability by Kadin2048 · 2007-03-01 07:40 · Score: 1

Any markup format should do the job, as long as it's well documented, implemented on several architectures/platforms/softwares *including FLOSS* and patent free (or with patents that specifically allow FLOSS implementations).

At least for basic purposes, Markdown fits the bill nicely.

Every once in a while, I dream of a time when Slashdot might implement Markdown, or one of its cousins like MultiMarkDown or Markdown with Smartypants, since they can be easily and quickly translated to HTML with a few lines of Perl (or Python), but are much more user-friendly than typing the raw HTML itself. But then the drugs wear off, and I remember that this is Slashdot.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."

Funny you should mention that - by Penguinisto · 2007-03-01 04:13 · Score: 1

I just noticed that the shiny new SCSI controller cards I got yesterday had a big fat "no lead" symbol prominently featured on the boxes...

/P

--
Quo usque tandem abutere, Nimbus, patientia nostra?

Re:Funny you should mention that - by Anonymous Coward · 2007-03-01 05:26 · Score: 0

RoHS
Re:Funny you should mention that - by demonbug · 2007-03-01 06:24 · Score: 1

I just noticed that the shiny new SCSI controller cards I got yesterday had a big fat "no lead" symbol prominently featured on the boxes...

Yup... they finally stopped making the packages out of lead when they noticed how much it was driving up their shipping costs.

Texas House and Senate bills by ptbarnett · 2007-03-01 04:16 · Score: 2, Informative

HB 1794
SB 446

So far, each bill has been filed and referred to the appropriate committee. However, the legislative session just started in January and things don't usually start happening until after the filing deadline on 2007-03-09.

Oh Please! by Anonymous Coward · 2007-03-01 04:16 · Score: 1, Interesting

There is one particular spreadsheet we have to email for a report - it is loaded with Active-X controls and the like, it Works with Excel on Windows and nothing else. One of the insances where we have to pull out 'the Windows laptop' to do the report once a month.

But since that is a reporting metod with contractors (not the general public), I bet it would be an exception.

Along with that there are other Windows specific gotchas - one is an the Access DB that another program has required us to use, and the third instance, a reporting site that is loaded with IE specific ActiveX code (even if you spoofed as IE, it doesn't work).

Every other state report we do sanely accepts either a, delimited text uploads, plain old paper reports, or a 'most browser friendly' web form.

XML is like violence... by It's+a+thing · 2007-03-01 04:22 · Score: 0

...if it doesn't solve the problem, use more! :P

--
Staring at a white background [on a computer screen] while you read is like staring at a light bulb — Maddox

Re:Anything can parse any other file formats, too. by morgan_greywolf · 2007-03-01 04:23 · Score: 1

Provided the format is well-known and well documented, any language can be used to write a parser for any file format. Now, the nice thing about XML in particular is that most modern languages have either built-in parsers or pre-written libraries of parsers available.

--
My blog

Here's my problem: by superbus1929 · 2007-03-01 04:27 · Score: 5, Insightful

There's one company missing from the Open Document party here, and without it playing along, I don't see this lasting long.

I am, of course, talking about Microsoft. They refuse to accept the Open standard.

Until that happens, there will be problems. Yes, you could have .odt documents sent internally, but what if someone has to send a document to someone outside the company? Microsoft Office does not recognize .odt, and if you think that you can train someone to remember to send .doc files to outside users, and keep internal documents to .odt, then I have a bridge to sell you.

--
Let's stop dilly-dallying and just change "-1: Overrated" to "-1: Disagree" or "-1: Doesn't Subscribe to Groupthink".

Re:Here's my problem: by mrchaotica · 2007-03-01 04:39 · Score: 2, Insightful

Yes, you could have .odt documents sent internally, but what if someone has to send a document to someone outside the company?

We're talking about government offices here, not companies. The answer to your question in this case is (or at least, should be) "we compel the outside person to get an ODT-compatible program by government fiat."

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:Here's my problem: by itlurksbeneath · 2007-03-01 04:40 · Score: 1

Problem solved. See this previous post

--
Have you ever considered piracy? You'd make a wonderful Dread Pirate Roberts.
Re:Here's my problem: by ibbo · 2007-03-01 04:42 · Score: 1, Insightful

MS are loosing their struggle.

The EU managed to make charges stick to prevent them monopolising. Drip by drip Europe his heading for open standards and now we see states in the US going the same way too.

If MS do not get on the boat they will be left behind.

Open format is the way forward thus corporate America and corporate Europe are dictating to MS the terms of the game these days whereas yesteryear it was MS marketing bollox that dictated terms.

The age of MS has passed. Long live open source.

Ibbo

--
Linux user #349545 (GNU/Linux)iD8DBQBAzWjX+MZAIjBWXGURAmflAKCntuBbuKC WenpmXoA7LNydllVQOwCfdjyzXscd
Re:Here's my problem: by The+Cisco+Kid · 2007-03-01 14:58 · Score: 1

Why should CA send MS-Word files to anyone? They would send them in whatever eligible open format they choose, which the recipient can choose whichever tool capable of reading them they want. If MS opts to not provide such a tool, then one will have to be obtained elsewhere. This may hopefully force MS to actually compete fairly, and provide the features its customers want, instead of forcing 'features' on them that prevent them from interoperating with anything else.

MS can either kick and scream fighting it, or they can get onboard - interoperability is on the way in and vendor lock-in is on the way out.

For one thing... by brunes69 · 2007-03-01 04:31 · Score: 4, Insightful

XML means it is readable by humans. You don't even NEED any kind of a program to get the text.

Re:For one thing... by greginnj · 2007-03-01 04:40 · Score: 1

XML means it is readable by humans. You don't even NEED any kind of a program to get the text.
... if it's printed out on paper, you mean. If you can suck text directly out of a CD, floppy, hard disk, or USB drive without some sort of power source, CPU, and and OS, then my hat's off to you ... our new ... robotic, data-crunching overlord? [ or something... ]

--
Read the best of all of Slash: seenonslash.com
Re:For one thing... by dvice_null · 2007-03-01 05:33 · Score: 1

> XML means it is readable by humans.

Not quite so. Microsoft uses xml in their new format, but I doubt that anyone could read that because:
- It uses element names that can't be understand without reading the 6000 page specifications.
- Most of the media-data is in a format that can't be understand even with the 6000 page specifications.
Re:For one thing... by rtaylor · 2007-03-01 05:37 · Score: 1

XML means it is readable by humans.

No it doesn't. XML made for humans is readable by humans. XML made for computers is not going to be any easier to read by humans in a text editor than a typical binary document format.

Heck, I've even seen self-defining XML documents where the top half contained the tag names and meanings for the bottom half, essentially defining macros to reduce duplication. Toss in several different character encodings and write the data in a random order using pointers (again, to reduce duplication) and you've got something nearly impossible for a human to understand without assistance.

--
Rod Taylor
Re:For one thing... by _xeno_ · 2007-03-01 06:33 · Score: 1

<?xml version="1.0"?><!DOCTYPE p [<!ELEMENT p (PCDATA)><!ENTITY b "b"><!ENTITY e "e"><!ENTITY q " "><!ENTITY r "t"><!ENTITY t "?">]><p xmlns="http://example.com/sure/it/is" >Wanna&q;&b;&e;&r;&t;</p>

Sorry, I had to.

(And, yes, that's both well-formed and valid XML! Well-formed means it can be parsed as XML, and the valid part means it validates against its embedded DTD.)

--
You are in a maze of twisty little relative jumps, all alike.
Re:For one thing... by flyingfsck · 2007-03-01 07:32 · Score: 1

Yeah friggen right - XML has so much superfluous bracketitice, that it is pretty much unreadable by any human, unless maybe you send through a 'grep ignore' filter first...

--
Excuse me, but please get off my Pennisetum Clandestinum, eh!
Re:For one thing... by Lord+Bitman · 2007-03-01 07:36 · Score: 1

Yeah, I know... but don't tell my boss or he'll find out the two weeks I spent reverse-engineering that file we got with no specifications was wasted effort!

--
-- 'The' Lord and Master Bitman On High, Master Of All
Re:For one thing... by SEWilco · 2007-03-01 17:20 · Score: 1

For no good reason, that reminds me of my favorite "Obfuscated C" contest entry.
#include /dev/tty
It won a "Best abuse of the rules" prize.

Re:we should do this too by roach2002 · 2007-03-01 04:31 · Score: 1

Where's my "score -1, not well formed" moderation option?

Is Arnie still governor? by Bazman · 2007-03-01 04:38 · Score: 2, Funny

What next? Hasta la vista, Vista?

Two Possible Outcomes by BCMcI · 2007-03-01 04:47 · Score: 2, Interesting

If California passes this resolution I can see two outcomes. 1 The state recognizes that ODF has to be used and scraps Office and loads OpenOffice or StarOffice. Big win for the citizens of California big loss for Microsoft. 2 The state recognizes that ODF has to be used and because older versions of Office won't work with ODF they purchase Vista and Office 2007 for all state agencies. Huge loss for the citizens of California huge win for Microsoft. Guess which is more likely?

Re:Two Possible Outcomes by gauauu · 2007-03-01 06:05 · Score: 1

I'm not convinced that would be a huge loss for California. If MS were to properly support open formats, then would they still be considered "evil"? I think it's worth the money to go to a open and compatible system, even if the vendor with the best implementation happens to be microsoft.

Note that the key word here is "properly support". I'm not sure that will happen.

In Completely Unrelated News... by antirelic · 2007-03-01 04:47 · Score: 2, Interesting

3 states who's yearly budget is under review are looking for ways to drive down existing IT costs by threatening to pass legislation that will get them huge discounts on Operating System and Office Software .........

--
20th century Marxism is not progress...

Reality vs. Concepts- by ShrapnelFace · 2007-03-01 04:53 · Score: 2

The absolute unfortunate truth in this case is that it will not matter what requirements the state of California sets forth, because in the end it serves not the people but the income of the government.

You see, in California, we have this precedence of hiring under-motivate, under-educated, people into roles to fulfill status-quo on the premise of serving equality.

This results in a rule that I call "Factor 4" where by you can take the initial cost of any related project, service or resource requisition, and thereby multiply it by factor of 4 in order to obtain the actual cost to the government.

Sadly, Factor 4, is a direct result of the mediocrity that has taken up residence within all of our government agencies. I cannot imagine a bigger nightmare than this one that I just read about. Half of the institutions within the government are filled with people that have no idea what that means, and lack the education to understand it.

With this being true, we open the door to committees, educational round-tables to determine educational requirements, requisitions for training, then post-comittees to evaluate if the needs were met, then another comittee to determine if the proper mixture of minority members were upheld, then further we'll add layers of evaluation to insure that all submissions qualified with the sole purpose of perpetuating a verification process that checks itself sometimes 3 times over- with absolutely no guarantee that said process is: accurate, predictable, or effective.

All this does is allow state governments the ability to ask for additional funding, which they will earmark with non-related items, and then fund other programs with the initial request.

Translation- the greater good for which said items are presented will be moderately served.

Outcome: Ho hum and whatever. Can't we think of better things to do with my tax money than fuddling around with this area of business? I say they throw out this status quo requirement and start paying people what they are worth so that we can get some really bright minds into our state governments.

Re:Reality vs. Concepts- by Anonymous Coward · 2007-03-01 05:42 · Score: 0

you have it *all* wrong.

the real problem with government is they spend OPM - other people's money.

let me make this very simple.

$1,000 OPM = $1 MM (My Money - as in government official).

iow, the CORRUPT (not dumb, CORRUPT) government employees will gladly trade $1,000 OPM to PERSONALLY BENEFIT by $1.

that's where all your money goes - to folks who get $1,000 for each $1 invested in corrupt politicians and government officials.

if people are too dumb to realize that al gore burns 2,200% more KW/hr than the average american as he tells everyone else to conserve energy, we are too dumb to keep tabs on the corrupt politicians when they sell $1,000 OPM for $1 MM. We won't even talk about how a one way trip on a gulf stream jet creates nearly as much carbon dioxide as an average american ALL YEAR! tell that to arnold and diane feinstein as they tell *you* to conserve energy as they burn nearly as much carbon dioxide in a coast to coast round trip gulf stream 4 flight as 4 americans do all year (fly to east coast, drop of feinstein, fly back to san fran to park. fly back to east coast to pick up feinstein, fly back home - 180,000lbs of carbon dioxide unleashed on the planet. the average american creates ~ 50k lbs of carbon dioxide a year)!

no, the government officials aren't dumb, the electorate is dumb and they are fleecing us. by definition, the dumb folks don't fleece the smart folks, the immoral smart folks fleece the dumb folks.
Re:Reality vs. Concepts- by ShrapnelFace · 2007-03-01 05:49 · Score: 1

Thanks for flaming me- I think if you read what I said we are actually in agreement.

The two things are parallels on the same plane- you cant have what I say without what you say.

Forest for the trees and all, ya know?
Re:Reality vs. Concepts- by geekoid · 2007-03-01 06:19 · Score: 1

your an idiot, thousands of projects get done on or below budget.

In fact, government agencies are often less wastefull then corporate projects.

Yes, some things go over budget, and when that happens the media jumps on it, and they should.
But it is the exception, otherwise it wouldn't be in the news.

--
The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
Re:Reality vs. Concepts- by Anonymous Coward · 2007-03-01 07:27 · Score: 0

Hi I work here in for CA GOV... and I'm responding to this from work, during my regular work shift, not taking a break. spending your dollars to do it.
I have a bumper sticker on my Van that states. Don't laugh you paid for it.

So thanks to all of you for supporting me as I endeavor to spend your money and waste it as fast as I can. /end

ps. I'm only joking about wasteful spending we do really try hard to get money for projects that are needed. But it's robbing peter to pay paul in many cases. If I take 50K for a server for our planning department thats 50K less planning done. Our IT is supported 100% by our customers. So yea the factor 4 comes into play some because we're at the bottom of a very long line with our hands out. However IT is a big expense and ANY money we can save our customers helps by putting that money into real things our REAL customers can see. That's you Mr. Tourist or Mr. Californian or whoever else visits this state.

ODF will be a good step if we can get out of the MS cycle.
Re:Reality vs. Concepts- by ShrapnelFace · 2007-03-01 08:19 · Score: 1

So what you are saying is:

The total cost of most projects in California occur under budget.

I'll allow you the opportunity to see how well we are doing right here:

http://www.lao.ca.gov/2006/major_features/major_fe atures_2006.html/

California is so ontrack that there are a ton of revisements.

Yeah- Im an idiot.
Re:Reality vs. Concepts- by ShrapnelFace · 2007-03-01 08:28 · Score: 1

Editing my link- doesnt seem to work

http://www.lao.ca.gov/2006/major_features/major_fe atures_2006.html

Re:we should do this too by armomurha · 2007-03-01 04:58 · Score: 1

XML Parsing Error: not well-formed Location: http://slashdot.org/ Line Number 2, Column 6: <user="wwwillem"> -----^

Re:we should do this too by Odiumjunkie · 2007-03-01 05:10 · Score: 1

<user="Odiumjunkie" />
<title="Re:we should do this too"</xml>

<quote id="wwwillem">
<pre><user="wwwillem"></pre>
</quote>

<p><span class="pedantic">Forget to close that tag much?</span></p>
</xml>

Re:we should do this too by Odiumjunkie · 2007-03-01 05:14 · Score: 1

inevitably, I fucked that up

Good news for some by Big+Nothing · 2007-03-01 05:19 · Score: 1

This should bring in some big bucks for certain projects at least...

--
SIG: TAKE OFF EVERY 'CAPTAIN'!!

No brainer by electrosoccertux · 2007-03-01 05:19 · Score: 4, Funny

I don't get what all the hoo-haw is and why we need courts or lobbying for any of this. I find it very difficult to write anything when my term paper or [insert your document here] isn't open. Sounds like a bunch of people just need to learn how to double-click.

Re:No brainer by Anonymous Coward · 2007-03-01 07:26 · Score: 0

Whotf moded that insightful? Lol...

Re:we should do this too by wwwillem · 2007-03-01 05:21 · Score: 1

I'm sure that by now you agree that doing a First Post this way, would be a tough challenge.... :)

--
Browsers shouldn't have a back button!! It's all about going forward...

Governor Swartzenager said... by Comboman · 2007-03-01 05:42 · Score: 1

Governor Swartzenager said in a press release, "The state of California has Terminated vendor lock in. It's the End of Days for the Raw Deal and True Lies we were getting from Microsoft. Documents can now be backed up for Total Recall."

--
Support Right To Repair Legislation.

I think it fails on several criteria... by Finsterwald+P+Ogleth · 2007-03-01 05:45 · Score: 1

(1) Interoperable among diverse internal and external platforms and applications;

"Diverse internal and external..." I think diversity would include Linux distros...MS products don't run natively on Linux-based OS's - Partial failure

(2) Fully published and available royalty-free;

I assume "Fully..." means no secret binaries, or API's..."available royalty-free" define what is a royalty... as in MS can't "choose" to whom to license it...and can it be passed on? - Partial Failure

(3) Implemented by multiple vendors;

I assume "implemented" means "used as a native data format to the application, not something that requires a "filter" to open or save it... Failure

(4) Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.

'nuff said...Complete Failure.

Re:I think it fails on several criteria... by Coryoth · 2007-03-01 06:39 · Score: 1

I think MS will take the tack that many of these points are arguable.
(1) Interoperable among diverse internal and external platforms and applications;
"Diverse internal and external..." I think diversity would include Linux distros...MS products don't run natively on Linux-based OS's - Partial failure Sure, but OpenOffice.org will, and you know they're going to have at least a partial implementation of OOXML working which MS will then point to. I've discussed the issue with demanding a full implementation elsewhere.
2) Fully published and available royalty-free;
I assume "Fully..." means no secret binaries, or API's..."available royalty-free" define what is a royalty... as in MS can't "choose" to whom to license it...and can it be passed on? - Partial Failure Here MS will point to their ISO standard document. Sure, you can debate the issue of whether that is fully published, given the issue of tags like <SpaceLikeWord95> that provide no explanation other than "use spacing like Word95". The point is that it'll be a debate, and you're arguing technical details, which will tend to be missed by most politicians. I think MS expects to squeak through on this point on sheer volume: "Surely a 6000 page document is fully publishing the standard..."
(3) Implemented by multiple vendors;
I assume "implemented" means "used as a native data format to the application, not something that requires a "filter" to open or save it... Failure Again, MS will point to the numerous incomplete implementations from other vendors who need to be compatible with the dominant standard. Sure, they won't be fully compatible, but then I expect you'll find most ODF implementations won't be fully compatible - closer than any non MS implementations of OOXML, but now we're down to quibbling over degrees...
(4) Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.
'nuff said...Complete Failure. This is an interesting one because it is not clear to me how this should be interpreted. Presumably MS could claim, if OOXML gets accepted by ISO, that ISO controls the standard and they are "an open industry organization with a well-defined inclusive process for evolution of the standard". Whether that argument would actually fly I don't know - IANAL.

The point is that MS can make arguments for every single point, and there's a reasonable chance that, with enough money to grease the wheels, they'll squeak through on every criterion.

--
Craft Beer Programming T-shirts

Credit Knuth. by Kadin2048 · 2007-03-01 05:54 · Score: 1

Of course much of that is due to the fact that TeX has been around for so long without any significant changes and, given enough time, XML formats will likely settle toward the same level of quality from different implementations. Still, TeX's consistency is impressive.

It's like that by design. IIRC, Knuth is very concerned with the stability of TeX, in terms of producing predictable output from a given input file. I've read that the plan is to completely freeze the codebase when he dies -- I think he described it as a point when "all remaining bugs will become features" -- and although others will be able to be free to take the code and produce some other typesetting engine from it, "TeX" itself will be set in stone, so you'll always be able to take a TeX document and get the same output from it. This is represented by current version numbers that asymptotically approach pi (e.g. version 3.14, 3.141, 3.1415...) with each bugfix, where the final version will be marked by changing the version "number" to \pi itself. I think METAFONT approaches e in the same way.

I've always thought that this represented a pretty forward-thinking view. Not too many people really think too hard about what will become of their software after they die. But what do you expect from a guy who thinks that this is a stop-the-presses, call-your-sysadmin "dramatic improvement"? Now that's attention to detail. (Or, how about his taxonomy of diamond-shaped road signs?)

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."

Same way GSA works by Krishnoid · 2007-03-01 06:13 · Score: 1

You want to do business with our company, you exchange documents in our desired format. But we're not compelling you to do business with our company. However, our company is the gub mit, so it's probably in your enterprise's best interest to consider us a very large business partner or customer.

Dell will preinstall linux on systems for a large enough order; state governments negotiate discounts with drug companies; is it that difficult to think that a small company will make the effort to load openoffice or some other odt-compatible app for the opportunity to transact business (not a fundamental right of freedom, mind you) with one of the largest customers they have the opportunity to get?

And the answer is ... by cdrguru · 2007-03-01 06:23 · Score: 1

What is needed is an ODF compatible version of WordStar.

The idea that at an enterprise level there are multiple vendors (real vendors, not distributors) of a word processor and spreadsheet program is a joke. There are perhaps three, and the two I know of are OpenOffice/StarOffice and Microsoft. And there are huge questions about the enterprise viability of OpenOffice that have yet to be answered.

Also, the level of complexity for ODF is such that it is unlikely that every implementation is going to render it the same. This means you create a document with one application and all the form fields are lined up. It is then printed with a different application - still using ODF - and the form fields are shifted over slightly. Maybe just enough to move from column D to column E on the form.

The level of complexity is utterly absurd for any cross-application compatibility. Micrsoft at least understands the problem and clearly indicates that such compatibility isn't going to happen. Reading their standard shows that. Without a committee overseeing development and implementation and certifying implementations, there will never be the level of compatibility that is required.

Re:And the answer is ... by Anonymous Coward · 2007-03-01 07:15 · Score: 0

> and the form fields are shifted over slightly

Do you think that this does not happen with .docs ? Do you think that this does not happen between different versions of MSWord ?

> clearly indicates that such compatibility isn't going to happen

Obviously then we need to ban all other companies and ensure that everyone only ever uses the latest version of Microsoft products.
Re:And the answer is ... by The+Cisco+Kid · 2007-03-01 14:52 · Score: 1

That there are not multiple vendors of 'Word Processing' software is directly due to Microsoft illegal maintentance and extension of their monopoly, not the market's fault. It is also a very unhealthy situation, and anything that promotes there to truly be multiple real vendors is a Good Thing.

And incompatibility between the several dozen versions of MS Word is at least as bad as what you attribute to ODF. Note that the bill doesnt require 'ODF'. It merely requires that the format publically documented for royalty-free use and implementation, as well as a few other criteria, but the article, and in fact the slashdot summary speak for themselves - perhaps you need to read it over again. It doesnt specifically require ODF (although ODF would in fact meet the criteria). MS is free to *really* publically document their formats for royalty-free implementation, and then they would qualify. If MS doesnt want to do that, they can take their ball and go home.

But the second outcome works! by Krishnoid · 2007-03-01 06:36 · Score: 1

The state recognizes that ODF has to be used and because older versions of Office won't work with ODF they purchase Vista and Office 2007 for all state agencies. Huge loss for the citizens of California huge win for Microsoft.

Large win for Microsoft -- they sell Vista and Office 2007 to all state agencies this go-round.
Large loss for Microsoft -- if they want to play in the government agency space, they run the risk of losing the monopoly-format lock on potentially all customers
Small win for businesses -- they now have a choice to use openoffice or some other format to exchange documents with the gub mit?
Small win for citizens -- no need to buy MS Office to view gub mit documents?
Small win for everybody -- first step in releasing the MS stranglehold by hitting them in the MS Office handcuffs?
Unknown -- MS adulter^H^H^H^H^H^H^Hextends the ODF standard to make other implementations incompatible and someone eventually files a lawsuit

Besides, who gots mo money than they know what to do wit? The Gub Mit. (-- In Living Color). Let them pay to insert the thin end of the wedge into the MS monopoly.

A dozen products? by cdrguru · 2007-03-01 06:36 · Score: 1

What are they? All the "enterprise-level" products I have heard of are Office and OpenOffice. WordPerfect or PerfectOffice has been out of that game so long as to not even be a real consideration.

Are you including things like KOffice?

Where is the compatibility certification that says the documents are rendered identically? You know that is going to come up, sooner or later. Because it is going to be important at a government level. You have preprinted forms that are filled in on the printer. Without identical rendering the spacing may be off just enough to matter.

Re:A dozen products? by Anonymous Coward · 2007-03-01 09:35 · Score: 0

Where is the compatibility certification that says the documents are rendered identically?
There isn't one. And that's the beauty of XML: you use CSS to manipulate the rendering in whatever way you need to. The XML describes the logic of the document; the way it is presented is decided at the time of presentation by the client used to do the presentation.
Re:A dozen products? by fritsd · 2007-03-01 12:39 · Score: 1

I'm probably showing my ignorance here, but isn't it possible to validate the consistency of an XML document using XSLT software like xalan: http://xml.apache.org/xalan-j/?
Then, you at least know whether the document is valid ODF. Step 2 would then be to make sure that all valid ODF documents are also rendered identically, which depends on either bugs in the word processor programs (to be squashed by the companies making them), or inconsistencies and/or ambiguities in the standards document (to be complained about at OASIS).

--
To be, or not to be: isn't that quite logical, Slashdot Beta?

I vote -1 troll by Anonymous Coward · 2007-03-01 06:43 · Score: 0

I vote -1 troll

Overhead will kill you.... by tinkerghost · 2007-03-01 07:44 · Score: 1

If you are doing large datasets, you can end up trippling your filesize easily. Think about this, if you have 90 datapoints recorded 100 times a second, that's 9K+ sets of tags that have to be decoded every second with at least 7 extra characters per tag set. That doesn't include the performance hit for encoding & decoding the datastructure. <data> <time stamp=[timestamp]> <sensor number=1>5.00</sensor> <sensor number=2>5.01</sensor> ... <sensor number=90>5.00</sensor> </time> <time stamp=[timestamp+1]> ... </data> For large, structured datadumps like that, CSV can actually be a faster way to work. A hybrid works even better to make a more flexible system with some of the best features of both. <format> <interval> <unit>Second</unit> <value>0.01</value> </interval> <field name='Sensor1' position=1/> ... <field name='Sensor90' position=90/> </format> <CSV_Data> 1,2,3,4,5,.. .,90 1,2,3,5,4,...,90 ... </CSV_Data> It uses XML to define the data format, but at the same time, it doesn't require a lot of processing to decode the actual dataset. Also the file size is minimally impacted.

Re:Overhead will kill you.... by Anonymous Coward · 2007-03-01 08:00 · Score: 0

XML is for documents. If you're not working with documents, use some other format.
Re:Overhead will kill you.... by tinkerghost · 2007-03-01 08:24 · Score: 1

XML is for self describing data. Anything properly formatted into XML becomes an XML document. If that XML document can be displayed as a formatted Text document, a spreadsheet, or a presentation so be it. Web services are clearly not 'documents' as you are using the word, but they are certainly defined by XML documents (SOAP formatting). Most of the data exchange going on today in system integration/intercommunication is happening through XML documents.
The EDI standard for telephone ordering is defined in a 700 page document explaining how the various types of orders effect the positional interpretation of the string. Calling it ugly is being cruel to ugly. This is a data standard that could definately benefit from XML.
The interface for several external systems that I have had to intigrate with have used XML structures to transmit & receive data. It does work well, given certain constraints. The biggest one being that you don't transmit huge data tables via fully encoded XML.
Re:Overhead will kill you.... by GaryOlson · 2007-03-01 16:24 · Score: 1

That doesn't include the performance hit for encoding & decoding the datastructure.
This is Government work TFA is talking about. Efficiency is neither relevant nor desired.
Put down the rational thought. Step away from the rational thought and no one in the government will get hurt.

--
Every mans' island needs an ocean; choose your ocean carefully.

Dept Tag--"Lobbiest"? by adavies42 · 2007-03-01 07:53 · Score: 1

Lobbiests would be people who are most like lobbies, I take it?

--
Media that can be recorded and distributed can be recorded and distributed.
-kfg

Please resubmit your comments... by Physics+Dude · 2007-03-01 08:08 · Score: 1

Error processing slashdot post 18194566: Line 2: tag "user" has no matching closing tag.

;-)

-1 redundant by Physics+Dude · 2007-03-01 08:13 · Score: 1

Error parsing slashdot post 18198372: User forgot to refresh article for before posting

Oops!

Isn't quite the same situation by DrYak · 2007-03-01 08:20 · Score: 2, Interesting

It seems unlikely that KOffice is a complete superset of OpenOffice, therefore even if both OOo and KOffice implement ODF, KOffice can never be completely interoperable with OOo (at least OOo -> Koffice). Further, if KOffice implements any features that OOo doesn't have, then the same is true in reverse.

Although the starting situation is mortly similar with OOXML, it isn't quite exactly the same.
Yes, there may be some different way to interpret the standards, and maybe two different implementation produce slighlty different results (Spreadsheet formules, for exemple aren't standarized yet). The difference is that this standard is controlled by a whole comitee (OASIS), in which several software maker are represented, include FLOSS, and it's in their interest to have the best interoperability as possible.
Thus there's a high probability that, faced with such a situation, the detail of the implementation will be specified in next OpenDocument revision, so that the other software can do a better job in opening their interoperation. In fact, latest versions of AbiWord seem to be much more close to the original OpenOffice.org document. Or maybe they'll even create a new options that allows to tweak the parameters of the function in a documented way ( where 2.67 imitates best the behaviour of OOo and 3.14 is KOffice's default and 3.00 is what every new application is supposed to assume in case of missing param, according to documentation)
And if some other product develops more functionality than ODF is capable of encoding, there's a high probability than an extension will be written and published with the next ODF revision.

In fact, ODF isn't as much direct memory dump of OpenOffice.org as SXW was. ODF has been further processed by OASIS. Whereas OOXML (for now) is still a direct memory dump.
s the sole and unique maintainer of the OOXML specification, it's not in their interest to maintain pixel-perfect conversion for competitor (they need competitor to be bale to interoperate with document formats - to shut the people complaining up - but they need to be the only product that can promise 100% pixel-perfect imports).
Regarding with difference of working, the whole documentation is bloated with definition of options like "" for several thousand pages, which aren't explicitly documented at all (it's only written that they will be deprecated and that implementation aren't required to react to them). Only MS-Office will ever be able to open them by definition (Abiword may be able to reverse engeneer them, but it'll take more work than asking OASIS for a better documentation).
You can bet that, if Microsoft adds some new functionality, they'll be the only one to support them as a paid-for extension... probably called 'Visual OOXML#'. You can be sure that they'll out-"Embrace, Extend, Extinguish" their own ECMA approved standard.
Or at least to produce as badly written as possible documentation.

What will make the difference between OOXML and ODF is microsoft willingness to cooperate (or lack of), and OASIS comitee collective need to collaborate between members.
The only potential way to save OOXML is to put it into control of a groups, in which there's at least one FLOSS represented (say, WordView), which will have to grant full right to use and promise not to patent-sue independent implementations, and will force Microsoft to use OOXML instead of some "MS OOXML.net" extensions.
Which they'll never agree to.

--
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]

Re:Isn't quite the same situation by man_of_mr_e · 2007-03-01 19:53 · Score: 1

Maybe i'm confused, but you seem to be contradicting yourself. At one point you say "Yes, there may be some different way to interpret the standards, and maybe two different implementation produce slighlty different results" then down further you claim it is imperative for other office suites to create pixel perfect MS Office renderings. Why is it no big deal if ODF based office suites can't render each others documents exactly, but such an enormous deal if they can't render an Office document exactly?

I'm also not sure why you state that OOXML doesn't have an independant working group. When I looked at the ECMA OOXML working group, it consisted of about a dozen organizations, including software companies like Apple and Novell, and non-profit organizations (who I would assume have a vested interest in a working archival standard), such as the british library. Novell is an open source company now, and they are producing an open source implementation of OOXML for OOo, which seems to contradict your point as well.

Your argument seems to be ignorant (not a slam, you really don't seem to know the details and say things that are already the case have to happen), emotional, and not very well thought out, as evidenced by your contradictory positions based on what argument you wish to present at the time.

You also didn't really address my argument, which was based on the implementation argument. No two office suites will have identical feature sets, therefore, no two office suites can completely interoperate, regardless of data format. For example, what happens when you open an OOo document that has text frames in it, but the application you're using doesn't support text frames? It will either have to convert the document to something it can support, or ignore it (possibly corrupting the integrity of the document), or even just delete it. So then, you make a change, save the document and send it back to the original author and now it's completely different.

OOXML might be a direct dump of office, but frankly I don't see how that really matters. Even if Office used ODF, and ODF was capable of supporting all of Offices features (something i'm not sure about, MS claims not but ODF supports claim it can), no suite will be able to implement it either, because Office still has to implement all it's features, those same features that are in OOXML.

All i'm asking is that you look at the arguments you're using rationally. Many of them don't make sense and flip-flop.

--
If you need web hosting, you could do worse than here

Re:we should do this too by iabervon · 2007-03-01 09:46 · Score: 1

<div class="commentTop"> <div class="title"> <h4>Re:<xml>we should do this too</xml></h4> </div> <div class="details"> by iabervon (1971) </div> </div> <div class="commentBody"> <div id="comment_body_n">Slashdot posts are already xml, you know. Using a targetted schema like XMPP would actually be simpler than the status quo.</div> </div>

illegal now: PNG, JPEG, PDF, OGG... by r00t · 2007-03-01 13:14 · Score: 2, Informative

All video formats are now illegal. (or is there now an XML video abomination?)
All audio formats are now illegal.
Probably all image formats are now illegal.

Whee.... this'll be entertaining.

Re: illegal now: PNG, JPEG, PDF, OGG... by gidds · 2007-03-02 00:38 · Score: 1

In fact, doesn't this also make plain 7-bit ASCII text illegal too?

--
Ceterum censeo subscriptionem esse delendam.

Re:we should do this too by RealGrouchy · 2007-03-01 15:49 · Score: 1

No, you simply type it in a web page editor like Dreamweaver or NVU, then copy the html code from the html tab into the content tab, then copy the newly-created html code into the text box.

- RG>

--
Hey pal, this isn't a pleasantforest, so don't waste my time with pleasantries!

MOD UP PLEASE by duffahtolla · 2007-03-01 17:37 · Score: 1

A lot of people miss this point. Thanks.

Lobbiests? Typical Slashdot shoddiness by Anonymous Coward · 2007-03-02 01:15 · Score: 0

No other site I know of has such illiterate and apathetic editors. Come on guys, this is just fucking embarrassing. Did you even go to school? Do you even care that you look like fucking morons and drag down the credibility of the site? Fucking asswipes, do your job or you should get a boot helping your ass out the door.

Slashdot Mirror

California Joins Open Document Bandwagon

188 comments