Australian Stats Agency Goes Open Source
jimboh2k writes "The Australian Bureau of Statistics will use the 2011 Census of Population and Housing as a dry run for XML-based open source standards DDI and SDMX in a bid to make for easier machine-to-machine data, allowing users to better search for and access census datasets. The census will become the first time the open standards are used by an Australian Federal Government agency."
I'm perplexed why people continue to use XML when there is YAML. What is it that makes XML so attractive as a durable format? it's not human readable in a practicale sense, and YAML very much is. Since it's delimeters are comlicated and variable, It's harder to parse in ad hoc ways than yaml (line and white space) which means that for rapidly extracting things there are no shorcuts to instantiating a whole document. It's hard to grep. And both formats can fully do the other ones job so they are interchangeable.
Some drink at the fountain of knowledge. Others just gargle.
"The census will become the first time the open standards are used by an Australian Federal Government agency."
Really?
http://xena.sourceforge.net/
Australia is openly embracing census data and enhancing it's availability.
Canada's government is going out of its way to prevent census data collection.
134340: I am not a number. I am a free planet!
Meanwhile, in other government agencies and private enterprise there are open file formats such as the geophyical SEGD and SEGY formats that have been used since at least the 1980s. That means you can read data files from 1982 on current software.
Closed file formats are an "innovation" of Microsoft and similar companies. It's really any different from the bastards that write unreadable code in an attempt to provide job security.
hopefully in the future some of the practices of elements of Microsoft and many others will be remembered like the claim salters and others with "sharp" business practices in the old west.
We should find out what percentage of the population thinks that this is a good idea....
...and here's why:
It's official - Munich Linux migration is "dead - abandoned in all but name." - Linux
Yes, you read right: "Dead - abandoned in all but name".
it looks like they want all data up, the only data not collected is names and addresses, you can use any of the questions to define your sets.
"DDI and SDMX are good at describing things, and we're testing the very notion that you can actually consume this stuff and make it discoverable metadata for your search engines."
"We definitely want to see who's keen, who's interested in statistics and metadata, open data, data linking and what people can do with it as well."
You have 5 Moderator Points!
Which Helpless Linux zealot/MS basher do you want to mod down today?
Munich Linux migration is "dead - abandoned in all but name."
Last I heard it was a migration to open source and they were successfully using open source desktop applications. The operating system may be Windows rather than Linux but this still seems to be a victory for open source. On the desktop the applications are far more important than the operating system.
There is some difference. I'm not clear from the summary exactly what's going on.
XML is perfectly suitable for long term data storage and exchange. You have namespaces, schemas, and a millions of tools to handle it.
YAML is OK for storing configuration data. It's not even that good for anything else.
Also anyone who "parses in ad hoc ways" deserves to be slapped in the face.
How many Jedi's currently live in Australia.
If you mod me down the terrorists will have won
but the govt still thinks sharing is bad
This is why an Australian invented Wikileaks... I mean... "information wants to be free" and such...
and open source you share your code freely to help everyone
Hey, where does it say that they'll share the code? TFA quote:
with the ABS directing software developer Space-Time Research to utilise the standards for both input and output of all data collected next year.
So: ;) )
1. it is the data that will be shared (govt takes preemtive - still legal - actions against Wikileaks?
2. the guys that are doing the software is Space Time Reseach - the way I know, a bit far from a open source establisment (note: I have no affiliation with them)
Questions raise, answers kill. Raise questions to stay alive.
When is it ever desirable for indentation to not match the logical structure of a program?
The only possible reason I can come up with is if you're intentionally attempting to obfuscate your code.
As the author of the Perl module YAML::Tiny, and the current maintainer of the original YAML.pm I call troll on the parent.
YAML as a specification is way more complex than XML and it's way harder to implement.
And who in their right mind is going to read the raw census statistical quads directly? The point is moot.
XML is ideal for machine to machine communication. It's easily machine readable, and easily debuggable by nerds (which is the bit of "readable" that really matters here). And machine readable is what the ABS has in mind here as their goal.
Adam K (too lazy to log in)
The census will become the first time the open standards are used by an Australian Federal Government agency.
What the hell are you talking about? We use a variety open standards every day of every minute across every department with any modern IT assets, I think what you meant to say was the first time that open standards are being used by an Australian Federal Government agency to communicate with the general public. Even then, it's not exactly news, it was going to happen eventually.
You're getting close, it's definitely to allow an intentional expression and it's going to be a bug for all people who use white space to express more than just the {}'ness.
I wonder why a language has to enforce something that could have been enforced by the editor for those that value it.
Strictness on this is what kept back so many perl coders and stopped python from ruling the world.
But... I don't mind... and python-ites prefer white space to world domination, so thats good too!
blog.sam.liddicott.com
I'm with you on the python whitespace thing, but for YAML it's different. We're not talking about writing code here. It can be tricky to get the whitespace right but it's a damn sight easier than learning and reading XML syntax. Remember that 99% of the time machines process these files and we only care to make reading easy (where YAML whitespace is a non-issue) and human editing easy, where it isn't too bad. Composing from scratch by hand isn't really something you're going to be doing with YAML (or XML).
I want my Cowboyneal
bullshit, idiot. indentation is simple.