XML for Ancients
Andrew writes: "More than 5,000 years ago, the very first information revolution occurred when some unknown research team in Mesopotamia found a way to download and store language through a killer application called "writing.". The cuneiform digital library will have 60,000 texts ready in a couple of years. Using SVG and XML to represent their documents. Similar efforts are underway for hieroglyphics."
I have been working in IT since 1997, yeah I know a mere blink of an eye for some Unix Wizards (ie. beards, strange clothing and their own arcane language). What I have noticed is that every year my handwriting has been getting progressively worse. What with my PDA, laptop, PCs etc. I just have no need to wield a pen no more :)
Apart from signing my name on credit card chits, the only time I am required to write is for birthday/Christmas and other assorted cards. Its getting so bad now that I start to write a long word and just give up. My once pristine handwriting now looks like a doctors prescription scrawl.
Any else get this too?
Po
Site appears to be slash-dotted already...
So.. Are these 5000 year old documents going to be freely available or will the database of texts be copyrighted/restricted?
Looking for any old 8-bit Heathkit/Zenith software/hardware - http://heathkit.garlanger.com
With all these ancient language/hieroglyphic texts being archived, I have a feeling that we'll be hitting that 65536 character wall very shortly, since someone in the future might need that Cunieform version of M$ Word (hey, it could happen). Is it time for UTF-32?
They're using XML? They could integrate this with some sort of retrieval language and couple it with Jabber clients. That way you could send some sort of command-line search/retrieval command to the database using a regular Jabber client and have the XML data sent back, since Jabber natively supports the standard.
How Snowcrash.
they are also writing their tcp packets on clay tablets, and attempting to send them down the wire. That was the quickest /.'ing I've *ever* seen.
[smile]
Scientific American has this article on Information Technology, 2500 B.C. on what life was like for the information worker of that day.
As many as half a million cuneiform tablets, hand size up to book-page size, are now available around the world. Surely many more are waiting to be found. Those samples are of every quality: once prized accounts and receipts, schoolboys' lessons, litigation profound or droll, literary essays, erotica, mathematics--and entire ancient epics, centuries older than Father Abraham's. A mostly unread treasury, comprising the equivalent of tens of thousands of large printed volumes.
Looks like there could be a lot of fun and good stuff there.
"It is a greater offense to steal men's labor, than their clothes"
-- William "Scorpion King" Gates
"It was a summer's tale: Just a boy, his Linux, and a head full of dreams..."
Sooo... this project has been going on for about 5,000 years, they're finally going to be making a large release in a few years, and we're *JUST NOW* hearing about this?
My *god*, talk about keeping the PR lid on tight!
Unicode is often referred to as a 16-bit system, which would allow for only 65,536 characters, but by reserving some code points for mapping into additional 16-bit planes, it has the potential to cope with over one million unique characters.
The current version (3.1) of the Unicode Standard, developed by the Unicode Consortium, assigns a unique identifier to each of 94,140 characters
I used to bulls-eye womp-rats in my pants
IIRC, cuneiform writing is composed entirely of angle brackets. To write this in XML, every character is going to have to be escaped!
"justified.dtd" >
The cuneiforms are justified and ancient.
and well formed.
XML is gonna rock you.
I believe the ancient Egyptians avoiding using XML at the time because of concerns over RAND licencing and prefered the patent-free ideograms.
No, really.
Reliable, Great Value Hosting: $7.95/mo 2.4G/120G
I was worried I might end up here instead...
Correct me if I'm wrong, but what is XML doing that some homegrown solution couldn't? Obviously clients would have to know the protocol, but with XML that is also the case.
I use XML all the time, maily because of XSLT, but I think its less functional and more hype. Feel free to enlighten me.
Clients would have to know and implement the protocol. But since XML always looks the same, implementing the protocol is just a matter of linking the standard XML library in the language of your choice and using the DTD to decide what you want your client to understand.
There's other advantages, but that's a big one.
It's rare that you're presented with a knob whose only two positions are Make History and Flee Your Glorious Destiny.
Consider, for example, the carry dots that some people use to add up numbers. Dots and things like that in the text may well uncover the way that calculations were done.
OS/2 - because choice is a terrible thing to waste.
I haven't looked in almost a year now, but the last time I did, there was an alpha (rendered lots of graphics correctly, lots incorrectly) patch for Mozilla and no SVG support for IE or any other browser. Did everybody catch up while I wasn't looking?
All I can think of now is the new book series:
"XML for Mummies"
At least in this case when you see the reviews "this book will put you to sleep" it really doesn't matter.
-Brad
the xml.org link for cuneiform encoding initiative is at http://www.jhu.edu/ice/
There is an initiative for almost every ancient language that is know (and decipherable). I'm sure digging thru xml.org will turn up a bounty of results =]
"I would say that 99 per cent of what my father has written about his own life is false." - L. Ron Hubbard Jr.
So a 5000 years original text should be no problem.
The case will happen if you ask for the translation (What, you are not Cuneiform litterate ? Talk about education 8)
It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
Copyright is 70 years on books
No, 95 years on all works first published on or after January 31, 1923. See also Sonny Bono Copyright Term Extension Act. And it'll get even longer before 2020 as Di$ney frantically bribes Congre$$ to pass yet another corporate-welfare copyright extension.
The case will happen if you ask for the translation
Will I retire or break 10K?
Not having these glyphs in the Unicode set would be like asking English-speakers to use alphabets reduced by five or six characters (M and N are similar, X, Q, C and Z could be replaced by one character as well)
Spelling reform. China (outside Taiwan) has had it. It's perfectly possible to write English with only 18 letters.
and dictionaries from which three out of four words have been deleted due to redundancy or age
So? Desk dictionaries aren't nearly as comprehensive as Oxford English Dictionary or even the unabridged Webster's Third New International Dictionary.
Will I retire or break 10K?
Then I can write a washing bill in Babylonic cuneiform
What are the earliest statements they've found? What do they say?
"First post"
- Scott
Scott Stevenson
Tree House Ideas