XML for Ancients
Andrew writes: "More than 5,000 years ago, the very first information revolution occurred when some unknown research team in Mesopotamia found a way to download and store language through a killer application called "writing.". The cuneiform digital library will have 60,000 texts ready in a couple of years. Using SVG and XML to represent their documents. Similar efforts are underway for hieroglyphics."
I have been working in IT since 1997, yeah I know a mere blink of an eye for some Unix Wizards (ie. beards, strange clothing and their own arcane language). What I have noticed is that every year my handwriting has been getting progressively worse. What with my PDA, laptop, PCs etc. I just have no need to wield a pen no more :)
Apart from signing my name on credit card chits, the only time I am required to write is for birthday/Christmas and other assorted cards. Its getting so bad now that I start to write a long word and just give up. My once pristine handwriting now looks like a doctors prescription scrawl.
Any else get this too?
Po
Site appears to be slash-dotted already...
So.. Are these 5000 year old documents going to be freely available or will the database of texts be copyrighted/restricted?
Looking for any old 8-bit Heathkit/Zenith software/hardware - http://heathkit.garlanger.com
With all these ancient language/hieroglyphic texts being archived, I have a feeling that we'll be hitting that 65536 character wall very shortly, since someone in the future might need that Cunieform version of M$ Word (hey, it could happen). Is it time for UTF-32?
Wonder how long it will be before someone finds something interesting here, and how long it will take to "doctor" it?
Alternately, how long will it take for someone to fake something.
OS/2 - because choice is a terrible thing to waste.
They're using XML? They could integrate this with some sort of retrieval language and couple it with Jabber clients. That way you could send some sort of command-line search/retrieval command to the database using a regular Jabber client and have the XML data sent back, since Jabber natively supports the standard.
How Snowcrash.
they are also writing their tcp packets on clay tablets, and attempting to send them down the wire. That was the quickest /.'ing I've *ever* seen.
[smile]
Scientific American has this article on Information Technology, 2500 B.C. on what life was like for the information worker of that day.
As many as half a million cuneiform tablets, hand size up to book-page size, are now available around the world. Surely many more are waiting to be found. Those samples are of every quality: once prized accounts and receipts, schoolboys' lessons, litigation profound or droll, literary essays, erotica, mathematics--and entire ancient epics, centuries older than Father Abraham's. A mostly unread treasury, comprising the equivalent of tens of thousands of large printed volumes.
Looks like there could be a lot of fun and good stuff there.
"It is a greater offense to steal men's labor, than their clothes"
-- William "Scorpion King" Gates
"It was a summer's tale: Just a boy, his Linux, and a head full of dreams..."
Sooo... this project has been going on for about 5,000 years, they're finally going to be making a large release in a few years, and we're *JUST NOW* hearing about this?
My *god*, talk about keeping the PR lid on tight!
Unicode is often referred to as a 16-bit system, which would allow for only 65,536 characters, but by reserving some code points for mapping into additional 16-bit planes, it has the potential to cope with over one million unique characters.
The current version (3.1) of the Unicode Standard, developed by the Unicode Consortium, assigns a unique identifier to each of 94,140 characters
I used to bulls-eye womp-rats in my pants
...or else the uhh.. because... uhmm..
Oh, what the hell.
Micro$oft sucks.
...by the pretentious language used to trick up something that doesn't need to be jazzed up with references to modern digi-whiz-bang this-and-that?
come on, cuneiform is damn neat all by itself. no need to make cutesy-ass references to the digital internet web cyberspace whatever.
IIRC, cuneiform writing is composed entirely of angle brackets. To write this in XML, every character is going to have to be escaped!
This is how.
"justified.dtd" >
The cuneiforms are justified and ancient.
and well formed.
XML is gonna rock you.
Batch,
C#,
C++,
Diff,
Eiffel,
HTML,
IDL,
Java,
JavaScript,
LaTex,
Lisp,
Lua,
Makefile,
PHP,
Pascal,
Perl,
Python,
Ruby,
SQL,
TCL,
Text,
VisualBasic,
XML,
XLST.
Have a really good day, Mr Katz.
I believe the ancient Egyptians avoiding using XML at the time because of concerns over RAND licencing and prefered the patent-free ideograms.
No, really.
Reliable, Great Value Hosting: $7.95/mo 2.4G/120G
I was worried I might end up here instead...
Dammit... I want to look at old books and stuff, but they're web server is going... very.... very... slow.....
....ooops... it... just... died....
Lets all of us try and hit it again in exactly 12 hours, hey?
Correct me if I'm wrong, but what is XML doing that some homegrown solution couldn't? Obviously clients would have to know the protocol, but with XML that is also the case.
I use XML all the time, maily because of XSLT, but I think its less functional and more hype. Feel free to enlighten me.
the very first information revolution occurred when some unknown research team in Mesopotamia found a way to download and store language through a killer application called "writing."
Talk about dead projects. I mean, freshmeat has nothing on these guys. 5,000 years, and how many upgrades? I'm STILL using writing 1.0, for chrissakes, not because it's better, but because there are no other versions!
python -c "x='python -c %sx=%s; print x%%(chr(34),repr(x),chr(34))%s'; print x%(chr(34),repr(x),chr(34))"
Clients would have to know and implement the protocol. But since XML always looks the same, implementing the protocol is just a matter of linking the standard XML library in the language of your choice and using the DTD to decide what you want your client to understand.
There's other advantages, but that's a big one.
It's rare that you're presented with a knob whose only two positions are Make History and Flee Your Glorious Destiny.
Consider, for example, the carry dots that some people use to add up numbers. Dots and things like that in the text may well uncover the way that calculations were done.
OS/2 - because choice is a terrible thing to waste.
Who sung that?
http://www.deathrock.com/tsol/lyrics.html
I haven't looked in almost a year now, but the last time I did, there was an alpha (rendered lots of graphics correctly, lots incorrectly) patch for Mozilla and no SVG support for IE or any other browser. Did everybody catch up while I wasn't looking?
I know it's bad form to criticise someones writing on /. but it really is time for you to catch up with modern developments ... Version 1.0 (codename cuneiform) has long been superceded by 2.0 (codename Heiroglyphics), 3.0 (greek), and 3.1 (latin).
While there is still some support for all sub-releases of version 3, I suggest you upgrade to the latest release (3.1.27 - 'joined up alphanumeric').
Of course there has been some criticism of the 'open source' nature of the writing project with claims that it leads to too many active branches (most notably with interoperability issues with the popular 'Chinese', 'Arabic' and 'Roman' branches).
And I just took that book out of the library to re-read again....very bizarre....
On this lovely Thursday.
SVG (Scalable Vector Graphics) is subset of XML. Stating that something is stored in SVG AND XML format is a tautology.
"There is a terrorist behind every bush"
All I can think of now is the new book series:
"XML for Mummies"
At least in this case when you see the reviews "this book will put you to sleep" it really doesn't matter.
-Brad
the xml.org link for cuneiform encoding initiative is at http://www.jhu.edu/ice/
There is an initiative for almost every ancient language that is know (and decipherable). I'm sure digging thru xml.org will turn up a bounty of results =]
"I would say that 99 per cent of what my father has written about his own life is false." - L. Ron Hubbard Jr.
all your stoopid hemrons belong to l ron cupboard
Claytablets also had a very good way of authenticiation: "...The task of preserving the integrity of the records, incidentally, posed the problem of how to prevent
unauthorized changes. Seals were in wide use, but the Assyrians invented a system of wrapping the
clay tablet with a contract in a second layer of clay and copying the contract again on the enveloppe.
In case of disagreement on the authenticity of the text on the outer layer, this layer was broken and
the inside could be inspected. As there was no way of changing the inside tablet without damaging
the outer layer beyond repair, this offered an early but very effective way of authenticiation
Hans Paijmans
So a 5000 years original text should be no problem.
The case will happen if you ask for the translation (What, you are not Cuneiform litterate ? Talk about education 8)
It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
As it says in Snowcrash, cuneiform is just a succession of 1 & 0s...
I mean, about just the same as todays computers...
Maybe we could try and feed the Enki story to a computer...
It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
So, did they have the slashdot effect back then? Umm, like 100 scholars trying to access a single clay tablet at the same time? Did the clay tablet crash? Wait a minute - did I just give an anology where slashdotters were scholars? Woah, must have regular dose of caffine before I make more 'horrendous mistakes'.
-Shaunak.
Copyright is 70 years on books
No, 95 years on all works first published on or after January 31, 1923. See also Sonny Bono Copyright Term Extension Act. And it'll get even longer before 2020 as Di$ney frantically bribes Congre$$ to pass yet another corporate-welfare copyright extension.
The case will happen if you ask for the translation
Will I retire or break 10K?
Not having these glyphs in the Unicode set would be like asking English-speakers to use alphabets reduced by five or six characters (M and N are similar, X, Q, C and Z could be replaced by one character as well)
Spelling reform. China (outside Taiwan) has had it. It's perfectly possible to write English with only 18 letters.
and dictionaries from which three out of four words have been deleted due to redundancy or age
So? Desk dictionaries aren't nearly as comprehensive as Oxford English Dictionary or even the unabridged Webster's Third New International Dictionary.
Will I retire or break 10K?
Weren't the old drum storage systems used in the 1960's a ceramic structure coated with magentic surface? And that was an improvement on those birch bark 80 column cards.
But now we have advanced ceramics used in various other electronic media. And we measure our mean time between failure in hours.
So, how far have we really come in the last 5,000 years? They had fire and clay and their data remains readable after 5,000 years. We have lightening and clay and can't read data from 15 years ago and hard drives can fail in a flash.
Why aren't we planning storage and retrieval systems that can last thousands of years? Is it because our technical culture only values the last 2 to 3 years? How will we answer to our children when they can't figure out what we did 25 or 50 years from now? And I don't think we can blame it all as a planned obsolescence feature of Microsoft...well, maybe not all of it!
i certainly hope its a freely available public resource... i've been studying cuneiform texts (mainly Sumerian myths) for a couple years now and an archive like that would.. well the idea lossens my bowels and excites my senses! in other words i damn near crapped myself with joy when i read that.
Truth, Just Us, And Hatred For All Mankind!
Then I can write a washing bill in Babylonic cuneiform
My father worked as a bank manager for some time. he was signing things every couple of _seconds_. As a result his sig now resembles a stylized 'R' with a squigle that he can write faster then you'd think possible. It's also, nearly, impossible to forge (hey, we all went through school, no? ;P)
Did they find exactly who invented writing? What are the earliest statements they've found? What do they say?
Actually, they're called "hieroglyphs," and the writing is called "hieroglyphic writing." This is according to The Oriental Institute in Chicago.
Cheers, Toliaro
What are the earliest statements they've found? What do they say?
"First post"
- Scott
Scott Stevenson
Tree House Ideas