Claimed Proof That UNIX Code Was Copied Into Linux

More details and downladable archive by tomhudson · 2010-07-11 14:22 · Score: 5, Informative

More details, and a downloadable archive here - because there's no telling how long those files will remain on McBride's blog,

Also, we find out more about streams, and how SCOsource was bogus.

Re:More details and downladable archive by 10101001+10101001 · 2010-07-11 16:01 · Score: 2, Interesting

The best part? The Tab 229 example includes a define for RTLD_GLOBAL but in the SCO code it's value is 4 instead of the 0x100 used in Linux. Why the discrepancy? Well, probably because the FSF was cloning BSD, not Unix (as BSD was probably more popular and readily available to many than one of the myriad Unix forks). Oops.
Perhaps McBride is unaware of the BSD lawsuit? Certainly, if anyone has any room to complain, it'd be Berkley. However, given that the examples seem to repeatedly have jumbled lines or inconsistent values, I'd imagine that regular reverse-engineering was employed in the construction of most of the headers. Ie, it just further highlights how unlikely there was copyright infringement.

--
Eurohacker European paranoia, gun rights, and h

Shocking by Kenoli · 2010-07-11 14:24 · Score: 4, Funny

How dare they copy/paste those blank lines!

Re:Shocking by bsDaemon · 2010-07-11 14:28 · Score: 2, Funny

between that and pre-processor #include directives from the standard C library and POSIX stuff, well... damn. How could any judge have failed to see this!? /sarcasm
Re:Shocking by physicsdot · 2010-07-11 16:07 · Score: 4, Informative

How dare they copy/paste those blank lines!
Just in case you thought you were kidding: http://www.mcbride-law.com/wp-content/uploads/2010/07/Tab-422.pdf
Line 22 is blank, and is indicated as being copied.
Re:Shocking by mpe · 2010-07-11 18:47 · Score: 5, Insightful

This was the first time I had a chance to see any of their "evidence". How exactly did this make it all the way to court?

Especially given that "original" files appear to have been altered to contain comments to the effect that a USL copyright statement somehow proves that the information is "UNPUBLISHED PROPRIETARY SOURCE CODE" together with a meaningless statement about the existance of a copyright statement not implying publication.
It looks as though they have "mailmerged" part of their claims into their supposed evidence. Shouldn't this have resulted in the judge throwing this out as null and void?

Re:First post by Anonymous Coward · 2010-07-11 14:27 · Score: 3, Insightful

Yeah, because open-source developers can just see and copy the proprietary closed-source code - oh wait...

Re:More details and downloadable archive by tomhudson · 2010-07-11 14:27 · Score: 5, Informative

For those not logged in who don't see the download url in my sig

"In a blog post dated July 10th, 2010, Kevin McBride has leaked almost 50 of the code comparisons that were submitted in evidence in SCO vs Novell. You can download the archive.

Read on to view individual files if you don't want to download the whole thing.

Linux STREAMS

We also learned that the whole STREAMS fuss was not about linux, but about a product distributed by gcom, a provider of legacy solutions.

Their Linux STREAMS (LiS) product provides a couple of loadable drivers that would intercept calls to the old streams api and convert them. In other words, far from the allegations that the linux kernel contained code that infringed streams, it's evident from the need of an add-on loadable module that the linux kernel does not contain any STREAMS code.

Of particular note, and probably a source of much consternation to SCO and their proponents, is that LiS itself doesn't implement streams either, just does protocol translation. So neither linux nor LiS contains infringing code.

The whole end-user $699 license was a scam

In my view, contract violations by IBM would not result in liabilities by other Linux users.

So according to Kevin McBride, one of the lawyers who worked on the case, there was no reason for end users to take out a license. It's logical to conclude that SCOsource was a protection scam. So what happened? To me, it looks like SCO lawyer-shopped until they found attorneys who were willing to go along with the scheme for a price - everyone has their price, and in this case, it was $30,000,000.00.

The Appeal of SCO's loss to Novell - Novell will probably win.

Will Novell win the current SCO appeal? Probably. Will Novell donate the UNIX copyrights to the Linux community if it wins the current appeal? Probably-although Novell's Linux activities have been difficult to predict in recent years.

So it's pretty much as we suspected all along.

comments added... by nacks1 · 2010-07-11 14:28 · Score: 2, Interesting

I find it rather funny that the Linux code is well commented but the SVR4 code has little to no comments at all. Just because the function names are the same doesn't mean it was copied. It just means that the coders implemented functions with the same names (and I bet that the Linux versions worked rather differently than the original SVR4 code).

Re:comments added... by Jane+Q.+Public · 2010-07-11 15:31 · Score: 2, Insightful

I agree. As a programmer myself, I saw significant differences between the two sets of code in each example that SCO claimed was "evidence". I would have to look at more of the examples, but from what I saw, if I were the judge, I'd tell SCO to stop wasting everybody's time.
Re:comments added... by silentcoder · 2010-07-12 00:37 · Score: 2, Insightful

Keep in mind further a huge chunk of that code is GLIBC code - that is code that isn't part of Linux, it's from GNU and was written by the FSF (interestingly SCO never tried to sue the FSF, but non profit orgs tend to lack much money so I guess their not great targets for an extortionist).
IBM may have contributed some code to glibc but under FSF rules it could not be accepted unless they signed over copyright anyway. Still if it was infringing from there then I can see the point of sueing IBM.
All this are more like an intro though to my actual response to your comment: which is yes, and so did GNU. In fact RMS details in the history of the gnu project the (rather radically) different design decisions that he and his team made early on in their writing of unix-like components for gnu where their implementations completely deviated from the original UNIX versions:
1) Whenever a file was small enough (I believe their limit CA 1988) was 2mb) it would be read entirely into memory and not buffered in chunks from disk for any program, and then written back as a whole again. This uses more memory but is a much faster way to work. Today Linux (and other OS's) actually does this on the filesystem level with massive amounts of file changes kept in memory and only written out in idle-times or at dismount time because memory is so much faster than disk.
2) They made no attempt to support any architectures below 32-bit, so no you couldn't compile gnu on a 16-bit system. Downside - even if you could port the linux kernel to a 286 the rest of the base system wouldn't be able to compile on it. Upside - it could actually take advantage of the capabilities of 32-bit systems, especially with regard to memory addressing and use optimizations in how variables were defined based on this.
These are just some pieces of a much longer list, but they were the main reason why the gnu tools developed a major reputation for being faster and more reliable than their unix counterparts and the popularity of the gnu toolchain prior to the appearance of any kernel. When Linux appeared on the scene and was married to the two, it's own approach of maximising it's target platforrm's abilities (the same thing that made Tanenbaum angry) created probably the most performance efficient operating system of the time. Much faster than any other unix then available and able to run on commodity hardware. The legendary portability would only come later (you can largely thank Jon Maddog Hall for that as he got that ball rolling).
But the core point is - for very good reasons both the FSF and the Linux developers used some radically different fundamental design goals in how they wrote their tools compared to unix, these differences led to things be implemented in radically different ways on some levels. Your suggestion is verifiable and documented fact. This doesn't change that there really is only one way to write a proper quicksort - the core algorythms may well have had significant similarity, but the surrounding code was designed differently (how the data got INTO the algorythm in the first place was one of the things that GNU deliberately didn't do the same way unix did - as per example one above).

--
Unicode killed the ASCII-art *

Even if the Linux folks didn't, I did by wandazulu · 2010-07-11 14:31 · Score: 5, Funny

I'm really sorry, but there was some code that was already written that was just too good to pass up for the project I was on:

#include int main(int argc, char* argv[]) { printf("Hello World!\n"); return 0; }
Now that I'm using Java, it won't happen again.

Re:Even if the Linux folks didn't, I did by mysidia · 2010-07-11 14:45 · Score: 3, Funny

Are you sure it hasn't already happened again? // hello.java /** This application greets the world. * * @Deprecated Earth has been destroyed by global warming, this is superceded by the goodbye class */ @Deprecated public class hello { public static void main(String args[]) { System.out.println("Hello World!"); } }
Re:Even if the Linux folks didn't, I did by JWSmythe · 2010-07-11 15:59 · Score: 2, Informative

Slashdot tends to eat anything that looks like an HTML tag. I'm sure when he put it in, it said:

#include <stdio.h>

--
Serious? Seriousness is well above my pay grade.

Re:Cheat Detection by Darkness404 · 2010-07-11 14:34 · Score: 2, Interesting

The difference is that there are only a handful of solutions for the same problem when it comes to computer code and still make sense.

On the other hand there are tons of ways of conveying a simple fact in English, consider the statement that "George Washington was the first president of the US" the same statement might read that The first president of the US was George Washington. George Washington was elected as the first president of the US. The first president in office in the US was George Washington. And so on.

--
Taxation is legalized theft, no more, no less.

SCO! by Anonymous Coward · 2010-07-11 14:38 · Score: 3, Funny

Die, Monster, Die!

Re:SCO! by lostmongoose · 2010-07-11 14:58 · Score: 3, Insightful

How do you kill that which is already dead?
Re:SCO! by MightyMartian · 2010-07-11 15:09 · Score: 2, Insightful

I think it's about time for someone to bring up the Dead Parrot Sketch. It seems to suit this situation perfectly.

--
The world's burning. Moped Jesus spotted on I50. Details at 11.
Re:SCO! by jd · 2010-07-11 16:13 · Score: 2

That which is not dead may eternal lie
And in strange eons even death may die

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Re:SCO! by deniable · 2010-07-11 17:07 · Score: 2, Funny

Parrot's not dead, it's still in beta.

libelf!?! by Dahamma · 2010-07-11 14:39 · Score: 5, Informative

I actually find it ironic that libelf was picked as an example of infringement. I can tell you first hand that the (more standard) UNIX/Solaris libelf is NOT compatible with the Linux/libc libelf. And I can also tell you that after pointing this out to Ulrich Drepper he really didn't give a shit... (I think his approximate words were "It's been like that for a while, too late, I won't change it").

Their only mistake was actually naming it "libelf"... since it is most definitely NOT the same library...

Re:libelf!?! by olsmeister · 2010-07-11 14:48 · Score: 5, Funny

And here I always thought that libelf was the version of the libel() function that returned a nicely formatted incorrect defamatory statement.

Re:What's so liberal about it? by tsalmark · 2010-07-11 14:41 · Score: 4, Informative

You're not a programmer are you. The header files are pretty much just a bunch of definitions. there is no programming to speak of in either of those files. From reading the Posix standards you will end up with the same code but with your own comments. The header files are a lot like the ingredients section of a recipe. So it's like looking at two recipes for omelets then complaining that both have eggs listed in the ingredients section.

Re:What's so liberal about it? by mewyn · 2010-07-11 14:42 · Score: 5, Informative

It's a header file for a standardized interface. All this stuff needs to be the same for any *NIX-like operating system to be *NIX-like, otherwise, you're making an incompatible operating system. To make source-compatible operating systems you need to have common interfaces, and those interfaces lie in the header files. Saying that this is copyright infringement is like saying that they patented a hole in the wall as a way of getting in and out of a room.

Re:What's so liberal about it? by Anonymous Coward · 2010-07-11 14:43 · Score: 5, Informative

In case you are not trolling, virtually all of the allegedly copied code is boilerplate stuff defining types and structs or function interfaces. These have to be the same for Linux to be posix compatible. The little actual code there is, it isn't similar at all. Copyright can't keep you from writing a function that acts like another, that is for software patents, there should be actual copying and for such tiny functions it would be pretty hard to demonstrate (!s) was copied from (s==NULL).

I still think the jury, knowing nothing about computers, would have ruled against Linux, but the claims were ridiculous.

Re:What's so liberal about it? by Anonymous Coward · 2010-07-11 14:45 · Score: 2, Informative

If that's the one I'm looking at it's a header file for handling ELF executables.
All that's in there are the names, arguments and return types of ELF functions, and constants used.

All the names, values and return types of those functions are defined in the ELF standard, as are the constants. Of course they're going to be the same, they HAVE to be, and the ELF standard wasn't a proprietary Unix thing.

Note that the actual implementation of these functions is not in question, because it's not the same code

Re:What's so liberal about it? by marcansoft · 2010-07-11 14:47 · Score: 4, Interesting

Of course it looks rearranged. It's a header file. Some of the ELF constants come straight from the ELF spec. The #ifndef stuff is bog standard code, there are a finite number of ways of writing that and the one presented happens to be the most common. The #include is another "duh" - of course you have to #include the right header, that doesn't mean it's copied. The header file is presumably deliberately compatible with the original, hence the function definitions are prototype-compatible (while being considerably different in style).

There is nothing indicative of code copying in that PDF. The Linux header is just about as different as it can be while remaining source-compatible, as it should be.

It's only a header file by DrJimbo · 2010-07-11 14:48 · Score: 4, Informative

From Sega v. Accolade:

Computer programs pose unique problems for the application of the "idea/expression distinction" that determines the extent of copyright protection. To the extent that there are many possible ways of accomplishing a given task or fulfilling a particular market demand, the programmer's choice of program structure and design may be highly creative and idiosyncratic. However, computer programs are, in essence, utilitarian articles -- articles that accomplish tasks. As such, they contain many logical, structural, and visual display elements that are dictated by external factors such as compatibility requirements and industry demands... In some circumstances, even the exact set of commands used by the programmer is deemed functional rather than creative for the purposes of copyright. When specific instructions, even though previously copyrighted, are the only and essential means of accomplishing a given task, their later use by another will not amount to infringement.

It is nearly impossible to win a copyright suit over a header file. The only chance you would have would be if it was a straight copy-and-paste which this was clearly not. The reason for this is that there is just not much room for creative expression in header files. Likewise, you can't copyright a word or a short sentence.

There are a very limited number of ways to declare functions. If someone was allowed to copyright certain function declarations then they would have control over a large segment of the software industry. Likewise, if someone was allowed to copyright particular words, they would have control over a segment of the publishing industry.

--
We don't see the world as it is, we see it as we are.
-- Anais Nin

Re:It's only a header file by GigaplexNZ · 2010-07-11 15:22 · Score: 2, Interesting

Likewise, if someone was allowed to copyright particular words, they would have control over a segment of the publishing industry.
If the RIAA can claim copyright infringement because someone sung a copyrighted song in public, one could claim copyright infringement if those copyrighted words were spoken in public. You'd have control over a lot more than just the publishing industry.
Re:It's only a header file by postbigbang · 2010-07-11 15:24 · Score: 2, Insightful

To drive that point home, take a look at BSD kernel source code, libs, headers, etc etc.
There was this litigation, long ago, University of California Regents, if I recall.... and it caused this schism.... and more free code than had probably ever been produced with C before Torvalds and RMS came along.

--
---- Teach Peace. It's Cheaper Than War.

Re:What's so liberal about it? by Nikker · 2010-07-11 14:48 · Score: 4, Insightful

Nah it's a joke. The only thing highlighted were the function / subroutine definitions and not even across the board. Just because 2 programs have hooks or functions called "ReadX" does not mean there was any copying involved. They even highlighted include statements and data structures. It's almost like suing an author for starting with "Once upon a time". I guess they just figured no judge would be able to clue in on this kind of stuff and they would be able to sift through junk like this for decades. Hell they could use the same reasoning against pretty much any software and win if that is all the proof he needed. I guess Unix was truly the precursor to all the code ever written so Novell can truly say "all your base are belong to us".

--
A loop, by its nature, continues. If that didn't make sense, start reading this sentence again.

clutching at red straws on pdfs by euphemistic · 2010-07-11 14:48 · Score: 2, Insightful

I went and looked through a handful of the pdfs, most of what he's claiming are just variable declarations with intuitive naming (as they well should be). Really? What's next, someone claiming copyright infringement based on using the letter i as a temporary variable? Thank god for copyright law, or else people might just get away with such atrocious fraud... Furthermore, if it takes that many red lines to elucidate the copy&pasting, surely that's visual indication enough that you're clutching at straws.

Re:clutching at red straws on pdfs by zeroshade · 2010-07-12 05:51 · Score: 2, Insightful

The GPL is only needed due to copyright law. Without copyright law, there would be no need for the GPL as anyone could use the code without issue. =) Necessary evil and all that jazz.

Re:What's so liberal about it? by SpazmodeusG · 2010-07-11 14:49 · Score: 4, Informative

No, read the POSIX interface standard (or in this case specifically the ELF executable standard).
You have to give your functions certain names to be compliant to the specification. The code shown is interface code, the implementation is somewhere else. Interface code simply names the functions, parameters and variables. As the functions must have certain names and parameters to fit the standard you will get the exact same line that declares a function. Any C programmer could recreate that same block of code with just a list of functions names and parameters that must be declared.

eg. If you have to have a global function called elf_version with return of unsigned int and parameter of the version you'll get the line
extern unsigned in elf_version( unsigned int __version );

We see that same line of code in both files as they both implement the same specification. I'm sure there's a ton of other UNIXes out there that have the same line of code.

Oh Good by bky1701 · 2010-07-11 14:53 · Score: 3, Insightful

More news about imaginary property. How much time and money does our society waste on propping up this outdated concept that you can own an idea? "#include " constitutes copy and pasting? I guess every program on earth violates the copyright of the guy who first wrote "int main(", and whoever started the convention of naming C++ files cpp or cxx should be hiring a lawyer about now. Money is to be made.

--
Great Intellect...

Re:Oh Good by jbengt · 2010-07-11 15:12 · Score: 2, Insightful

It's not copyright infringement to write something required for a specific functionality (e.g. POSIX compliance or API compatibility) even if it's exaclty the same words. Copyright only covers creative expression. If the ways of saying something are limited, that's not covered by copyright.
Of course that didn't stop the SCO trolls from trying and wasting everyone's time.
Delayed justice is no justice at all.
Re:Oh Good by bky1701 · 2010-07-11 16:21 · Score: 4, Insightful

"It's not an "outdated idea". If you really think that copyrights and patents are a bad idea, you need only look at countries where they did not exist -- like Russia during its peak of Socialist power -- to see how that works out economically. Hint: it doesn't. "

You're seriously assigning all the problems with the Soviet economy to their lack of copyright? You, sir, take the cake. That's the most absurd statement in support of copyright I have ever heard.

"The idea being that those who ignore (or don't know) their history, will be doomed to repeat it. And you obviously don't know your history, or I am about as certain as the sun will come up tomorrow that you would change your mind on that issue. "

Given your interpretation of the history of economics is rather... I'll be nice and say 'intriguing', I guess I have to agree that I don't know it. I don't see how the lack of copyright led to Stalin screwing the country over, but I guess I am just dense.

"The fact that some laws have been corrupted, like the duration of copyrights for example, is not justification for elimination of all such laws. Sure, it needs to put back the way it was, but not eliminated."

But the fact they are built on the concept of preventing free flow of information - and the fact they can no longer be enforced even by companies with more power than some countries - DOES make copyright and patents outdated. That ship has sailed.

"Copyrights and patents were established for the public good, and believe me, in the places where there are none, there is also damned little public good."

The entire world prior to about the 18th century begs to differ with you. On the other hand, I don't know history, so don't listen to me; nothing of merit was produced before copyright! The Greek epics, countless thousands of books, paintings, songs, architecture, machine designs, all came to exist about 300 years ago, right around the time copyright was invented. Not one before it was established. Thank you for setting the record straight!

--
Great Intellect...
Re:Oh Good by Anonymous Coward · 2010-07-11 16:47 · Score: 5, Insightful

Sorry, Jane. You are wrong. Not your fault, really, since you are just repeating what you've been told many times over the years. But it is wrong.
Your appeal to socialist Russia as an example of why copyrights are needed is laughable. The Soviet system failed mainly for many reasons, but lack of copyright recognition wasn't one of them.
The fact is that the idea that copyrights and patents are a benefit to society is based on no evidence at all. It is one of those ideas that were accepted for a long time without being examined in any detail. There are many examples of places and times where copyrights and patents did not exist and innovation there certainly was not harmed, and careful studies have found pretty good evidence that innovation was helped by their absence. The site techdirt.com discusses these issues regularly, frequently pointing to academic research in peer reviewed journals so you can check out the research yourself if you don't trust the reporting. But you don't have to depend only on research. There are examples today where industries thrive in the absence of copyrights or patents. One easy to understand example is the fashion business. No copyrights or patents. Lots of copying, yet good money is being made by many players. And there's certainly no lack innovation there.
It certainly is true that eliminating copyrights and patents would be kind of disruptive, since some large businesses have come to depend on the artifical monopoly authorized by copyright and patents. But there is reason to believe that we would be better off now had they never been invented. Whether the disruption that would come from eliminating them is reason enough to keep them is questionable. I am pretty sure we would quickly adjust to their absence if they were abolished tomorrow, and be better off, on the whole. Of course, that is unlikely to happen because our government is bought and paid for by the businesses that depend on copyrights and patents, so we will have to suffer the burden of these government-granted monopolies for some considerable time to come.
Re:Oh Good by bky1701 · 2010-07-11 18:13 · Score: 2, Insightful

And were it some small group of developers that SCO sued rather than Red Hat/Novell/IBM, it would have been worse than delayed justice. Copyright does not protect the little guy, much like the majority of our legal system.

--
Great Intellect...
Re:Oh Good by ignavus · 2010-07-11 21:00 · Score: 3, Insightful

"It's not an "outdated idea". If you really think that copyrights and patents are a bad idea, you need only look at countries where they did not exist -- like Russia during its peak of Socialist power -- to see how that works out economically. Hint: it doesn't. "
A wild thought. It is *JUST* possible that some other aspect of communism, besides the lack of copyright, was responsible for the slow rate of development in the Soviet Union.
Your reasoning would lead us to conclude that the high rate of alcoholism and consumption of vodka was caused by the lack of copyrights ... and that Mr Gorbachev need only turn back the clock, implement a proper copyright system, and become a Hero of the Soviet Union by saving them all from economic perdition.

--
I am anarch of all I survey.
Re:Oh Good by silentcoder · 2010-07-12 01:33 · Score: 2, Insightful

>It's not an "outdated idea". If you really think that copyrights and patents are a bad idea, you need only look at countries where they did not exist -- like Russia during its peak of Socialist power -- to see how that works out economically. Hint: it doesn't.
Oh that's right, Russia's economy failed because they didn't have IP laws. Odd they wrote some of the 20th century's best literature (Dostoyevski for example) without copyright. They made some of the 20th century's most important scientific advances without patents (they got to space BEFORE America remember).
Seem the facts don't match your claims but you must be right because you studied history. Obviously an excessive amount of government power and high levels of (creativity diminishing) censorship had nothing to do with it. The fact that every xerox machine in Russia had an armed guard next to it to prevent samizdat in fact made for a much more complete control of copying (with the technology of the time) than the copyright in the west did - and modern copyright regimes have far more in COMMON with Russia's control over copying during communism than it does with capitalism.
The fact that their economy was tied up by a massive an inefficient beaurocracy didn't contribute to their failure at all and of course all those factory workers who didn't bother to be productive because in that kind of communist system there is no way to advance anyway and even if you did you'd still be earning just as little (unless of course you were military or in politics and could get special favors) didn't do anything to harm their productivity (and thus their economy). All those economics theorists and philosophers who say so are obviously just trying to trick us all. Good thing the trick worked because it was the cornerstone of the US's approach to beating Russia during the last decade of the cold war.
The dictatorial system with it's endemic corruption also did no harm to Russia's economy. It all failed because they didn't have strong enough patent and copyright laws and we can prove this by indicating that after 20 years the only countries where these laws are not getting pretty damn draconian are the ones which have no government to speak off, or at least, none that the rest of the world gives a damn about and so their people tend to suffer.
Yeah that makes sense. The good countries are good because they have draconian IP laws. It isn't that the people who lobby for draconian IP laws tend not to bother with the poorest countries on earth because there's no point trying play extortionist with people who don't have any money you can fleece them for. That has nothing to do with it.
Thanks you're intelligently researched arguments with their undeniable factual basis, so complete and devoid of any selective reporting have changed my minds. IP laws for the win...
PS. If you haven't figured it out by now. This post is a textbook example of sarcasm.

--
Unicode killed the ASCII-art *

variable names and data structures. by goombah99 · 2010-07-11 14:55 · Score: 4, Interesting

Comparing a variable named elf_t_arname to one names elf_c_arname is not very convincing. The suffix is generic, the prefix is activity specific, and the middle letter is presumably some datatype indicator.
Where it gets dicey is when there are structs and every variable in the struct has a somewhat similarly named variable in the other one. This does arouse suspicion. even if you forget the variable names for a moment, any pattern like bool,real,real, *real, int, *char,*char,*bool,.... that is identical between two structs would be an improbable occurence. and when you see it in back to back structs it becomes nearly impossible to happen by chance.

The key question then is if there is some structural reason why the two might share an identical stuct? for example, is there an elf spec that defines a protocol for communication or the way a record on disk is serialized (i.e. packed)? if so then of course these will occur like this. Or perhaps both are derived from a common BSD ancestor so both vary only slightly.

if the answer is no, there was no reference implementation and no ancestor then I'd say that for examples like 251, Mcbride has some evidence.

However for most of the ones he cites there is no there, there.

--
Some drink at the fountain of knowledge. Others just gargle.

Re:variable names and data structures. by johnmoe · 2010-07-11 15:19 · Score: 4, Informative

"is there an elf spec"
http://en.wikipedia.org/wiki/Executable_and_Linkable_Format
Re:variable names and data structures. by Fizzl · 2010-07-11 16:44 · Score: 4, Insightful

This does arouse suspicion. even if you forget the variable names for a moment, any pattern like bool,real,real, *real, int, *char,*char,*bool,.... that is identical between two structs would be an improbable occurence. and when you see it in back to back structs it becomes nearly impossible to happen by chance.
Umm, no. This is only logical. If he had the documentation to data types, or even if he reverse engineered them, they would be identical. I have done some trampolining of functions to/from code which i only have binary access. If I am handling a struct, I would examine the structure in memory and make my struct identical so I could copy it with simple memcpy(to, fro, sizeof(struct)), or replace references by just changing the pointer to different place.

--
Bot Assisted Blogging
Re:variable names and data structures. by ipX · 2010-07-11 17:22 · Score: 3, Interesting

Reflections on Trusting Trust
Re:variable names and data structures. by http · 2010-07-11 17:40 · Score: 2, Informative

ipX, for the win:

"In the ten years that we have worked together, I can recall only one case of miscoordination of work. On that occasion, I discovered that we both had written the same 20-line assembly language program. I compared the sources and was astounded to find that they matched character-for-character."

http://cm.bell-labs.com/who/ken/trust.html

--
If opportunity came disguised as temptation, one knock would be enough.
3^2 * 67^1 * 977^1
Re:variable names and data structures. by slack_justyb · 2010-07-11 20:24 · Score: 2, Interesting

Having done patches for some of our Linux servers, I can tell you that most structs, unions, etc... are in the order that they are specified in the man page for a given function. Same way with BSD and other open source projects that I've seen.

If you take a look at something like (just picking something) IP, you'll notice the struct is in the same order as the actual packet. Does it have to be that way? No, but it's usually what everyone goes with.

All the examples that I saw on the site seem a bit too generic to be called copying. I know that some of the snippets have a bit of history, aka didn't come from SCO or Linux, but I'm not that buff in kernel programming to know the difference.

Re:What's so liberal about it? by bky1701 · 2010-07-11 15:01 · Score: 2, Interesting

Well lets see.

"ELF_C_..." - each of these is the name of a type in C. I don't see how this is even a bit creative. I had a very similar enum in a program I wrote, except with data types from a 3D engine. My guess is that ELF_C means it would be the ELF binary format's C data type. Nothing to see here.

"ELF_K/ELF_T" - It says in the open source one that these are descriptors as well. More or less the same; universal concepts if you're going to be programming a C compiler. I bet you can find an enum just like this in visual C++.

Hmm... beyond the headings, that's really all that is in that file. If you really have been a programmer for 20 years, you have without question violated thousands of copyrights... if that file does.

--
Great Intellect...

Re:What's so liberal about it? by Anonymous Coward · 2010-07-11 15:05 · Score: 2, Interesting

(same AC) Again, 20 years programming experience, no experience with some, apparently very important for *NIX, POSIX IEE document. If this is a published standard, obviously they'll be similar. I just didn't understand why every function definition from line 138 to 176 is copied verbatim. With the same cryptic series of letters which obviously imply meaning (strptr, newscn, getshdr, getphdr, newehdr, elf_flagelf, elf32_faize) are VERBATIM. I think a more effective summary would have been to link to the public domain specification with these exact terms mentioned in the document so we could all laugh at how their only evidence is a public spec-based implementation.

Instead I see a long list of identical function names, enum types, enum labels, and identically named structure fields. When I google the relationship between POSIX ELF I don't exactly get any hints , for something that's supposed to be (I'm assuming its relating to http://en.wikipedia.org/wiki/Executable_and_Linkable_Format)

How much touching up, commenting, refactoring, general massaging is necessary for it to become "unique" code? I think if you got a jury that wasn't as studied as me, you're right, Linux would have been screwed without a little more explanation..

Re:More details and downloadable archive by tomhudson · 2010-07-11 15:09 · Score: 2, Informative

I've never done that (replaced a url with another) in my life - nice try. Next time, try to be at least a bit more credible by not posting as a lying A.C.

Why the fuck are we wasting more time on this by haruchai · 2010-07-11 15:09 · Score: 4, Insightful

this fiaSCO has been running on for nearly 8 years - what the hell is up with the courts that they keep this bullshit alive.
Kill -9 all | sort > /dev/null

--
Pain is merely failure leaving the body

Re:More details and downloadable archive by Runaway1956 · 2010-07-11 15:10 · Score: 3, Insightful

Ho-hum.

I'm not a coder. I couldn't create a kernel if my life depended on it. I couldn't code a hungry cat to catch a mouse. 1/2 or more of what I read in code is gibberish to me.

But, one thing is pretty sure. Linus Torvalds wrote Linux, and his programming background came directly from Unix. OF COURSE he is going to write the same commands he has used a thousand times in the same way. OF COURSE there are going to be lines that look very much the same, sometimes even identical.

Now, open any dozen books that are 50,000 words in length. Search for strings that are duplicated between the books. Entire sentences, or phrases, it hardly matters. Just do the search. Anyone who is used to playing with databases can probably search those dozen books, and find numerous instances of phrases that were copy/pasted from one author's book to another. In fact, I'll bet that technical and factual books will have a higher incidence of matching phrases and sentences than works of fiction - but fiction will have it's share as well.

And, before we do this data base mining, we need to set up some method of assigning a variable string for proper names. In the wife's romance novels, we would be looking for " $ kissed $ ". It will be repeated so often that you can't help seeing the plegiarism. One author after another rips it from his predecessors.

50 instances are claimed for "copying". Out of how many lines of code? Good grief. I guess it should have been mandatory that Linus write any code not only in a different programming language, but in a different language than English. Then, he MIGHT have been safe. Maybe. Not likely though, because SCO can probably read Chinese, or hire some scumbag lawyer who can.

--
"Windows is like the faint smell of piss in a subway: it's there, and there's nothing you can do about it." - Charlie Br

Re:Who needs proof? by Anonymous Coward · 2010-07-11 15:10 · Score: 2, Insightful

It is true that anyone who slams Microsoft for not innovating, while simultaneously claiming that Linux is innovative, is an idiot. Neither is particularly innovative. (Nor is OS X for that matter.)

The difference is that Linux openly admits it's just a solid, free implementation of the tried-and-tested UNIX design, while Microsoft spouts so much bullshit about "innovation" that the word has become practically meaningless.

That's talking about the Linux kernel, of course. Some projects commonly used with Linux, such as Gnome and KDE, are actually very innovative. The only problem is that every time they try something new, half the community slams them for being too different from Windows and the other half slams them for being too similar to Windows. It's tough to innovate when everyone actually wants everything to stay exactly the same forever.

Re:First post by SpongeBob+Hitler · 2010-07-11 15:11 · Score: 2, Funny

Just don't forget to pay your SCO licensing fees you cock-smoking teabaggers!

Okay. Now I'm really confused. What does Sarah Palin have to do with SCO???

--
Wollt ihr den totalen Krieg?

Disbarment? Jail time? by DrJimbo · 2010-07-11 15:13 · Score: 5, Informative

This code was the last big unknown in this long sorry saga. Even if SCO owned the copyrights, (and hadn't distributed it under the GPL, and hadn't signed the UnitedLinux agreement, etc.) it is now crystal clear that SCO's Microsoft-funded anti-Linux campaign was based on a stack of frivolous law suits.

I think Darl's brother is scrambling to cover his backside so that when the disbarments and criminal charges come down, he has a chance to escape.

Groklaw (of course) has IBM's response to SCO's claims that these paltry examples are worth BILLIONS of dollars in copyright damages. None of the code they offered is protectable under copyright law. Some of it is BSD code that everyone is free to use however they want (if they include the copyright notice). A lot of it is header files that were not copy-and-pasted which are nearly impossible to protect under copyright law. Then they have some snippets of generic code. Given the size of the source code for Linux, it would be astounding if there weren't some similar snippets. The idea that this is proof that Linux violated any Unix copyrights is totally absurd. The idea that these generic snippets are what made Linux enterprise-ready is beyond insane.

The recent SCO v. Novell case decided that SCO never even owned the copyrights it was suing about. And then instead of the millions of lines of code they claimed were infringing, they presented this meager collection of totally unprotectable snippets. I sure hope SCO's lawyers get severely punished for perpetrating this fraud on the court for the past seven years.

--
We don't see the world as it is, we see it as we are.
-- Anais Nin

Re:oops I meant 331 not 251 by tomhudson · 2010-07-11 15:20 · Score: 4, Informative

re 331: It's from BSD

Man Pages
Manual Reference Pages - ELF (5)
NAME
elf - format of ELF executable binary files CONTENTS
Synopsis
Description
See Also
History
Authors
SYNOPSIS
.In elf.h
DESCRIPTION
The header file
.In elf.h defines the format of ELF executable binary files. Amongst these files are normal executable files, relocatable object files, core files and shared libraries.
An executable file using the ELF file format consists of an ELF header, followed by a program header table or a section header table, or both. The ELF header is always at offset zero of the file. The program header table and the section header table's offset in the file are defined in the ELF header. The two tables describe the rest of the particularities of the file.
Applications which wish to process ELF binary files for their native architecture only should include .In elf.h in their source code. These applications should need to refer to all the types and structures by their generic names "Elf_xxx" and to the macros by "ELF_xxx". Applications written this way can be compiled on any architecture, regardless whether the host is 32-bit or 64-bit.
Should an application need to process ELF files of an unknown architecture then the application needs to include both .In sys/elf32.h and .In sys/elf64.h instead of .In elf.h . Furthermore, all types and structures need to be identified by either "Elf32_xxx" or "Elf64_xxx". The macros need to be identified by "ELF32_xxx" or "ELF64_xxx".
Whatever the system's architecture is, it will always include .In sys/elf_common.h as well as .In sys/elf_generic.h .
These header files describe the above mentioned headers as C structures and also include structures for dynamic sections, relocation sections and symbol tables.
...
[snippage]
...
HISTORY
The ELF header files made their appearance in Fx 2.2.6 . ELF in itself first appeared in AT&T V . The ELF format is an adopted standard.

This is the problem with SCO's case - OldSCO/Caldera only could have gotten what Novell originally had to give, if Novell HAD assigned copyrights to OldSCO. A lot of the stuff was from BSD.

Re:What's so liberal about it? by SpazmodeusG · 2010-07-11 15:20 · Score: 4, Informative

It's not even that. It's plain old rewriting a library to remain compatible.
Here's an example of some end-user programs that use those very enumerations. The ELF_Type enumeration is used on page 37 in an end user application and ELF_T_WORD value is assigned to it on page 45.
http://elftoolchain.sourceforge.net/for-review/libelf-by-example-20100112.pdf

There's no coincidence involved. If you write applications that use the ELF_Type enumeration and you decided to write a new elf library to support that app you'd end up having the same enumeration names to maintain compatibility.

Copyright allows you to recreate something that's compatible as long as it isn't copied directly.

Re:What's so liberal about it? by marcansoft · 2010-07-11 15:30 · Score: 4, Insightful

No one cares whether it's a public spec or not (it may be, I do not know). It's a OS header file. The functions have to be named the same way in order for end-user programs to be source-compatible. If you've been programming for 20 years, this shouldn't be too hard to grasp.

If SCO were allowed to claim copyright over this, then it would be simply impossible for Linux to provide a compatible libelf. This means you'd essentially prevent anyone from ever making compatible OS libraries, as they'd be infringing on the original author's copyright. That would be ridiculous. Public function names and prototypes (documented or not, standardized or not) are not considered copyrightable.

Another example: is Wine, according to you, a humongous violation of Microsoft's copyrights? After all, it implements the Windows API with identical function names and prototypes, undocumented features and all (which is nowhere near a published standard of any sort).

Re:More details and downloadable archive by tomhudson · 2010-07-11 15:47 · Score: 4, Interesting

I've seen cases where me and another person are working on code independently, and when it came time to merge, we had both ended up creating the same variable names, and pretty much the same code.

About the only difference was in indentation - mine is "always put the opening brace on the same line, one true tab, else in same column as if, no braces for any single-line condition to a control structure (for, if, else, while, etc)". Even the comments were pretty much the same.

In this case, though, some of the code is from BSD - which is perfectly fine.

Re:Cheat Detection by Chibi+Merrow · 2010-07-11 15:47 · Score: 2, Interesting

Now I'm not a system programmer, so I may be completely off base, but that looks like a system call for supporting ELF binaries. Meaning it's an implementation of a public standard. Meaning there's a very limited number of correct ways of doing this. Possibly only one correct way of doing this.
Every file I've randomly selected has been like this; either references to the ELF format (which is a public standard), or ABI type stuff (which is also a public standard). That's what was rumored to be SCO's "copying evidence" like six or so years ago... And now we've found out it's the truth.

They are either insane or they are "dumb like a fox," because anyone who knows anything about UNIX system development could tell you that this isn't stuff you could sue over.

--
Maxim: People cannot follow directions.
Increases in truth directly with the length of time spent explaining them

Re:What's so liberal about it? by Anonymous Coward · 2010-07-11 15:53 · Score: 3, Insightful

Interface names needed for interoperability are fair game. Interfaces once published are public domain. The specific document is not.

Software is copyrightable but it is not a work of fiction where more protection is given. Writing a book on physics protects the specific wording in that book but it doesn't keep others from using the word "relativity" with the same meaning or your specific name for the speed of light -c- to explain the same exact phenomenon your original book described.

Guess what? Wine contains a function named MessageBoxW and I can't remember the parameter types, but I assure you they are the same as in Windows source. That Microsoft hasn't done anything about it, should tell you how much legal standing a suit on struct member names or function behavior would have.

Re:FUD by baryluk · 2010-07-11 15:57 · Score: 2, Insightful

I have read 20 more Tab-xxx.pdf files. Claiming that they are copy&paste is again nonsense.

They mostly consists of straight forward code, #defines or function definitions which looks similar.

Of course they are similar becuase this are POSIX functions and defines!

Their claims are just ridiculus.

Re:What's so liberal about it? by tofubeer · 2010-07-11 15:58 · Score: 2, Informative

Ell given that everything in a header file is public if you want source compatibility they have to have the same names.

I, admittedly, didn't read 100% of all of the links, but the slightly more then skimming I did do did not show me anything that was the same that couldn't have been for achieving source compatibility.

It would be like complaining that glibc defines EXIT_SUCCESS...

Re:What's so liberal about it? by Jahava · 2010-07-11 15:58 · Score: 3, Interesting

Of course it looks rearranged. It's a header file. Some of the ELF constants come straight from the ELF spec. The #ifndef stuff is bog standard code, there are a finite number of ways of writing that and the one presented happens to be the most common. The #include is another "duh" - of course you have to #include the right header, that doesn't mean it's copied. The header file is presumably deliberately compatible with the original, hence the function definitions are prototype-compatible (while being considerably different in style).

There is nothing indicative of code copying in that PDF. The Linux header is just about as different as it can be while remaining source-compatible, as it should be.

Commenting further on that, here is a link to the System V Reference Specs, one of which is the ELF Tool Interface Standard Specification. This contains not only several constants, structures, and function names, but suggests function prototypes and programming style.

Like you said, any author wishing to build an ELF-capable system would almost have to have that exact same code. There are only so many ways to build an enum or struct following the exact TIS specifications, and there is no virtue in paraphrasing C code.

Much of the rest of the code is libc and POSIX prototypes (and more headers), all of which are covered in the System V ABI specification. Anybody wishing to build a POSIX-compatible system would have to define those prototypes.

Several of the function implementations with similarities are very basic functions. Most of the similarities are in the constant names (rather than the specific implementation of those simple functions), and the constant names are defined by ... the TIS spec. The remainder is a no-brainer. See, for example, Tab 422. This is a simple accessor method. There are only so many ways to retrieve a value from a structure...

10 words that tell it all. by Fr33thot · 2010-07-11 15:59 · Score: 3, Insightful

"until they lost the battle by losing ownership of UNIX" Those are all the words you needed to read. You cannot loose ownership of that which you never did own. SCOs gambit was to gain ownership by bamboozling everyone. You know when one party is blowing smoke in these issues when they refuse to point to the infringing code outside of court. If the quote included is to be believed, they lost on appeal and now that they have filed yet another appeal they are suddenly going to show us all the holy grail of "infringing" code. In each case where they've brought up "infringing" code, it was either released by themselves, or was code they didn't own in the first place.

Unix Copyrighted Code in Linux? Not likely. by softcoder · 2010-07-11 16:03 · Score: 2, Interesting

Just because Linux and Unix have some of the same lines of code, does NOT mean that linux copied the code from unix.
The code could have come from BSD for example and in fact there are several instances where linux and Unix share (or shared) the same BSD code.

The code could also have come from implementing the Posix Standard. The PDF linked to seems to be an implementation of errno.h which I believe is part of the POSIX standard.
So again just because the code appears in Unix, does NOT mean that Unix had copyright ownership of that code.

To prove its case SCO would have had to prove that:
a) Linux had lines of code that were substantially similar to Unix. (some minor examples provided but even that was not definitive)
In fact the judge who supervised the discovery kept asking for details and at the end of the multi year discovery process, said, "Is this all you've got?"

b) Unix had copyrights to the code in question (again not proven)

c) SCO owned the Unix copyrights (again not proven)

d) SCO never granted the rights to use that code in any way. In fact Caldera (aka SCO) distributed a version of Linux under the GPL which in effect granted GPL license to any of their code that happened to be in Linux.

So even if all of a, b, and c were true,
they STILL did not have a case for infringement.
I almost wish that SCO had owned the UNIX copyrights, because then this whole issue would have been resolved by now, instead of relying on Novell.

softcoder.

Re:More details and downloadable archive by bennomatic · 2010-07-11 16:17 · Score: 5, Funny

I learned perl from someone who named all his variables with variations on "foo" and "bar". Back in those days, if I was writing something short and simple enough, it was hard for me to break the habit of naming things $foo, $bar, $boo, $far, $foofoo, etc. I'll bet a lot of our code looked like it was from the same person :)

--
The CB App. What's your 20?

To Bogosity and beyond! by DrJimbo · 2010-07-11 16:26 · Score: 4, Informative

The courts have established that in order to determine software copyright infringement (for non-literal copying, which is what we have here but filtration is required even for literal copying), one must perform what is called the Abstraction, Filtration, Comparison Test. In court documents related to the code in question, SCO admitted the did not perform this test on this code. They claimed that that was IBM's job. The article linked to above explains the test:

1. break down the plaintiff’s program into its constituent structural parts (“abstraction”);

2. examine each part for incorporated “ideas,” elements taken from the public domain, methods of operation, processes or procedures, or otherwise unprotected material (“filtration”); and

3. compare the remaining kernel of creative expression, if any, to the work alleged to infringe at each level of abstraction (“comparison”).

They further explain:

The scenes à faire doctrine is often applied in software cases because it is frequently impossible to write a program in a particular computing environment without employing certain standard programming techniques and design elements. This is because certain functions, data elements, and the order of operation of a program can be dictated by such things as the type of computer on which the program will run, the programming language used, the operating system environment, governmental requirements, industry demands and standards, and widely accepted programming practices.

I suspect the reason SCO didn't filter this code is because if they did, there would be nothing at all left to present to the court as their fig leaf to avoid being charged with perpetrating a fraud on the court.

--
We don't see the world as it is, we see it as we are.
-- Anais Nin

Re:What's so liberal about it? by tomhudson · 2010-07-11 16:41 · Score: 2, Interesting

Or they both legitimately got it from BSD, or linux got it from the standard.

HISTORY
The ELF header files made their appearance in FreeBSD 2.2.6. ELF in
itself first appeared in AT&T System V UNIX. The ELF format is an
adopted standard.

We know from the AT&T settlement that there's a lot of BSD in AT&T Unix, and that even some of the non-BSD AT&T stuff simply isn't protectable by copyright.

Unix kernel hacker and almost IP lawyer view by harlows_monkeys · 2010-07-11 16:57 · Score: 5, Interesting

I spent several years as a Unix kernel hacker, working extensively with AT&T source code. I also went to law school and was one bad case of writer's block away from becoming a copyright lawyer. Thus I found those code snippets quite interesting, both from my Unix kernel hacker persepective and my almost-became-a-copyright-laywer perspective.

My conclusion, from the half dozen or so of his samples that I looked at? They show nothing remotely resembling copyright violation.

Copyright covers expression, not ideas. What that means when dealing with functional works, such as computer programs, is that things that anyone implementing that functionality will have to do are unlikely to be covered by copyright.

All of the functions I saw that were allegedly copied were very simple functions. All they did was check arguments to make sure they were legal, return the expected error code if not, or return some very simply value otherwise.

Even if the corresponding functions in Linux were exact matches to the SCO code, it would probably not be enough to support an inference of copying, because there just aren't a lot of ways to reasonably express such simple functions. And they were not exact matches. One would check for a null pointer by comparing to NULL, one would use if(!p), for instance.

The header files are more similar, so copying is more believable there. The problem with SCO's case there is that the elements in the header files I looked at are entirely dictated by compatibility requirements. There's no copyrightable expression in them.

To summarize, SCO's claims appear to fall into two groups. First, things where the implementation is so simple that it is not possible to infer copying from similarity since the similarity is imposed by the nature of the function. Second, things where there may have been copying--of things that aren't protected by copyright.

Re:More details and downloadable archive by tomhudson · 2010-07-11 17:01 · Score: 3, Insightful

Except that gcom didn't ship Caldera's STREAMS implementation. They shipped a loadable module (driver) that intercepted calls made by legacy apps to the STREAMS api and translated them to calls that linux could fulfill. In other words, no STREAMS code necessary in either the LiS product or in linux, and not even code to emulate STREAMS - just translate the call and forward it.

Re:First post by kdemetter · 2010-07-11 17:04 · Score: 4, Insightful

Since we have to rely on this PDF , and there is no way to prove that that code was actually in UNIX at the moment ( as it's closed , there's no way to check ) .

I am left to conclude that it's much easier to copy from Linux (which is open ) , than from UNIX ( which is closed ).
In other words , using this PDF , i could also claim to SCO stole from Linux , and then implemented it in their commercial product , thus violating the GPL .

--
Slipping shoelaces ?

Re:First post by JavaBear · 2010-07-11 17:10 · Score: 5, Funny

Just don't forget to pay your SCO licensing fees you cock-smoking teabaggers!

Okay. Now I'm really confused. What does Sarah Palin have to do with SCO???

The approximate IQ of the management.

Re:Disbarment? Jail time? by UnknowingFool · 2010-07-11 17:20 · Score: 4, Informative

In Gates v Bando the Tenth Circuit established the abstraction-filtration-comparison test that would become the standard in software copyright infringement. Specifically in the filtration step, all elements which are not protected by copyright must be removed from consideration. In this case, most of the code falls under scenes a faire: "expressions that are standard, stock, or common to a particular topic or that necessarily follow from a common theme or setting . .these external factors may include: hardware standards and mechanical specifications." Most of the code were simply declarations needed for compatibility and cannot be copyrighted.

--
Well, there's spam egg sausage and spam, that's not got much spam in it.

Re:First post by Z00L00K · 2010-07-11 17:20 · Score: 5, Insightful

Header files are public - but they seldom contains any advanced functionality. They are just a definition of the calls available, defined data types and constants.

If the header files didn't contain the same (or very similar) definitions then the API wouldn't work. I expect the same header definitions to reappear in many other operating systems with minor differences - many even in Windows (Which do have a Posix API)

But of course - a lawyer wouldn't understand that, it's just a question of money.

--
If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.

Re:First post by Cylix · 2010-07-11 17:26 · Score: 3, Interesting

The pdf linked in the document is a snippet for what looks like a struct for the elf API interface. This specification is open and judging by the code they are using it exactly as intended.

I'm going to guess the majority of their findings are specifically computer generated. They may have known first hand what the code was or even where it came from. However, if pressed to say how they discovered these violations I'm quite sure they would fall back on "the program made the mistake your honor." This would generate a plausible stance when the foundation began to crumble.

Going further on a limb I'm also guessing this is why they would never release any of the alleged violations. In days a website similar to groklaw would be up in for everyone to review, identify and mark the source of the "violation." ie, this is a struct for the elf library specification or this is a header of a BSD library. (Remember that BSD ancestry is likely still there in large chunks)

All of this happening in the court room and they had to know there were big holes in the allegations. Even a cursory glance reveals that some of the crap submitted is just that. This was a court room poker face with a huge bluff that many parties would just settle. I suppose it worked because too many people rolled over and handed out free cash.

--
"You should always go to other people's funerals; otherwise, they won't come to yours." -- Yogi Berra

Re:More details and downloadable archive by gringer · 2010-07-11 17:27 · Score: 2, Interesting

mine is "always put the opening brace on the same line, one true tab, else in same column as if, no braces for any single-line condition to a control structure (for, if, else, while, etc)"

Coding style like this makes me cringe, particularly the thing about no braces for single-line conditionals -- it makes it far too easy to make mistakes because you indent code and forget that indentation doesn't mean it's part of the conditional (unless you are using python, of course).

--
Ask me about repetitive DNA

Re:Found this in SCO's code... by Legion303 · 2010-07-11 17:46 · Score: 3, Funny

Looks like I typo'd = instead of ==, which I guess only proves that I review my joke code more thoroughly than SCO reviewed their actual code.

Re:More details and downloadable archive by sg_oneill · 2010-07-11 18:08 · Score: 4, Informative

And heres the Magic. Linus learned his style by closely reading Andrew Tanenbaum's books, and reading the Minix code. Which of course is what your supposed to do with Minix. So have most OS coders who had their education back then.

The end result of course is that everyones code ends up looking like Tanenbaums , which is not a bad thing, the guy is up there with the gods in terms of importance to O/S theory.

--
Excuse the Unicode crap in my posts. That's an apostrophe, and slashdot is busted.

Re:First post by Score+Whore · 2010-07-11 18:13 · Score: 3, Funny

Going further on a limb I'm also guessing this is why they would never release any of the alleged violations. In days a website similar to groklaw would be up in for everyone to review, identify and mark the source of the "violation." ie, this is a struct for the elf library specification or this is a header of a BSD library. (Remember that BSD ancestry is likely still there in large chunks)

If this paragraph is indicative of your general level of knowledge about the history of Unix, you might want to look into ELF and how it got into Unix. I can tell you this however: it didn't came from Berkeley. The various BSD derivatives got it from SysV long after the fact.

Also you might want to spend about ten more seconds checking out the PDFs at the linked site, not just the one linked by the knucklehead kdawson. It's not simply header files. There is actual code. The similarities in code flow, layout, variable names, filenames, etc. are conclusive. The linux contributor didn't implement this code clean room-style based on the specification, plainly having used the Unix implementation as the source.

Re:First post by lordholm · 2010-07-11 18:21 · Score: 2, Insightful

Though, I do agree that it looks like copying. I do not think (having read the ELF spec and implemented it), that code would look that similar in general (some of the duplicated enums are not in the ELF spec (which funny enough is published by SCO)), however, I was under the impression that header files where not copyrightable since they only describe an interface.

If there were any reason to implement source level compatibility with SCO, i think this would be allowed (though IANAL). Ridiculous is to bring in the dlopen header, this is implemented on every UNIX-like system, and headers are needed for compatibility, which is explicitly allowed by most copyright laws around the world (the same headers are also in OS X, BSD, Solaris), the important part, if you want to determine whether code was copied is the code, not the headers.

From the looks of it, the ELF code was actually copied, and I do not think that you need any liberal notion of copying for seeing that, I have seen this kind of copying by students earlier, and they usually rightfully get into troubles for it.

--
"Civis Europaeus sum!"

The first 33 lines by Lorens · 2010-07-11 18:22 · Score: 2, Interesting

Sure, they copied all those those blank lines, and in at least one case (tab 247) they also copied the BSD copyright header. Shocking! Funnily enough SCOX removed those lines on both sides. Kind of them.

They also copied strn?casecmp definition (tab 241). For quite astonishing values of "copied":

-SCOX
+RedHat glibc

-38: int strcasecmp();
-39: int strncasecmp();

+53: /* Compare S1 and S2, ignoring case. */
+54: extern int strcasecmp (__const char *__s1, __const char *__s2)
+55: __THROW __attribute_pure__;
+57: /* Compare no more than N chars of S1 and S2, ignoring case. */
+58: extern int strncasecmp (__const char *__s1, __const char *__s2, size_t __n)
+59: __THROW __attribute_pure__;

This is clearly an extremely grave violation. However, it is interesting to note that SCOX did not complain about the definitions of all the other string functions. Maybe because the header of their file specifies

In addition, portions of such source code were derived from Berkeley
4.3 BSD under license from the Regents of the University of
California.

Presumably the other string functions came from BSD, and ignoring character case was a UNIX improvement that BSD couldn't have thought of by themselves. Right? Right??

Re:What's so liberal about it? by Dantoo · 2010-07-11 18:24 · Score: 4, Interesting

Ostriches aren't Australian, they're African. Omelettes can be made from emu eggs and I have tasted one. It really wasn't any different to one prepared from hen's eggs. It looked no different to this observer. Compare an emu egg to a hen's egg and they are quite different in size, colour and even texture internally and externally. The formula (recipe) however was just for a standard omelette that we would all recognise by sight instantly. Interestingly, it tasted like one prepared from hen's eggs as well. Couldn't tell the finished product apart.

Posix header files also look remarkably similar to this observer. If code is being written to a required formula so that it interacts correctly with other code (a standard) then there should be little surprise that it looks the same.

Egg analogies make me hungry.

Re:Found this in SCO's code... by Wuhao · 2010-07-11 18:34 · Score: 3, Funny

Dude, you ruined it. As written, you have a brilliant joke in C: the premise is bad (we're beating a dead horse), the plan is questionable (we're directly comparing a pointer and a string literal), and the execution is sloppy (thanks to the typo, we're testing the result of a non-zero assignment, so the horse is beat forever regardless of liveliness).

This is pretty much the story of SCO.

Re:More details and downloadable archive by Anne+Thwacks · 2010-07-11 18:51 · Score: 5, Informative

Because I, J, K, L, M, or variable names beginnning with them, are integers in Fortran. Otherwise variables are floating point, and floating point loop variables is a bad idea especially in Fortran.

Foobar is a WW2 acronym for F*%&'d up beyond all recognition. (Typically referring to the military situation (See "The Longest Day").

Now get off my lawn.

--
Sent from my ASR33 using ASCII

Re:What's so liberal about it? by lordholm · 2010-07-11 18:53 · Score: 2, Informative

Reverse engineering for compatibility is allowed (at least where I live). Thus, headers cannot be considered code, since they are only expressing an interface, essential for compatibility.

If you want to be compatible with say int foo(int, float), then you actually need to write that exact sequence in your own headers, since the header define the symbol name and the calling convention for the functions. You do also, by the way need to name the header the same as in the system you are emulating for compatibility reasons, as otherwise the programs that you want to run on your system will not compile.

--
"Civis Europaeus sum!"

Re:More details and downloadable archive by moronoxyd · 2010-07-11 18:53 · Score: 4, Interesting

The truth is that code was reused from a UNIX derivative, which is now (somewhat disputably) owned by SCO.

Did I miss a verdict here?
As far as I know, it is right now only a claim, not yet proven.

And using the terms "truth" and "SCO" in one sentence... well, it just feels wrong.

Re:More details and downloadable archive by pugugly · 2010-07-11 19:05 · Score: 4, Insightful

I'm all for strict ethical standards, and I won't claim to have gone through more than a half dozen of these PDF's checking them.

That said - the six or so files I looked at were exclusively issues about some fairly well established naming conventions, the repetition of which is about as unlikely as finding an elm street and sycamore street 'coincidentally' close to each other in both our home towns and calling it 'evidence' the city planner of one stole the plans of the city planner of the other.

I've heard people claim in several posts that "Well, if you looked at the PDF's that weren't cherry picked for ridiculousness ..." there are 'obvious' copying of code. That may be, but I'd like someone to actually link to a pdf with this obvious copying of code - burying the evidence in bullshit data is what you do when the evidence is against you and you *don't* want the other side to find it, not when it's in your favor and you're trying to make your case.

At a minimum, the fact that he has buried his 'evidence' in with other bs data doesn't make it very clear he knows what actual evidence of copying would look like.

Pug

--
An Invisible Entity of Vast Power whose existence must be taken on faith alone: Liberal Media

Re:First post by Z00L00K · 2010-07-11 19:07 · Score: 3, Insightful

I did see the code - and the filenames, but the majority of that was header files that are containing definitions that have to be identical for portability reasons.

And the few code snippets that did contain actual functions didn't contain any real and obvious surprises. The working code was hardly something that you can call "advanced" by itself.

And then there is the issue of which way some code actually did go. From Unix to Linux or the opposite way.

--
If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.

Re:First post by jbolden · 2010-07-11 19:12 · Score: 4, Insightful

They both had a common parent which was public with those structures. Remember these were public APIs, no one is disputing that the structures were similar. The question is whether the implementations were copyright violations.

Re:First post by SpazmodeusG · 2010-07-11 19:13 · Score: 4, Insightful

"it was certainly a matter of one window with unix source to read from, the other window with linux source to write"
Hell no. That's a bullshit accusation.

Here we likely have a programmer who has never seen the SCO code (was it even released publicly back then?) but knows that applications can call functionY from libraryX from the interface documentation. If he creates a new libraryX containing a compatible functionY he isn't violating copyright. It is established that creating code with a compatible API interface isn't a copyright violation unless you actually do copy the code.
Indeed it appears this is exactly what is happening in the linked elflib files. The filenames are the same because they have to be (note both the redhat_libelf.h and SCO_libelf.h are actually referred to by just libelf.h in code and elflib.h is what applications need to link to) and the functions are the same because they have to be or else applications will be calling functions that don't exist.

This also happens with the WINE project. People there are creating the same header filenames and inside those headers the exact same function names. They aren't working off a split screen setup and blatantly copying Microsofts closed source code. They are merely recreating the functions that applications can call in Windows.

A similar thing happened years ago with Compaqs IBM compatible BIOS. Compaq re-created the IBM BIOS to get around having to buy the IBM BIOS. Both BIOSes responded to the same application calls in the same way with the same return values and messages. Despite this Compaq didn't break copyright because they still re-wrote the code. In fact Compaq hired engineers specifically for the fact they had never seen IBM BIOS code. This way it could be easily established that they didn't copy the code but instead just wrote the same code that performed the same task.

Re:More details and downloadable archive by MSG · 2010-07-11 19:15 · Score: 3, Interesting

OF COURSE he is going to write the same commands he has used a thousand times in the same way

I'm sure this is one of the reasons it's best to call the system GNU: Linus didn't write any of the "commands". Linus wrote a kernel and GNU ended up adopting it. The GNU project wrote the system "commands".

Just do the search.

Trivia: Actually, the people at exbiblio found that there is very little repetition of text in literature. Any four or five word sequence in a common magazine article is likely to appear in very few or no other texts. That fact is foundational to their technology.

Re:First post by bytesex · 2010-07-11 19:25 · Score: 2, Insightful

One function per file, and the same name for the file as the function, is actually damn good C coding practice. Nothing else.

--
Religion is what happens when nature strikes and groupthink goes wrong.

Re:What's so liberal about it? by MSG · 2010-07-11 19:27 · Score: 2, Interesting

Just because 2 programs have hooks or functions called "ReadX" does not mean there was any copying involved.

On the other hand...

Re:First post by MSG · 2010-07-11 19:30 · Score: 2, Insightful

The PDFs provided seem pretty damning.

Let me guess, you're neither a programmer nor a lawyer. Am I right?

Tried to find source code by spitzak · 2010-07-11 19:44 · Score: 2, Interesting

Clicking on these I find a lot of .h files implementing POSIX and BSD standards (here is a choice one that is such an absurd claim of copyright violation that I can't believe they did it: http://www.mcbride-law.com/wp-content/uploads/2010/07/Tab-2421.pdf) Most of the others are not quite that bad.

So you don't waste your time, after quite a lot of clicking I finally found some actual code: http://www.mcbride-law.com/wp-content/uploads/2010/07/Tab-415.pdf

Here we see that they both used the name "elf" to name a pointer to the ELF structure! Why the chances of two programmers deciding to do that must be astronomical!

I stared at this thing for quite awhile trying to match up the code as it certainly is different. Finally figured it out: the Unix code goes to the i'th field in the structure and returns it if and only if the "index" field in it is equal to i. The Linux code instead searches and returns the first field with the index field equal to i, whether or not it is at i. Umm, this seems to be a pretty significant difference!

This is such a load of bullshit that mr Mc Bride should be ashamed.

Re:First post by Chrisq · 2010-07-11 19:52 · Score: 4, Insightful

OK show me something that is not a header, non obvious, and not found in an open common variant such as BSD. post should be modded ignorant.

Missed verdict by The+Cornishman · 2010-07-11 20:04 · Score: 4, Informative

You only missed a verdict if you haven't looked up for seven years! Recently a jury in Utah confirmed what a judge found in a bench trial: Caldera (later SCO Group) did not get, and was not entitled to get, the UNIX copyrights in the 1995 deal they did with Novell. Unless you think that the jury was unreasonable in that finding (and guess what, SCOG and its lawyers do), SCOG does not 'own' UNIX in any useful sense.

Re:First post by Richard_at_work · 2010-07-11 20:13 · Score: 3, Informative

Since we have to rely on this PDF , and there is no way to prove that that code was actually in UNIX at the moment ( as it's closed , there's no way to check ) .

I am left to conclude that it's much easier to copy from Linux (which is open ) , than from UNIX ( which is closed ). In other words , using this PDF , i could also claim to SCO stole from Linux , and then implemented it in their commercial product , thus violating the GPL .

While I disagree with the SCO case, your assertion that its impossible to check is wrong - UNIX may be 'closed' but that doesn't mean that the code base is impossible to get hold of. One of the licenses you could buy was full source code access, and many companies and academic institutes took this license (I for example have seen the code base to AIX and HPUX several times at different places).

'Closed' doesn't always mean 'you can never have the source code', it can just mean 'you can't do what you want with the source code'. UNIX is one of these cases.

Re:More details and downloadable archive by Anonymous Coward · 2010-07-11 20:14 · Score: 4, Insightful

Because in FORTRAN, variable names beginning with letters I through N meant integers, and all other meant floating-point. For those with some FORTRAN experience, naming the first 6 loop counters "i,j,k,l,m,n" is automatic.

Re:More details and downloadable archive by jimicus · 2010-07-11 20:32 · Score: 4, Interesting

Now, open any dozen books that are 50,000 words in length. Search for strings that are duplicated between the books. Entire sentences, or phrases, it hardly matters. Just do the search. Anyone who is used to playing with databases can probably search those dozen books, and find numerous instances of phrases that were copy/pasted from one author's book to another. In fact, I'll bet that technical and factual books will have a higher incidence of matching phrases and sentences than works of fiction - but fiction will have it's share as well.

Actually, that's not true. There is some evidence to suggest you only need a remarkably short string of words to uniquely identify a piece of English prose - it's this kind of thing that cheating-detection algorithms rely on.

But we're talking about a structured programming language - with far more structure and rules than the English language - and the things that are at issue are by and large implementations of existing standards. The final link in TFS is a comparison of ELF utility header files, FFS. They've got to look fairly similar or they won't be any use for dealing with ELF executables! Even then they're sufficiently different that it would probably have been easier to write from scratch than it would be to execute the "copy/paste/obfuscate" cycle that is being alleged.

Re:oops I meant 331 not 251 by Anonymous Coward · 2010-07-11 20:53 · Score: 3, Informative

Barratry.

SCO is in possession of Unix and BSD source code and stuff older than that.
Rather than make comparisons using their older pre-copyright notice code, they selectivity chose to compare it with stuff after attaching copyright? notices.

They chose not to apply abstraction-filtration-comparison tests to their own code.

I think the Judge did not ask the right questions for this fraud to go on for as long as it has. Hard jail time is needed for the 'players' who made this possible.

Re:More details and downloadable archive by gmack · 2010-07-11 20:57 · Score: 4, Informative

These are not small words like "kissing" that are under dispute, this is not about reusing some very common routines that everyone uses, that's just silly. Rather it's about companies wanting to maintain compatibility with legacy versions of UNIX and doing so by referring directly to the legacy UNIX at best, and plagiarizing their code at worst.

Except it's not about that at all. It's about implementing standard interfaces that are defined by POSIX. POSIX defines things down to the variable type so it's natural that the resulting header files will look similar. In fact, some of the differences I'm seeing in these files are from SCO not implementing POSIX properly ex return type int where it should be size_t.

You also need to keep in mind that SCO's predecessor (AT&T) was itself caught copying code from Berkely.

There was exactly one case where there was copying shown between SCO and Linux. In that case the code was from Berkely (licensed open source) copied into SCO and copied into Linux by SGI as one of their internal filesystem driver headers. The code was determined to be non infringing due to it's history but deleted because it was old and reimplemented in a better way elsewhere.

From working with the Linux kernel maintainers I know they take copyright very seriously and investigate even the possibility that code was copied and you owe them an huge apology for that uninformed set of accusations.

Palin by gd2shoe · 2010-07-11 21:01 · Score: 2, Informative

Okay. Now I'm really confused. What does Sarah Palin have to do with SCO???

An even better question would be: What does Sarah Palin really have to do with the "tea party" movement?

(Besides the fact that she wants as much air time as possible, I mean...)
("Tea Party Express" sort-of has something to do with the "tea party" movement, but not strongly. Largely, it's a degenerate Republican cheer squad. The rest of the movement embraces fiscal responsibility, not blind adherence to a political party.)

--
I won't join Slashcott. OTOH, If Beta goes live, I just won't be back until it's fixed. Sorry Dice.

Re:More details and downloadable archive by MrHanky · 2010-07-11 21:03 · Score: 3, Interesting

Is that so? Let's see if we take a phrase from your own comment: "a higher incidence of matching phrases". One hit. Not bothering with linking to them all, but how about "rips it from his predecessors"? One hit. "strings that are duplicated between the books"? One hit. "his programming background came directly from Unix"? One hit. "open any dozen books"? One.

I have, of course, duplicated them in this comment, meaning there will be two hits very soon. BTW, these are all the strings I searched for, giving your comment a 100% originality rating (admittedly, I didn't search for "I'm not a coder", which I expect would show up several times).

Duplication of whole sentences in ordinary human language is actually quite uncommon for all but the most trivial declarations and stock phrases ("Just do the search" gives 3 million hits; "Just do the twist" gives 105 000).

Re:First post by Joce640k · 2010-07-11 21:05 · Score: 3, Funny

The PDFs are machine generated. I ran the same program using a copy of Moby Dick and the Microsoft EULA, it produced damning evidence that the programmers had copied them word for word into their source code.

--
No sig today...

Re:First post by joss · 2010-07-11 21:05 · Score: 5, Informative

> The similarities in code flow, layout, variable names, filenames, etc. are conclusive.

I did look at the code on the linked site and what I saw looked entirely like a clean room implementation to me. The similarities were superficial and much less impressive than similarities I have seen in code that I know was written independently (because I wrote it). Two programmers working on the same problem can easily come up with strikingly similar looking solutions which a non-programmer (or an inexperienced programmer) would never believe was independent. I was astounded at how pathetic the supposed similarities were.

--
http://rareformnewmedia.com/

Re:First post by shentino · 2010-07-11 21:20 · Score: 2, Informative

In theory, having a problem that only accepts a single solution by definition would preclude the creativity inherent in a copyrightable work.

Re:First post by shentino · 2010-07-11 21:22 · Score: 2, Informative

Which doesn't even address the fact that SCO made contributions to the kernel under the GPL.

Any so-called infringement is fully licensed.

Re:More details and downloadable archive by Rogerborg · 2010-07-11 22:07 · Score: 5, Funny

Trying to imply that this is some nonsense that should be dismissed just because you like Linux is like playing down and ridiculing the evidence of the murder of Hans Reiser's wife because you like ReiserFS. It's even sillier in some ways because Linux isn't at stake in the case like ReiserFS was. (An extreme analogy I know, but valid).

That's the kind of analogy that Hitler would have made.

--
If you were blocking sigs, you wouldn't have to read this.

Re:More details and downloadable archive by loufoque · 2010-07-11 22:19 · Score: 2, Insightful

About the only difference was in indentation - mine is "always put the opening brace on the same line, one true tab, else in same column as if, no braces for any single-line condition to a control structure (for, if, else, while, etc)"

I hope they took the other person's then.
Never mix tabs and spaces.

Re:More details and downloadable archive by Eunuchswear · 2010-07-11 22:28 · Score: 2, Informative

Of particular note, and probably a source of much consternation to SCO and their proponents, is that LiS itself doesn't implement streams either, just does protocol translation. So neither linux nor LiS contains infringing code.

From http://email.gcom.com/home/linux/lis/

What is LiS
LiS is a software package that comprises an implementation of SVR4 compatible STREAMS for Linux.

Of course LiS does protocol translation - that's what STREAMS is, a framework for doing protocol translation.

As to whether LiS is infringing I couldn't say - I've never seen the SVR4 code to STREAMS.

--
Watch this Heartland Institute video

more interesting is that its his brother by SkunkPussy · 2010-07-11 22:57 · Score: 2, Interesting

So essentially Darl McBride took over a failing company, employed his brother as counsel. Then proceeded to embark on a huge programme of litigation until the company was dead. Thus transferring assets from SCO to his family.

To what extent is this legitimate?

--
SURELY NOT!!!!!

Re:More details and downloadable archive by silentcoder · 2010-07-11 23:30 · Score: 3, Informative

>But we're talking about a structured programming language - with far more structure and rules than the English language

Not to mention a far smaller vocabulary, the complete absense of abstract forms of speech (no metaphors, similes)and in fact of even fundemental sentences.
The vast majority of sentences in a programming language are verb(subject); THAT'S it, a rare few have an "object" (e.g. substr(S1,S2)) but at heart, that's 99% of the lines in a program. Simple commands. There are identifiers, control concepts (loops and conditionals) and structural stuff (classes, functions and the like) but these make up very little of the bulk. The implementation section consists of commands and variables for them to act on.
Thus for the same algorythmic task, barring minor changes in indentation and identifier naming (which will be minor because both are matters on whic standards exist and within organisations some or other standard is usually enforced) the statistical likelihood of two programmers writing and identical solution to the problem is very high. After all, programming is maths and there is only so many ways you can calculate the same equation - which is basically all any algorithm does.

You need a lot more than a few functions with identical structures to prove copyright violation when the scope for individual change is that much more limited. Creativity in programming is VERY rarely coming up with a NEW algorythm for an old task. Nearly always it lies in how we combine algorythms with one another to solve the bigger problems. The bits and pieces of code are like nuts and bolts, every engine has a million of them and they all look pretty much the same.

--
Unicode killed the ASCII-art *

Re:The thing that makes me angriest... by Stumbles · 2010-07-11 23:36 · Score: 2, Informative

Software rot is immaterial to the issue. I think IBM summed it up nicely about the supposed amount of "their code" that is in Linux this way:

Even if (contrary to fact) all of the System V Code were protectable by copyright, it could not render the Linux kernel substantially similar to the System V Works because the Disputed Code represents a trivial portion -- less than five one-thousandths of one percent (.005%) -- of the most cited of the System V Works and is qualitatively insignificant.

--
My karma is not a Chameleon.

Nothing new in this article by kspirov · 2010-07-11 23:59 · Score: 3, Informative

About libelf - 6 years ago Groklaw closed this "issue": http://www.groklaw.net/articlebasic.php?story=20040722135616439 The varable names are the same in both headers, because they correspond to Tool Interface Standard (TIS) 1.1 (Oct 1993). http://webster.cs.ucr.edu/Page_TechDocs/pfmt11.pdf

Re:First post by Score+Whore · 2010-07-12 00:00 · Score: 2, Informative

They both had a common parent which was public with those structures.

No they didn't. ELF originated in AT&T SysV Unix.

A year of discovery by Lorien_the_first_one · 2010-07-12 00:01 · Score: 3, Informative

I read the part in Groklaw about discovery in SCO v. IBM. They spent a year in discovery to see if there was similar code and once the infringing code was revealed, the response from the judge was "Is that all ya got?" After some pretty detailed analysis and dissection of the code in court, I think the result amounted to 230 lines of code out of several million.

Since they couldn't find examples of literal copying, SCO really wanted to go after methods and concepts as if that was protected by copyright. And they might have done it if they had been more forthcoming. But they were always late, delaying discovery as much as possible out of fear that any code revealed would be code removed from Linux. In a sense, they wanted a permanent tax on Linux without revealing the code so that it couldn't be removed.

Since you write code and you appear to have greater ethics than SCO, you wouldn't do what SCO did in order to recoup your investment in your code. What SCO did was try to reach something far beyond copyright enforcement in any reasonable sense of what the law actually says. And that was without owning the copyrights.

Having said all that, I can say that I am too, opposed to plagiarism in Linux. But I am also fairly confident that plagiarism isn't a problem in Linux since the code is open for anyone to see, and vetted by hundreds if not thousands of other programmers.

The far more compelling case of copyright infringement will come out in the counterclaims that IBM has against SCO. Right now, that giant is sleeping, but when it wakes, it's going to keep Boise Schiller very busy.

--
The diversity and expression of human opinion is essential to human survival.

Re:More details and downloadable archive by camperdave · 2010-07-12 00:08 · Score: 3, Funny

Do lambda functions not count as metaphors?

--
When our name is on the back of your car, we're behind you all the way!

Re:First post by jimfrost · 2010-07-12 00:26 · Score: 4, Interesting

That's true, but in the push to get UNIX into the commercial space the SysV interfaces were released as an open specification. This was actually covered during the trial.

The fact of the matter was that the Linux folk didn't copy code, something that would have been obvious to any observer following it's development. The idea that there were vast amounts of stolen code was ludicrous if you knew anything at all about the internal structure of the two operating systems.

There was always the possibility of code that got injected during the large commercial code donations by e.g. IBM or SGI, and in fact the only piece of code that showed actual derivation came from SGI ... But it turned out to be both a very small amount of code and buggy to boot. As soon as people got a look at it they excised it in favor of working, original, code.

I personally expected it to go more the way of the AT&T veresus BSD case, where it turned out that AT&T had stolen tons of code from BSD, not the other way around. The Linux emulation layer in SCO UNIX seemed a particularly likely candidate. Either that turned out not to be the case or IBM simply didn't push the issue (perhaps because SCO was having so much trouble proving anything in their claims) though.

SCO's strategy always seemed to me to be a shakedown, scare companies into license agreements. Why they went after one of the deepsest pockets first is beyond me, IBM was very likely to fight given their investment, but it was clear early on that management was not very competent.

--
jim frost
jimf@frostbytes.com

Re:More details and downloadable archive by Xtifr · 2010-07-12 01:03 · Score: 4, Informative

The truth is that code was reused (if not copied, exactly, in the same way you don't submit a copied essay which you've taken from a classmate) from a UNIX derivative, which is now (somewhat disputably) owned by SCO.

The truth is that SCO does not own the copyrights to UNIX code, as ruled by a judge, a jury and a second judge in Utah.

Beyond that, SCO already turned over all their evidence to IBM several years ago, where it was analyzed by an expert named Dr. Brian W. Kernighan (if the name doesn't ring a bell, you're not qualifed to be commenting on this topic), and he examined the comparisons and came to the conclusion that no illegal copying had taken place. Note, that's not "no copying", but "no illegal copying". UNIX is based on a number of sources. Most famously, they stole (and I use that word advisedly, since they removed copyright notices, which was illegal) from BSD. They also contributed parts of UNIX to public standards, including the ELF standard which is one of the examples shown here.

The problem is that you seem to think there's a single copyright to UNIX. There isn't. The three biggest stakeholders are Novell (inherited from AT&T), the University of Cal. regents (all the BSD code in UNIX), and Sun, but IBM and SGI also own largish chunks. The people you're accusing of being "lazy and careless" actually own parts of the code you're claiming they illegally copied. Do you know who actually owns the parts in question? I don't, but no evidence of illegal copying has been shown in court!

Releasing this thoroughly debunked information at this late date can only be a desperate attempt to spread FUD. Don't fall for it. I'm sorry that you once had your code plagiarized, but this case is nothing like yours.

Re:More details and downloadable archive by tibit · 2010-07-12 01:08 · Score: 2, Insightful

The truth is that code was reused (if not copied, exactly, in the same way you don't submit a copied essay which you've taken from a classmate) from a UNIX derivative, which is now (somewhat disputably) owned by SCO.

Um, citation needed? Nothing in TFA suggests to me any code reuse.

Don't mistake writing for the same API as "code reuse". You seem to have no clue that APIs are not a subject of copyright protection -- either you're badly underinformed, or you're a troll.

If you write for given API, where variable/parameter/function/field names are either part of the API or are a de-facto standard, there is no helping creating structures that look the same, creating same function declarations that look the same, etc.

Again, let there be no mistake: in my opinion, there is no code "reuse" or "plagiarism" in any of the samples TFA refers to. So most of your rant is a waste of steam. The fact that Linux-is-a-UNIX makes certain things look similar, but that's simply because to be a UNIX, you must implement a bunch of APIs, and by necessity functions and structs will have same signatures, and there will be a bunch of C macros that are same as well, perhaps to the letter. When it comes to C macros, sometimes there is only one way to write them correctly, save for whitespace, and function dictates form, to the letter. This does not imply any reuse of code, merely implementing same thing.

What code of yours was plagiarized, again?

--
A successful API design takes a mixture of software design and pedagogy.

Re:First post by Anonymous Coward · 2010-07-12 01:11 · Score: 5, Interesting

And the idea that this key book to early '80s PC tech (still worryingly relevant today!) was somehow missing from all the bookshelves reachable by the Compaq BIOS writing department is just silly.

You don't know what you're talking about. I was there at the time: Compaq had administrative staff remove the BIOS listings from all IBM tech ref manuals before they were given to the engineers. (This was especially easy to do because they came in the form of ring binders.)

At one point, since I didn't work on writing BIOS code, I was assigned to be the one designated guy who could disassemble the IBM BIOS for a certain model. When the BIOS developers got stumped by a compatibility problem, they could send me a question, and I was allowed to poke around in the IBM ROM and then give a "Magic 8 Ball" type vague answer.

Here's a bit of trivia: A few PC applications wouldn't work unless the ID string "IBM" appeared at a certain address within the BIOS code. Compaq developers worked out a way to make those bytes at that address appear in part of an actual executable code sequence instead.

Re:More details and downloadable archive by schon · 2010-07-12 01:48 · Score: 2, Insightful

Thank you for the clarification - I'd thought that Gcom shipped an actual STREAMS implementation, not just an API translator.

Still, just makes my point even further - Kevin claiming that Linux infringes SCOX's copyrights because it "included" STREAMS is just batshit insane.

Re:First post by Anonymous Coward · 2010-07-12 01:56 · Score: 3, Informative

It must have been during the "fascist totalitarianism" days of Reaganism when it was completely impossible to get hold of books which weren't in some specific location on some specially designated shelf.

The books were rather obscure and very pricey. They most likely weren't in libraries, and the only way for a typical 20-something developer to get one would be to buy it. I don't know anybody who would plunk down a huge chunk of change to buy their own copy of one of those books, especially when reverse engineering was plenty fun work anyway. It was like solving a puzzle.

At any rate, I have no doubt that IBM had their lawyers pore over every byte of the Compaq BIOS looking for evidence of copying. If there had been any, they would have stomped Compaq off the map.

You seem pretty unappreciative for someone who benefits greatly from reverse engineering efforts. If none of that had happened anywhere, all systems from different vendors would still be totally incompatible and proprietary, and you might still be shelling out the equivalent of $10,000 in 2010 dollars for a crappy midrange personal computer.

Re:First post by Anonymous Coward · 2010-07-12 02:08 · Score: 2, Interesting

You just reminded me of something amusing. Back in the early 1980's I met the guy who was in charge of the Unix business at AT&T. He told me that they had a problem - because they were a regulated utility, they were not allowed to make a profit on Unix. But of course, they also wanted it to pay for itself. But the lower the price they put on it, the more companies bought licenses, so the revenues went up instead of down!

-- not logging in because it's too much hassle for one comment.

Re:First post by The+Second+Horseman · 2010-07-12 02:49 · Score: 2, Insightful

Back in the day, my University had the source code for VAX/VMS 4.2 or so on microfiche. Endless lines of Macro-32 and BLISS. We had some of the the AT&T Unix materials as well. It wasn't that difficult to arrange, if you were a higher ed institution or a customer of a certain size.

Two things by br00tus · 2010-07-12 02:56 · Score: 2, Interesting

First, SCO released early versions of Unix to the public under a BSD-like license in 2002. So if any of that code happened to be in Linux, it is there legally. Only code added in later versions of Unix would be affected.

Secondly, a lot of code in Linux is created to follow POSIX standards. There is code in SCO Unix and Linux which looks very similar, but the source for the material is the IEEE Posix standard, not SCO Unix source code.

Re:More details and downloadable archive by LWATCDR · 2010-07-12 02:57 · Score: 3, Informative

Actually it is even simpler than that. The code in the PDF I saw for for ELF.
They where all typedefs. Elf is a well documented format and not of the code that shows copying was actual functional code.
As I was reading the code I was thinking just how trivial the example was but also how well written both .h files where. You could tell exactly what each variable in the type def did. It also looked like a lot of my own code when I am having a good day.
Also these are .h files! they are not functional code blocks just definitions. Of course the definitions for the typedefs of a well documented file format will look a lot alike!
It is a huge duh but an attorney that knows nothing about programing might not understand that.
If this was an example of the infringement I would say the court did a great job when they tossed it out.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.

Re:More details and downloadable archive by butlerm · 2010-07-12 03:14 · Score: 3, Informative

Because I, J, K, L, M, or variable names beginnning with them, are integers in Fortran

I am afraid the tradition is much older than that. "i", "j", and "k" are standard subscript variables in math and physics, almost as common as "x", "y", and "z".

Re:First post by Chris+Burke · 2010-07-12 03:25 · Score: 2, Informative

Header files are public - but they seldom contains any advanced functionality. They are just a definition of the calls available, defined data types and constants.

But of course - a lawyer wouldn't understand that, it's just a question of money.

Any lawyer versed in the relevant law would understand that. A lawyer may be able to pretend they don't understand for the sake of their client; I'm not sure but wouldn't that run afoul of rules regarding legal practice? It's certainly possible Kevin really doesn't understand, and even avoided learning so he could in all sincerity advise his client with the bad advice they wanted.

In any case, the judge understood, which is why back when they submitted this evidence, the judge correctly ruled that necessary API definitions didn't by themselves constitute copyright violation. Besides, what with them being necessary APIs, there's no way to say they were copied from UNIX versus any other implementation which would also have the same headers.

This is a sad, pathetic attempt to claim that there really was something to the UNIX-code-in-Linux accusation. Especially sad because as they admit this evidence was already submitted, examined, and rejected as valid evidence of what they claim.

--

The enemies of Democracy are

Re:First post by MikeBabcock · 2010-07-12 04:11 · Score: 3, Informative

You're not a programmer, are you?

Reading the man pages for (random example) memcpy tells you how to define it. It also describes what it does. Tell me how you'd re-implement this function without your header definition being identical to SCO's (and yes, the definition is included verbatim right there):

NAME memcpy - copy memory area

SYNOPSIS #include <string.h>

void *memcpy(void *dest, const void *src, size_t n);

DESCRIPTION The memcpy() function copies n bytes from memory area src to memory area dest. The memory areas should not overlap. Use memmove(3) if the memory areas do overlap.

--
- Michael T. Babcock (Yes, I blog)

Re:More details and downloadable archive by Chapter80 · 2010-07-12 04:55 · Score: 3, Insightful

So use a REAL tab character in your code like $DIETY intended, and set your editor to "show tab character". We have wide screens now - there's NO excuse for using anything except a real tab any more.

Actually, there are several huge reasons for not using real tab.

1) Tab means different things to different people. Even when you spell it out, people interpret it differently. In the original sense (i.e. old Underwood typewriters, and the like), tab meant to release the carriage and let it move thanks to the spring, until it was stopped by a tab stop. This means that if you set your tab stop at position 5, and 60, and you pressed tab when positioned anywhere from position 1 to 4, it skipped to 5. Pressing tab when positioned at 5 through 59 (in this example) skipped you to position 60. So, in it's original sense, tab relied on tab stops (literally tiny "tabs" on the top of the typewriter). However, there are few standard document formats (especially for source code) that define the tab stops. You don't see a line in an ASCII or Unicode source code file that says "the tabs for this document are at position 5 and 60". There's no common convention for this.

So people invented arbitrary tab stop conventions. Like "tab stops are every 4 characters" or "tab stops are every 8 characters". But a small difference like this can change the meaning of your document! If you line up code and comments with "real tab characters" every 4 characters, and then someone opens your document with tab stops every eight characters, then the issue is NOT just that things are moved right. The issue is that things do not line up! If I create a nice comment section with a table explaining something, and use tab characters counting on a tab stop every 4 characters, and you open it with tab stops every 8 characters, the MEANING of the comments may change.

Example:
/* Here is a table of all the fields and whether they are changed by this function:

[tab][tab]Passed to function[tab]Returned from function[tab]Changed by function [tab][tab]------------------[tab]----------------------[tab]------------------- A[tab][tab][space][space][space]Yes[tab][tab][tab][space][space][space]Yes longer[tab][tab][space][space][space]Yes

*/
These comments will mean different things depending on the tab stop assumptions!

2) The designers of some editors mis-understood how tab-stops worked, and instead, some made tabs equivalent to a fixed number of spaces. For instance, for some editors a tab is instantly interpreted as 4 spaces. But in the original definition of tabs, it was a "variable" amount of spacing, which took you to a predictable column. These are two vastly different concepts.

3) Those who are smart enough to realize that there is confusion are really annoyed by those who are clueless and inserting real tab characters without knowing that there's an issue.

Copying, and copyright notices.. by mengel · 2010-07-12 04:58 · Score: 4, Informative

The code was first copied, correctly.

The copyright notices in the comments, etc were then replaced with AT&T ones, replacing the Berkeley ones (also replacing the earlier AT&T ones, btw.) I can vouch for this personally, having worked on the "vi" source code both at Purdue (original BSD 4.3 code) and at AT&T (System Vr4 code) -- all of the BSD copyrights, as well as the (bad) poetry, had been removed from the comments in the vi sources.

The Folks from the UCB law school took advantage of this in the counter suit, since the AT&T folks, having changed the copyright notices in the troff sources, ended up doing this this then in the printed manuals. So while AT&T was suing about vague things like including code derived from code derived from code they wrote; UC Berkeley countersued about printed, published, paper manuals, where AT&T was clearly publishing them without the UCB copyright and license info. Clear, obvious, game-set-match, paper copyright violations.

So rather than have to find and "Destroy all Copies" of SystemVr4 manuals (including those published in turn by licencees like HP, IBM, etc.) AT&T agreed to drop their initial suit and make the countersuit go away.

--
- "History shows again and again how nature points out the folly of men" -- Blue Oyster Cult, 'Godzilla'

Re:First post by dgatwood · 2010-07-12 05:32 · Score: 2, Informative

It doesn't really matter. Headers almost certainly cannot be copyrighted anyway. A header file is really no different than a phone book. It is a purely factual description of an interface. Phone books were held to not be copyrightable in Feist Publications v. Rural Telephone Service. In that case, the court ruled that although creative aspects of a collection of facts can be copyrighted (which facts to include, the order and style of the information, etc.), the facts themselves are facts and cannot be copyrighted.

An interface definition is a fact. There is only one interface definition that correctly describes a given interface. Change even one letter (not including comments) and code written against that interface will not compile without modification (unless you're doing something funky with C preprocessor macros, but even then, what the compiler eventually sees must be identical). Therefore a header is nothing more than a collection of facts. So long as the organization of the header is substantively different from the organization of the original header and the new header does not contain any non-paraphrased comments from the original, it is unlikely that copyright violation has occurred.

And Sega v. Accolade seems to support that assessment, along with several other reverse engineering cases.

--

Check out my sci-fi/humor trilogy at PatriotsBooks.

Re:More details and downloadable archive by gwjgwj · 2010-07-12 07:01 · Score: 2, Informative

God is real, unless explicitly declared as integer.

Re:More details and downloadable archive by tomhudson · 2010-07-12 09:19 · Score: 2, Interesting

Of course it's insane. We've had 7 years of insanity - why should it suddenly stop now?

The latest speculation is that insiders suspect SCO will be in Chapter 7, and it's time to start making with the "plausible deniability" game because the creditors will be getting a closer look at SCO's internals.

Re:First post by Chris+Burke · 2010-07-13 04:11 · Score: 2, Insightful

It doesn't matter how often people say this, it won't make it true. Case history even in the early '80s already confirmed that it was fairly difficult to prove that derived firmware work had been created (for reasons I explained in the '99 case). IBM, unlike you, had smart enough lawyers to know that they weren't going to be able to protect their whole PC over a few kilobytes of ROM BIOS.

It doesn't matter how often you say IBM didn't want to stop Compaq, it only demonstrates your ignorance. You're nearly right about the rest, though, in that IBM's lawyers understood copyright law well enough that once the facts of the case were in the open, they knew that they didn't have a chance and endless appeals would be pointless.

You seem to think every company is like SCO, and will pursue a case indefinitely if they want to win regardless of facts, so if they accept a defeat, it means they didn't want to win. Which is cute and funny and stupid.

That's ridiculous and you know it. If you look at what X's work does and how it works, and use that information to copy what it does, then you are using X's creative input in order to make a creative work of your own. Thus a derivative work has been created.

And here you demonstrate your ignorance regarding what constitutes a derivative work, and then in your analogies ignorance of the specific situation. Maybe read up on the legal code and acquire a clue.

Today, a "clean-room reinterpretation" is used fairly honestly: you can only read what you're not restricted by or from reading. IOW, you implement from published licence-unencumbered specs. 30 years ago it meant, "Pretend that a derived work hadn't been created by handwaving an imaginary wall. This wall ignores the nature of creative expression in software and takes advantage of a technically illiterate court."

The wall was real, no handwaving, otherwise Compaq would have lost the case. And the same technique is used today which you'd know if you had any relevant experience and weren't using your imagination in place of knowledge -- the court exceeding you both in technical experience and knowledge of the law despite your hilarious claims otherwise. The main difference today is in that word "license-unencumbered" since back then software patents were much more rare. Had IBM held patents on pieces of the BIOS, not even a clean-room implementation would have been legally safe. But I'm sure that's not the point you were trying to make, since it's far too intelligent.

You're angry because you know that so much of the geek dream lore surrounding the development of PC clones is sophomoric bullshit.

I'm laughing my ass off. You're mad because your attempt to re-imagine reality, the public record, and established case history is failing so spectacularly.

BIOS. Clones were born, IBM moved on, and the victory for straight "piracy" was rechronicled lest the nascent commercial PC software market be scared away. Perhaps had people such as yourself not been so dishonest at the time, we'd see a very different software landscape where copying is considered far more acceptable.

Bwa ha ha! Oh yeah! If IBM and others hadn't realized there was a legally valid way to implement "secret" functionality, they totally would have been happy to have people outright copy later! Please. The connection to "straight piracy" is in your imagination only. The rest of the engineering profession knows that reverse-engineering is important to the point where they were even able to get exceptions in the DMCA for it. The "pro-piracy" crowd you are so hilariously trying to paint the accurate historical view as coming from has not such power. IEEE and ACM? That's a different story.

IBM hadn't moved on even in the 90s, btw, when I was there. They were still bitter over losing control of the PC market, though their impression of the case was the same as everyone else, at least those who know what happened. I.e. not you. But keep it up! You're cracking me up here with your "truth".

--

The enemies of Democracy are

Slashdot Mirror

Claimed Proof That UNIX Code Was Copied Into Linux

138 of 578 comments (clear)