GNU Releases Free Documentation License
Bananenrepublik writes "The GNU Project has released the GNU Free Documentation License. It is meant 'to assure everyone the effective freedom to copy and redistribute it [the documentation], with or without modifying it, either commercially or noncommercially.'"
I think this new license can only be a good thing. There is not much awareness in the community of the FSF's position on free documentation. The existence of this license will hopefully cause people to consider the issue, and decide for themselves what they believe, rather than just being unaware of the issue.
perl -e 'fork||print for split//,"hahahaha"'
Copyright (c) YEAR YOUR NAME.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.0 or any later version published by the Free Software Foundation; with the Invariant Sections being LIST THEIR TITLES, with the Front-Cover Texts being LIST, and with the Back-Cover Texts being LIST. A copy of the license is included in the section entitled "GNU Free Documentation License".
What a novel license.. Kudos to RMS and the rest of the GNU crew!
EraseMe
First thoughts: Excellent stuff! I'm glad to have "the GPL of document processing" nicely laid out. ;) Roll on w3c and the DOM and any other transparent document models!
I also approve muchly of "Opaque formats include PostScript, PDF, proprietary formats that can be read and edited only by proprietary word processors, SGML or XML for which the DTD and/or processing tools are not generally available, and the machine-generated HTML produced by some word processors for output purposes only." I'm not convinced about some of the layout specs ("first page", "title page", "adjacent pages" - can't we have a one-paragraph "this is GDL'd" with pointer to appendix Z?) It also needs to define "compilation copyright" - what is it? Is it Yet Another American thing? (I thought that around these parts, anything you "write" was automatically copyrighted... confusing.) If the GPL is but one open-source license for software, is there an "open-source" definition for documents? (How much is www.transparent-source.org going for?
~Tim
--
Rushing on down to the circle of the turn
Wow, very good you can read. Now go back and carry on where you left off, you've missed most of the license, like the bit that modified versions must be distributed under the same license as the original, which is presumably the whole point.
perl -e 'fork||print for split//,"hahahaha"'
Ok so I can't write, I meant "like the bit that said modified versions must be distributed under the same license".
perl -e 'fork||print for split//,"hahahaha"'
RMS just wrote in to say that there has been a few minor last-minute corrections to the license. I'm sorry that I do not have any more details at the moment, but please do not use this license just yet.
That's what I've been waiting for :)
Is it just me, or does the Opaque/Transparent
distinction seem too vague to be enforcable?
The portion indicating that HTML is sometimes
but not always opaque seems the best example of
this, but overall the distinction seems to be
problematic.
For every problem, there is at least one solution that is simple, neat, and wrong.
I am not a lawyer. But finally there is a public licence for documents. I always thought that the old way (simply posting the GPL as it is at the beginnig/end of the document) was not correct. Simply because there was no mentioning of "documentation" but only of "program" (and such). The only thing that makes me sick is that we need a thing like that. Aren't things posted publically public domain unless stated so? Will I need to add a General Public Posting Licence to post on Slashdot without allowing Cmdr.Taco to get on the exclusive rights of my posting? And sell them to Holliwood? -- A First Timer Anonymous Coward
Note that someone can create their own non-free work, label it all "invariant", place it under the FDL and then combine it with your work to get a non-free derivative work. Anything directly derived from your sections remains free in the derivative work though. I suppose this is something like how the Lesser GPL works.
Also note that untested licenses are at least as dangerous as untested software! You probably want to wait a while before actually *using* this license. Remember it is only version 1.0 and be careful.
perl -e 'fork||print for split//,"hahahaha"'
The line seems to be too arbitrarily drawn. Postscript is not transparent? That depends. I know quite a few Postscript hackers who can directly edit PS source without batting an eyelid. But PS generated by TeX itself, is really obscure largely becuase of all the font declarations.
Is there a more satisfactory way to address this issue?
I think it can be agreed this is a Good Thing©. I'm curious if the GNU Doc copyleft will be limitted in use to HOW-TOs and such. Will Tim O'Reilly sell copylefted books knowing that the text can/will be available and "free" somewhere. He's done something of the sort with the Samba book IIRC.
On the other hand perhaps this will act to increase mainstream doc pulication. Any publisher can stock the shelves with a complete work without author fees. I admit ignorance as to how heavily that cost weighs against total production.
Oh, by the way:
These comments copyright (c) 2000 Jeff Kustermann. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.0 or any later version published by the Free Software Foundation; with the Invariant Sections being LIST THEIR TITLES, with the Front-Cover Texts being LIST, and with the Back-Cover Texts being LIST. A copy of the license is included in the section entitled "GNU Free Documentation License"There's a spider on your shoulder.
Here is the problem.
A publisher like O'Reilly produces technical manuals, under the license an author chooses. If a would-be author (who is about to do a lot of work) wants advice on licenses, O'Reilly is stuck between a rock and a hard place, they want to support open source, but they have to admit that the author will probably make more if it is not an open source license on the book. For some reason people like writing software but find documentation a chore.
However what seems to work very well is if O'Reilly can work with the author to produce both a book and connected documentation. An example is Programming Perl where the online documentation started life as the book rearranged (and without the bad jokes). If the online documentation is exactly the book, people act as if the book is a cheap rip-off. If there is a clear division, then they don't.
But if you do the above, the online documentation gets maintained and the dead tree version does not. At some point you need to re-synch. But what pair of licenses allows that?
Personally I think that it would be good to create some sort of arrangement where the exact text and arrangement of a document may or may not be free, but it and all its derivatives must allow the technical information in them to be free to use in any other document using either of the pair of licenses. IOW O'Reilly or anyone else can come out with clearly differentiated books, but the information contained in such has to be available as free documentation.
But the devil is in the details...
Cheers,
Ben
My usual seat in the cluetrain is at A HREF="http://pub4.ezboard.com/biwethey.ht
This is interesting:
Translation is considered a kind of modification, so you may distribute translations of the Document under the terms of section 4. Replacing Invariant Sections with translations requires special permission from their copyright holders, but you may include translations of some or all Invariant Sections in addition to the original versions of these Invariant Sections. You may include a translation of this License provided that you also include the original English version of this License. In case of a disagreement between the translation and the original English version of this License, the original English version will prevail.
Does anyone see any true sense to this clause? We still have to refer back to the original document in order to stick to the license, so I would imagine this would just make things more complicated for such projects as the LDP when they start moving towards more sophisticated language support.
EraseMe
This Troll is protceted under the GNU FDL. Feel free to use this troll as you wish, so long as you include this licence and credit to it's original author, me. If you release any troll based on this one, you also must release it under the GNU FDL, and credit must be given. In addition, if reading this troll inspires you to write a troll of your own, credit must also be given to the original author, me. If someone asks your opinion of trolling, trolls in general, this troll, particular Trolls and thier work, credit must again be given to me. By accepting this licence, which you did inadvertantly by reading the first sentance (to quote the latin, "I 0wn j00!"), you accept that in your mind, heart and actions, the name gnarphlager is synonymous with trolling, and this is a Good Thing. You accept that trolling is a vaild form of expression, and you look forward to and enjoy all trolls. You accept that gnarphlager is your one and only true god, and you acknowledge no other gods, lest gnarphlager informs you of thier existance. Text of the Troll follows:
The Troll:
Richard Stallman and I used to eat cheese together. Ah, those were the days, back when the internet itself was a wild and Darkly Darkly Wood. But then he ate MY goat, and I wasn't too happy about that. So we don't talk as much any more, but we're still on friendly terms when we see each other.
thankyoutheend.
Copyright 2000, by gnarphlager under the GNU FDL. For terms of this licence, see above
Just out of curiousity, was this read/edited by a lawyer to make it legally ironclad? Seems straightforward enough, just like the GPL, but just curious as to how well it can stand up in court.
Touche'
I wonder if a /. shorthand will evolve for an "understood" GPL'ed comment. CmdrTaco will have to insert a link something like: /. comment copyleft definition so we can just include it at the bottom of a post.
There's a spider on your shoulder.
I've speed read the whole deal and find it seems to be lacking some key stuff:
1. Excerpts. What if a print magazine is doing an article on Widgets, and wants to quote two paragraphs from the GDL'd Widgets Manual. Is it possible? Does the Magazine have to GDL itself? GDL that article? Since the magazine has a circulation of >100 does that have an impact?
2. Private use. Some guy wants to take a whole GDL document, modify it with his comments and give it to the 115 people in his lecture class. Does he also have to give them floppies since the distribution is > 100?
3. Inclusions. Some guy is writing a GDL'd document and wants to include a longish section of a non-GDL'd document. Is this illegal, as it would be with code under GPL? Suppose I want to quote a large chunk of text that is genuinely public domain. Does the license now infect that text in other places?
I was never a massive fan of GPL, although it has its uses. I think GDL will have its uses too, but it is a minority license suitable only for a certain set of technical documentation.
-----
From the document:
Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed.
This is certainly not allowed by the FDL...
Lars
__
Reality or nothing.
You give the documentation away, and you make money by...? By what? Support of the documentation? That is, you get paid for adapting, modifying, and or re-writing the documentation? I don't think this works.
I'd really like to see the incentive model for writing free documentation. Programmers do free software for fun and fame. That's their compensation. Writing documentation, however, is not fun, and also doesn't give one any brownie points in the community. Writing documentation is just plain hard work. What's the compensation for that work?
Also, book writing (even large books) is still a one-person show (as opposite to software writing). And if the book you write is good, you can easily make some money out of it. So what is the incentive to give it away? You get credit for software by the community. As a documentation writer, no one even remembers your name in the community. So going to a traditional publisher seems a more natural way for one, in terms of money, as well as fame. And if you don't trust traditional publishers (or don't find one), you can still publish your work yourself.
oops -terminal klutz mode entered
this post has been accidentaly moderated down.
apologies
treefrog.
Easiest way might be to contact the original author and ask for permission. The author holds the copyright, and no one in the world can deny the author the right to release (part of) his/her work under a different license (has happened with GPL's software too, e.g. Ghostscript).
2. Private use. Some guy wants to take a whole GDL document, modify it with his comments and give it to the 115 people in his lecture class. Does he also have to give them floppies since the distribution is > 100?
The GDL says so, yes. Same thing as with GPLed software. The license grants redistribution, but requires to follow the rules.
3. Inclusions. Some guy is writing a GDL'd document and wants to include a longish section of a non-GDL'd document. Is this illegal, as it would be with code under GPL? Suppose I want to quote a large chunk of text that is genuinely public domain. Does the license now infect that text in other places?
The GDL has the same virus-like behavior as the GPL license for software. And like the GPL prevents the inclusion of e.g. BSD licensed software in a GPL software, the GDL prevents the same for including documentation. Not a too clever move from the FSF.
As long as I'm posting, regarding "quot[ing] two paragraphs from the GDL'd Widgets Manual": this is what is called "fair use"; copyright restrictions do not apply to small snippets of a large document.
Including public domain text should be safe, but I'm not sure about other licenses. It would be ironic if you could not include GPL'ed code (beyond fair use) in a GDL'ed manual.
I don't see how this would cause problems for the LDP (or anybody). ??
You better give us some proof that "RMS wrote in", otherwise it looks like you're heading down to the -1 section.
The lecturer with 115 students doesn't have to give out floppies - he/she can make them available on request. Better still he/she can put the source ("transparent copy") on a website (this is the 21st century you know!), that way everyone gets the benefit of the new comments.
"What I look forward to is continued immaturity followed by death."
Since you don't seem to know who Jonas Oberg is, I would surmise that's you who's heading into -1 territory pretty soon.
PostScript varies in its transparency, and the PostScript used to describe a documentation will probably be generated by software (instead of handcoded) and will be opaque.
The PostScript I handcode is easily understood, but since it's just to make tape and CD covers, it's very basic, a bunch of lineto's, fonts and shows. For example:
%!PS
0 setgray
1.5 setlinewidth
72 72 moveto
436 72 lineto
436 416 lineto
72 416 lineto
72 72 lineto
stroke
/Americana findfont 18 scalefont setfont
108 382 moveto
(Grateful Dead) show
108 358 moveto
(11/11/73 Ip IIp) show
/Americana findfont 12 scalefont setfont
120 334 moveto
(Ip) show
108 318 moveto
(Weather Report Suite Prelude>) show
108 302 moveto
(Weather Report Suite Part 1>) show
108 286 moveto
(Let It Grow) show
120 270 moveto
(IIp) show
108 254 moveto
(Noodling>) show
108 238 moveto
(Dark Star>) show
108 222 moveto
(Mind Left Body Jam>) show
108 206 moveto
(Eyes of the World>) show
108 190 moveto
(China Doll) show
showpage
You don't even need to know PostScript to get the gist of what I'm doing.
At the other end of the readability scale are the desktop publishing packages. In PostScript I've seen from Frame (IIRC), each letter was individually placed, and most of the PostScript commands were redefined, so instead of something nearly transparent like:
108 238 moveto
(Dark Star>) show
You would get something like the following (I don't want to bother finding a copy of Frame to verify):
108 238 mt
(D) sh
109 238 mt
(a) sh
110 238 mt
(r) sh
111 238 mt
(k) sh
and so on, and so on. I think the reason they call out each letter individually is for letter by letter placement, Frame decides the typography instead of the PostScript interpreter of the printer.
I guess I can understand the GPL requirements now, most desktop publishers generate opaque PostScript.
George
...Version 1.0 or any later version published by the Free Software Foundation...
I've always found this a bit odd with the GPL, and now the FDL. What happens if an item in the license changes, which the author/coder no longer agrees with?
At any rate, the point to "compilation copyright" and the "title page" stuff is that the goal is to provide a way of combining several requirements, including:
The net result can't be completely "clean."
As for "compilation copyright," the point of that is that a collection of documents can be copyrighted even if the components aren't. For instance, a phone book consists largely of a list of names of people and their phone numbers. The individual components aren't copyrighted, but the collection or "compilation" of them is.
In the same way, William Shakespeare's works are long out of copyright, but if I make a book that includes the plays along with some of my own commentary, the collection may become copyrighted, and you can only make copies at my sufferance.
The relevance is that there are vendors that put together collections of things like HOWTOs, and the GDL needs to have some rules to indicate how it interacts with the needs of such "collections.."
If you're not part of the solution, you're part of the precipitate.
What the hell was the moderator smoking when he marked this 'offtopic'? This is the GNU webmaster, and he is telling the truth; just go and look.
The other problem is that there are all sorts of possible pathological cases.
For instance, Postscript is described as an "Opaque" format, but supposing someone follows the dictums of TINYDICT, and writes their documents in raw Postscript, then despite the fact that Postscript is usually considered "Opaque," it is, in fact, the "Transparent" form.
That's probably the most pathological (and perverse-sounding) case, and is one that I brought up in some discussions on the license last year.
HTML is a necessarily ambiguous form.
In such a case, HTML is the "most transparent form available."
In practice, I don't think this will be a big problem. After all, am I likely to sue someone for releasing "freely," under the "GDL," some documentation in a form that I don't much like? I think not...
If you're not part of the solution, you're part of the precipitate.
Just because somebody posts under the name Jonas Oberg, doesn't actually mean that they are who they claim. Either that or Bill Clinton has posted here before.
Of course in this case it seems that they are who they claim.
I hope this clears things up a bit.
how can a copyleft doc have parts that forbid modification??? oh, rms said it was so, it must be ok.... but is it Open Source...
If the Document specifies that a particular numbered version of this License "or any later version" applies to it, you have the option of following the terms and conditions either of that specified version or of any later version that has been published (not as a draft) by the Free Software Foundation. If the Document does not specify a version number of this License, you may choose any version ever published (not as a draft) by the Free Software Foundation.
- (c) 2018 Hank Zimmerman
Richard Stallman and the Troll used to eat cheese together. Maybe the clown has a friend that also wants to eat me. Between the two of them, they will certainly find me when I am sleeping. I fear my ability to sleep over the weekend... pray for me in church this Sunday.
Well, I just had my first documentation book published (not counting inhouse software manuals), the Samba Administrator's Handbook, ISBN 0-7645-4636-8) so I thought I'd make a few comments from an author's viewpoint.
I'd really like to see the incentive model for writing free documentation. Programmers do free software for fun and fame. That's their compensation. Writing documentation, however, is not fun, and also doesn't give one any brownie points in the community. Writing documentation is just plain hard work. What's the compensation for that work?
Writing is work, boring and tedious, I spent a lot of nights writing when I'd have rather been snuggling with my honey, playing with daughter, building Lego, surfing the web or even configuring my Linux boxes.
I don't think I've gotten any brownie points in the community, though the 5 star review on Amazon was nice. I haven't gotten any book related email either, and I'm not hard to track down (the joy of having a unique last name).
Financially I've done alright though, the advance helped me buy my house.
Also, book writing (even large books) is still a one-person show (as opposite to software writing).
Or a two or three person show, but I get your gist. You don't have 20 member teams writing books.
And if the book you write is good, you can easily make some money out of it. So what is the incentive to give it away? You get credit for software by the community. As a documentation writer, no one even remembers your name in the community. So going to a traditional publisher seems a more natural way for one, in terms of money, as well as fame. And if you don't trust traditional publishers (or don't find one), you can still publish your work yourself.
I could publish any book myself I wanted to, but the printing costs would probably astronomical (unless I used the production printers at work), the distribution costs would be astronomical, and forget getting my books to a brick and mortar bookstore, at the moment, if you want a wide audience for a dead tree book, you probably need to work with a publisher.
Once you do work with a publisher, you can't just write any book you want to. You need to sell your concept to them, submit sample chapters, compromise on what they want to publish, it becomes more of a collaborative effort than one person blindly dumping 400 pages of Word files to the publisher.
It was an interesting time, certainly an ego trip to see my name on Amazon, but I don't think I would do it again for free, the non-financial rewards wouldn't justify all the time and effort.
George
This seems to mesh well with RMS's criticism of companies like O'Reilly & Associates.
/.
Basically, Stallman has criticized the commercial publishers who write and sell manuals for GNU software, because they "scratch the itch" for people willing to pay the bux for their books, but in effect remove the incentive to write good free documentation and manuals for the Free Software.
On the other hand, the FSF tries to generate income by selling books like Stallman's GNU Emacs manual, which is rather pricey.
Interesting stuff for a discussion, glad to see it on
The main problem with GPL'ed software in general is the question "how can I make a living writing free software." Companies like Red Hat, Caldera, and the rest of the Linux start-ups answer this question by providing technical support for a fee. However, not all programs lend themselves to this economic model. While it may be appropriate for complex software like operating systems and server programs, it is not nearly as feasable for desktop applications -- particuarly if they are very intuitive and user-friendly. A program that's easy to use won't need much in terms of tech support.
Besides providing support services, historically the only other significant way open-source programmers have been able to support themselves directly is to write & sell books. (ESR and Larry Wall spring to mind as examples of this model of compensation).
As a programmer, I'd hate to think that after putting hundreds or thousands of hours of my time into writing an open-source program, the only way I could make any money would be thru banner ads and selling tee shirts and stuffed toys. If I wanted to sell souvineers for a living, I wouln't have busted my ass getting an engineering degree. When you pour your blood, sweat, and tears into somthing, you deserve to be rewarded for your effort. If ego gratification is enough of a reward for you, that's fine; but remember that even the most altrustic programmer still needs to provide for himself and his family.
The problem with the free documentation licence is , like the GPL, it has a "viral" nature. Let's suppose I write program foo and release it under the GPL, then release a basic user's manual under the FDL. Because of the viral nature of the FDL, I could not then go write a book (foo In A Nutshell) that expands on the FDL'ed documentation. Strictly interpreted, even quoting a single line of FDL'ed text could render the entire new document FDL'ed. Even paraphrasing the original text might not be enough to get it out from under the FDL, given the translation clause.
Look at the Declaration of Independence : because it's in the public domain, anyone can publish a copy of the DoI without restriction. However, if I take the DoI and intersperse it with a line-by-line analysis of what it means, this derivitive work is fully copyrightable. However, if I did the same thing with a FDL'ed document, I would have to give up all rights to the new work, regardless of if I wanted to or not. I should have the freedom to decide how to assign my intellectual property rights.
Tim O'Reilly has done some great things for the open-source community, has made a good bit of money doing it, and has helped many open-source programmers, and has given a lot back to the community. But even a publisher as open-minded at O'Riley & Assc. would have to think twice about publishing a book that could be copied & resold by anyone.
"The axiom 'An honest man has nothing to fear from the police'
Why is it that the proponents of "one nation under God" are so eager to get rid of "liberty and justice for all"?
I do so much want to eat you. You would make the most pleasing meal. It is good that you suggest finding a friend; I'm sure that the mime would also like to eat you, for you are yummy. I will talk to the mime, and he will not talk back, but together we will be able to find you. Please bathe in garlic on Saturday night.
-the clown
I actually met Richard Stallman once at the Linuxworld Expo in San Jose. I asked for his autograph on a flyer the GNU folks were distributing there, and guess what? He refused to give it to me, saying he would rather I *BUY* an Emacs manual, and let him sign that, so I would be supporting the GNU project. I was broke at the time, so I could not afford to do this, needless to say. My question is, does this new GDL mean that RMS intends to GDL all of his Emacs documentation? Does that mean I can get an Emacs manual for free by copying someone else's, and getting RMS to sign THAT???
O'Reilly has stated before that they publish their books with the license that the author chooses. Yeah, I'm sure they've thought twice about publishing freely-copyable books in this manner, but that time is passed. I'm certain that O'Reilly would publish under the FDL unless they found some serious problems with the license (some of which you describe)
I/O Error G-17: Aborting Installation
Using 2 licenses is fine for a first edition, but the second edition should include corrections and additions submitted to the online documentation from third parties, that gets more complicated...
Cheers,
Ben
My usual seat in the cluetrain is at A HREF="http://pub4.ezboard.com/biwethey.ht
Now it's "insightful". How is relaying a factual statement from someone else insightful? Surely you meant 'informative'
Moderators - might I recommend a (Score:-1, Offtopic) for this post?
Right. I'm off to metamoderate.
Another poster astutely identified updates and new editions as difficult areas. I think the license is not the problem here; logistics are. Samba developers and O'Reilly are still working out these logistics. That's a different story.
There was also a brief discussion of lots of related issues at the O'Reilly Web site:
http://forums.oreilly.com/~publishing
So does the GNU Free documentation license use the GNU Free documentation license? We can have the world's first recursive license.
P.S Ok, apparently it doesn't. bah.
Personally I think that it would be good to create some sort of arrangement where the exact text and arrangement of a document may or may not be free, but it and all its derivatives must allow the technical information in them to be free to use in any other document using either of the pair of licenses.
Vanilla copyright law does that. Copyright covers the specific expression, but not the ideas. If you've got a copyrighted technical reference in front of you, you can't copy the exact wording, arrangement of information, and so on, but you can copy the technical information itself all you want. That may be plagiarism, if you fail to give the author credit and if your new book doesn't add clarity or value, but it doesn't violate the copyright to express technical facts in a new form.
The license seems to imply that a transparent format is one that can be easily modified by a writer - analogous to source.
But looking at the list of suggested transparent formats, I see things like LaTeX and raw HTML. Does RMS seriously think most writers work directly in formatting languages instead of using document processors? Really, for practical purposes, a document distributed in M*cros*ft Word format is much more likely to be easily modifiable by a writer than one in LaTeX input.
(Word may not be the best example, being unquestionably proprietary... but it's a common format and been reverse-engineered enough times that you don't need to patronize MS to read it. Or take RTF, which is also proprietary but whose definition is publicly available last I checked.)
Maybe RMS finds it easier to type in formatting codes than to work in a document processor, but most writers don't. Many writers, in fact, can't. If the point of the transparent-format verbiage is to make sure documents are distributed in a form that writers can modify and improve, I think it's missing the target by a wide margin.
Seriously, this license is intended mainly for software documentation, not novels. If you're writing software manuals and you can't get your head around LaTeX, texinfo or simple HTML then you have no business being a technical writer. LaTeX is very straight forward, I managed to typeset my first document with complicated mathematical symbols (I'm a maths student) within less than an hour.
Microsoft Word format is certainly proprietary, and if you don't know how much of a PITA it is to print, much less change, a reasonably complicated MSW document on a free os then you haven't tried. As RTF, I'm not at all that certain if it is proprietary in any real sense of the word, if the specification is publically available, but I'm no expert so I wouldn't pretend to give an authoritative answer.
The source code for a work means the preferred form of the work for making modifications to it.
IANAL, but that in my mind, that takes care of obfuscated source and pre-processed source (e.g. run it through CPP first).
In fact I have come to believe this is good philosophy too. E.g. if I am setting up a source control system for a group of engineers, the source code is exactly the set of files that human beings edit. It's not the set of files in some particular language(s).