Slashdot Mirror


Controversy Over San Francisco Public Transportation Data

paimin writes "A struggle is breaking out in San Francisco over whether the developer of a publicly-funded installation of real-time tracking for the San Francisco Municipal Transit Agency has a right to control the use of data from the system. The situation is not totally clear, but this sure seems like an attempt to use patent threats to hijack public data. The city paid for the system, and the developer claims he lost money on the deal, so now he's shutting down applications like Routesy and Munitime that use data from the system unless they license the 'copyrighted' data from him."

31 of 111 comments (clear)

  1. Whoops by njfuzzy · · Score: 2, Insightful

    Just because you lose money doesn't mean you can change the rules after the fact. I guess this guy shouldn't have bid so low to get that contract.

    --
    My Photography - http://ian-x.com
    The Deathlings (comic) - http://thedeathlings.com
    1. Re:Whoops by Anonymous Coward · · Score: 5, Insightful

      It's called 'renegotiation'.

      Others call it blackmail.

      Whatever. he dude's playing a dangerous game.

    2. Re:Whoops by Paradise+Pete · · Score: 3, Insightful

      NBIS may own copyrights on that data because they have invested tens or hundreds of thousands in employees and installing those transponders. They can only recoup the cost + make a profit by charging tens of thousands per month.

      Their motivation is clear, but data is not typically copyrightable. That they have invested money and want to make a profit doesn't change that.

    3. Re:Whoops by vux984 · · Score: 3, Insightful

      It's just business -- you can't walk into a BMW dealership and demand they give you a new car for $1,000. The seller sets the price, and if you don't like it, don't buy it.

      But if you do walk into a BWM dealership and ask them for the best price on a BMW complete with a navigation system that lets you see where you've driven, how fast, what your fuel efficience was, etc, etc, etc.

      They turn around and sell you a car; but it turns out it was below their cost.

      They can't turn then around and say that all the data produced by the navigation system actually belongs to them and that you need to pay copyright license fees in order to see the data your navigation system has been collecting for you... (er for them??).

    4. Re:Whoops by Nutria · · Score: 3, Insightful

      If NBIS owns (copyright owner) the data, he has to pay whatever they charge. If the San Francisco local government owns the data, then it's probably public data, I'm not sure. NBIS may own copyrights on that data because they have invested tens or hundreds of thousands in employees and installing those transponders.

      I work for the company that runs a big chunk of E-ZPass, and even though

      • the transponders are built by a private company,
      • "rented" to citizens from stores leased by us, a private company,
      • they send their money to us, and not the government,it would never even occur to us to treat it as if it were our data.

        Someone in SanFran City Hall is doing a piss-poor job of contract management!

      --
      "I don't know, therefore Aliens" Wafflebox1
    5. Re:Whoops by Dare+nMc · · Score: 3, Informative

      they cant...you need to pay copyright license fees in order to see the data your navigation system has been collecting for you... (er for them??).

      The auto industry is one of the culprits that does almost exactly that with the fault descriptions, and related data logs from "their" controllers. IE many data parameters cannot be looked at our changed with any of the data readers available to non-dealers. Sure they are required to allow these readers to exist and show some standard faults and data in a open format. But most of the data logged on your car, will require you to pay money to a licensed dealer to access. This may be justified while the car is under warranty, but there is no unlocking, or accessing that data for free once the warranty has expired.

    6. Re:Whoops by Nekomusume · · Score: 2, Insightful

      It's called 'fraud'.

      He's claiming the data is his under copyright. You can't copyright facts. Thus, fraud.

  2. Lost money? by mmarlett · · Score: 3, Insightful

    The developer, at least in the linked articles, does not claim that it has lost money on the system. It simply claims to own the data and that it has licensed the exclusive rights (from another private company) to develop with the data. The question becomes, "So, OK, you have paid to develop this data, but why? It is, after all, public data."

    1. Re:Lost money? by Ironsides · · Score: 4, Informative

      The question becomes, "So, OK, you have paid to develop this data, but why? It is, after all, public data."

      This gets into the contracts and the "data rights" agreements. For example, there are a few different contracts that can be set up even when a government pays a company to develop an application.

      No Data Rights: The customer (government) buys the application and can use it as is. The customer gets no detailed information, source code or redistribution rights, just an end product.
      Trade-off: The developer charges less for development as they believe they will be able to sell it elsewhere or further develop it as the sole source.

      Limited Data Rights: The customer buys the application and has full access to the detailed information, source code, etc. However, it can not be redistributed for a number of years (say, 5). After that number of years, the customer has full data rights.
      Trade-off: The developer charges slightly more for development, as they will not have a monopoly on the product after a few years.

      Full Data Rights: The customer has full access to everything necessary to duplicate and modify the product immediately.
      Trade-off: The developer chargers more as they can not guarantee that they will make any further money off of the product.

      It's like professional photographers. It's a picture of you, but if you want a copy, you're going to have to pay for it. If you want the negatives, you're going to have to pay for those as well. There are further variations that combine these ones, but they give you an idea of the three types that get modified for an actual contract. From the article, it sounds like NBIS is trying to claim that SF doesn't have the data rights to redistribute the information beyond a specific set of applications/methods. To figure out what the truth is, we would need to read the contracts.

      --
      Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
    2. Re:Lost money? by mmarlett · · Score: 4, Informative
      The city of San Francisco says that it is a no data rights agreement. From the article:

      Muni spokesperson Judson True says ... that, no, Muni owns the data in question and that the public is, of course, entitled to access it. In fact, he went even further: Muni isn't just giving us all permission to access the data, they're also committed to finding ways to make it easier to get to it.

  3. This is only the beginning by HangingChad · · Score: 3, Interesting

    It may be bus arrival times in San Francisco today, but this whole notion of data being exclusive property isn't new and isn't going away. And if Bilski stands and ends up partially undermining software patents, then I would hazard a guess that more companies are going to try monetizing the data aggregates and outputs. Even without Bilski as software becomes more of a commodity market, then data and data aggregates will become the value market.

    This isn't a new concept. The public pays for scientific research at an institution of higher learning also funded by tax dollars, yet sometimes the only way you could get a copy of the results is pay for an expensive subscription to a scientific journal, which claims copyright on the published data.

    This case probably isn't a good example and the developer trying to be the data gatekeeper is going to lose, but it's only the beginning. There will be more.

    --
    That's our life, the big wheel of shit. - The Fat Man, Blue Tango Salvage
    1. Re:This is only the beginning by wjh31 · · Score: 3, Informative

      "I would hazard a guess that more companies are going to try monetizing the data aggregates and outputs."

      see http://tech.slashdot.org/article.pl?sid=09/05/19/1846258 for wolfram¦alpha copywright claim over its outputs

    2. Re:This is only the beginning by ScrewMaster · · Score: 4, Interesting

      Look at the outfits that monetize the NOAA's data: that's public information as well. The NOAA was "publishing" this information in a very complicated binary format, and these outfits were making a ton of money in converting it to other purposes. I remember reading here on Slashdot a couple years ago that the government was thinking of making weather data available in XML or some other standard format, and that a couple of these outfits went after them in court to try and prevent it (thereby preserving their distribution lock.) I don't know what the eventual outcome of that was.

      --
      The higher the technology, the sharper that two-edged sword.
    3. Re:This is only the beginning by igjeff · · Score: 5, Informative

      They lost, and they lost rather completely.

      Here's a starting point for exploring some of this data. There's probably more places where this data is available from the NWS in very open formats, and I believe more is to come.

      http://www.weather.gov/rss/

    4. Re:This is only the beginning by bitingduck · · Score: 2, Interesting

      This isn't a new concept. The public pays for scientific research at an institution of higher learning also funded by tax dollars, yet sometimes the only way you could get a copy of the results is pay for an expensive subscription to a scientific journal, which claims copyright on the published data.

      That model is starting to go away (with the publishers kicking and screaming). The US government is starting to (started a while ago) include clauses in grant contracts that limit the exclusive data rights of the investigators, and also require gov't funded authors to use copyright transfer agreements that give the publishers a limited time for exclusive publication. I haven't been keeping up closely, but it's inevitable and accelerating.

    5. Re:This is only the beginning by gemada · · Score: 2

      Look at the outfits that monetize the NOAA's data

      please stop using the word monetize. it is a weasel word.

    6. Re:This is only the beginning by jvkjvk · · Score: 2, Interesting

      but isn't that the point? the only people who would 'monetize' formerly open data are weasels.

  4. Re:This seems comparable to uni students by ScrewMaster · · Score: 3, Interesting

    I also seem to recall a few occations of similar stuff where workers stuff was claimed by their employers, also tended to go in favour of the employer, usually especially so because it was stated in whatever contract

    There are, however, limits on those kinds of shenanigans. I worked as a developer back in the eighties for an outfit whose employment contract not only entitled them to ownership of any software or intellectual property that I developed while on company time (obviously I had no problem with that) but ANYTHING I did outside of work, even if in a completely unrelated field, for a period of FIVE YEARS after I left their employment. Naturally I refused to sign that little bastard until they fixed it to my (and my attorney's) satisfaction. Even so, I have the feeling there aren't many courts that would have upheld that contract, but I felt it was best to have the worst portions excised.

    The place was run by chimpanzees anyway, with a couple of orangutans in the head office. Yeah, it was a game company, and as employers go they made Electronic Arts look good.

    --
    The higher the technology, the sharper that two-edged sword.
  5. Copywrongs by DynaSoar · · Score: 4, Insightful

    If the data could be copyrighted, ownership would go to the creator of the data. That would be the city of SF, not the programmer. They created the data with the software they contracted him to produce for them, then they ran their software on their hardware, watching their mass transit movements and recording the results on their computers. The programmer could not own the data because he could not create it. He has no mass transit system with which to do so.

    In any case, it is highly unlikely anyone could copyright the data. Copyright requires at least minimal creativity. Data produced automatically requires no creativity. In addition, works produced by the government (ie. by the public for their own good via their chosen representatives) cannot be copyrighted.

    The programmers actions are likely to be considered by the court (unless he backs down very quickly) blackmail. These days, if the actions threaten public safety, they might even be considered terrorism. Under these charges, even if he backs down the damage is done and he might well be looking at many years in prison. The SF DA could file such charges to scare him as they often do with other charges. But terrorism charges tend to go all the way through once the process is started. To prevent others from trying this stunt, they may well do just this. And I hope they do.

    The contract may have given him the right to use the data. There's no doubt it my mind that it did not give him sole use, much less state that he also had sole control over its use. There's no way the SF city attorneys would have allowed that in a contract.

    --
    "I may be synthetic, but I'm not stupid." -- Bishop 341-B
    1. Re:Copywrongs by langelgjm · · Score: 2, Insightful

      In any case, it is highly unlikely anyone could copyright the data. Copyright requires at least minimal creativity. Data produced automatically requires no creativity.

      True, but the "minimal" creativity is extremely minimal. Meaning that if anyone did any selection or editing or massaging of the information in any way, it might pass the threshold.

      I once asked my IP law prof if images captured by automated cameras (e.g. from toll booths) could be copyrighted, since there was no human involvement, and it was basically a purely mechanical process, devoid of creativity. He agreed with me in spirit, but said that even the act of installing a camera, or setting up an automated camera, would likely qualify as enough creativity in courts these days.

      In addition, works produced by the government (ie. by the public for their own good via their chosen representatives) cannot be copyrighted.

      That only applies to works of the federal government. State laws vary. (An added caveat is that the federal government can acquire copyrighted works).

      --
      "Anyone who [rips a CD] is probably engaging in copyright infringement." - David O. Carson
  6. Re:Me things he looses by langelgjm · · Score: 2, Informative

    Are you sure about that? Maps are databases.

    Not in copyright lingo they aren't. Interestingly, maps were one of the original works mentioned in the earliest federal copyright act. "Databases" has a very different meaning; US copyright law has been loathe to grant protection to facts themselves. IIRC, the EU does have copyright protection (or some kind of protection) for databases.

    --
    "Anyone who [rips a CD] is probably engaging in copyright infringement." - David O. Carson
  7. Re:This seems comparable to uni students by TinBromide · · Score: 2, Insightful

    Even so, I have the feeling there aren't many courts that would have upheld that contract, but I felt it was best to have the worst portions excised.

    Better a few billable hours up front than hundreds of billable hours in a court case later.

    --
    Is it sad that I am more likely to recognize you and your posts by your sig than your name or UID?
  8. Arrival times != timetables by abigsmurf · · Score: 4, Insightful

    To me the author of the article is deliberately confusing public timetables with transmissions showing the position and expected arrival times of a bus.

    If the position and expected arrival time is calculated on the fly, that's more of a service than just pure publically available data. If the condition of this service being provided is that the data is confidential or restricted to licensees. The provided data is processed real time using their equiptment and code. It's one thing to say "at 2:12pm the bus is 5 miles from transponder 2A, 1/5 mile from 4B and 8 miles from 1E" which is pure statistics (albeit collected from private equiptment), but to say "it's just left the anystreet stop and will arrive at noname plaza in 6 minutes in the current traffic conditions", could be seen as editorialising. If you're able to get as much of this information whenever you want, it then goes beyond fair use too.

    An extreme argument of what the author is saying could be this: The fact that Michael Jackson died is public fact, a 400 word article going into the detail of how he died is copyrighted and subject to fair use restrictions. The interesting argument that applies here is, if that same news report was machine generated based on a few facts fed into it and the rest padded out through AI, could you copyright that?

  9. Re:Me things he looses by bitingduck · · Score: 2, Informative

    IANAL, but I believe it also depends on how the facts are organized. If its a simple list by date or time alphabetical order, then they generally aren't copyrghtable. Being able to copyright that would prevent someone else from collecting the same facts independently and publishing them in an obvious order. If there's some creative addition to the data structure or organization then the database may be copyrightable, while the facts themselves are still not copyrightable.

    I.e. someone could publish a database with some creative structure and relationships and you can't copy that and republish with the same structure, but if you just take the facts and organize them some other way you can do that.

  10. Re:Me things he looses by Paradise+Pete · · Score: 3, Insightful

    Maps are databases.

    Maps are the presentation of data, not the data itself.

  11. Re:Me things he looses by nbauman · · Score: 4, Interesting

    If I pay to collect the data & generate a database that doesn't mean that I can be forced to give the data away. But also, I can't stop anybody else from collecting the data & making their own database. If you don't want to buy it from me go forth & make your own database

    That's an interesting argument, and it's logical from where you're coming from.

    But the copyright law comes at it from a different direction.

    If you go to a lot of effort to collect data, that's commendable. In copyright law, that's called "sweat of the brow."

    But in copyright law, you can't copyright data that you've collected just by sweat of the brow. It also takes some kind of creativity or innovation or judgment.

    That's what the Supreme Court decided in Feist. Phone numbers can't be copyrighted. http://en.wikipedia.org/wiki/Feist_Publications_v._Rural_Telephone_Service

  12. What part is public, and what part is proprietary. by Animats · · Score: 2, Informative

    I met the inventor of NextBus some years ago at the Hacker's Conference. What they get from the bus is position, speed, and a few bits of data like the destination sign setting, "doors open" and "wheelchair lift deployed". After much crunching on this data, info like "Next bus at this stop: 6 minutes" comes out. Over time, as more data comes in, the predictions get better. It's a good machine learning problem, because you have actuals; you can tell when the bus eventually gets to the stop, so you have hard data from which to validate the prediction algorithm. You don't even need a map.

    The early business plan for NextBus had a little dedicated receiver they were going to sell to consumers. The idea was that you have one at home, and it tells you the number of minutes until the next bus gets to the stop near your house, so you know when to leave the house. That was before the World Wide Web, so that wasn't necessary.

    Originally, Muni management hated the system, because it was too honest about their bad service. But after much political effort, eventually it was deployed on a few lines, where it was very popular. Then it was put in everywhere.

    Muni probably owns the raw data, and NextBus probably owns the predictions. I'm not sure on that, though.

  13. Re:Me things he looses by mrchaotica · · Score: 2, Insightful

    But maps are "facts themselves." Unless, of course, you want to argue that there's "artistry" in deciding what kind of information to put on the map -- but then you could argue that there's exactly the same kind of artistry in deciding what to include in a database!

    In other words, if the Copyright Act of 1790 made a distinction between maps and databases, then it was wrong. Either both should be copyrightable, or (better yet) neither should be!

    --

    "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  14. Re:Me things he looses by langelgjm · · Score: 2, Interesting

    You're absolutely right. The boundaries in copyright law are pretty fuzzy when it comes to things like this.

    Maps were almost certainly included in the original act purely to appease mapmakers, not for any principled theory of what copyright should or should not apply to. Retrospectively, however, people justify maps by saying that they do involve artistry, and it's not totally unreasonable. You could even argue that more artistry is involved in making a map than in taking a photograph.

    The most well-known case dealing with whether a collection of facts can be copyrighted is Feist. But the issue is complex, because while facts themselves aren't copyrightable, enough selection of facts can produce a result that is. What is "enough" is usually determined by a court case (or a settlement).

    --
    "Anyone who [rips a CD] is probably engaging in copyright infringement." - David O. Carson
  15. Cars in San Francisco by Cassander · · Score: 2, Insightful

    After I moved to SF, I sold my car. No need for it here.

    Unless, of course, you ever want to leave SF for any reason. I know there's some decent public transportation to surrounding areas, but it's far from comprehensive, especially after the sun goes down.

    Speaking of public transportation, it sure would be nice to get BART up here (I live in Santa Rosa). Fuck you Marin County, fuck you very much.

    --
    Knowledge != Intelligence
  16. Is it the locations or the predictions? by Anonymous Coward · · Score: 2, Interesting

    Ahh, but in this case the data that is being collected is bus location information. The information the everyone wants access to is the bus route timing predictions. Those are created via some complex code that looks at the location information and other parameters and information to produce a prediction of when the bus is to arrive. This information may well be copyrighted as it is not "in plain view" but is generated via some creativity.

    Now, if someone else were to generate their own predictions from the actual raw information that is one thing. But just redistributing the predictions generated by NextBus sounds like taking propriatory information and distributing it without the consent of those who created it.