Slashdot Mirror


The New York Times Digitizes Millions of Photos Going Back To 19th Century (betanews.com)

The New York Times is digitizing millions of historical photos dating back to 1896. From a report: The NYT has a massive collection of photos dating back decades, and the plan is to digitize millions of images -- some dating back to the late nineteenth century -- to ensure they can be accessed by generations to come. The digitization process will also prove useful for journalists who will be able to delve into the archives far more easily in future. Until now, historic news articles and photos have been stored on microfilm and in other physical forms. This is not only difficult to catalog and navigate, but also prone to deterioration over time and through use. The newspaper is using Google Cloud for the digitization.

29 comments

  1. Highlander by Anonymous Coward · · Score: 4, Funny

    This will come in handy for catching time-travelers and immortals.

    1. Re:Highlander by jd · · Score: 1

      Not really, you're forgetting The Doctor regenerates a lot.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    2. Re:Highlander by Impy+the+Impiuos+Imp · · Score: 1

      The highlander may be your father.

      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    3. Re:Highlander by nanospook · · Score: 1

      The highlander may be your mother.

      --
      Have you fscked your local propeller head today?
  2. What brand of hard drive did they use? by Anonymous Coward · · Score: 3, Funny

    > The newspaper is using Google Cloud for the digitization.

    But what I would like to know is what brand of hard drive the images are stored on. Also the favorite coffee brand of the main developer.

    Who on earth is interested in what cloud provider they used except for Google's marketing department?

    1. Re: What brand of hard drive did they use? by Anonymous Coward · · Score: 0

      I care. As someone who might need to do a project like this in the future, knowing the tech used could be helpful.

    2. Re:What brand of hard drive did they use? by colinwb · · Score: 1

      Another interested party might be the New York Times if they get a discount for mentioning that Google Cloud is being used.

    3. Re:What brand of hard drive did they use? by Impy+the+Impiuos+Imp · · Score: 1

      > The newspaper is using Google Cloud for the digitization.

      But what I would like to know is what brand of hard drive the images are stored on. Also the favorite coffee brand of the main developer.

      Who on earth is interested in what cloud provider they used except for Google's marketing department?

      I think you answered your own question. It's so important that not only do they mention Google Cloud, but they put "google_cloud" into the name for the link for the story itself.

      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    4. Re: What brand of hard drive did they use? by Anonymous Coward · · Score: 0

      Well, it probably gives the new images a shelf life of under 5 years before Google gets bored and discontinues the platform.

  3. Re: failing nyt by Anonymous Coward · · Score: 0

    Well, one side affect of digitization and DRM is the ability to change history. The Orwellian memory hole described in 1984 has been made real. Need to change sleeping beauty to make it 2018 PC compliant... no problem. When the DVDs and VHS tapes eventually wear out, the only version that will legally exist will be the "remastered" edition where the princess wakes up of her own accord just because.

  4. The real question is by bobstreo · · Score: 2

    how many of the images are considered to be in public domain?

    Just because you digitize something doesn't mean you own the rights to the image.

    Sounds like a combination of cash grab and physical space savings.

    Hope they're using a lossless/high res algorithm to preserve the images.

    1. Re:The real question is by CanadianMacFan · · Score: 3, Informative

      It's their own collection of images from their archive of previous articles. There's no issue of copyright unless they distribute the images in a new way that's not covered in contracts.

    2. Re:The real question is by Anonymous Coward · · Score: 0

      I don't know the exact date, but surely anything older than a certain date is no longer even under copyright. I'd guess that anything older than 1922 (and possibly somewhat later) are going to be in the public domain.

      dom

  5. Okay, they're storing data on Google Cloud by Anonymous Coward · · Score: 0

    Which is, uh, great to know, But is there any information about whether and where we can see the collection? Or how they've decided to actually do the archival photographs/scans or whatever it is?

  6. Google has... issues... by jd · · Score: 1

    If their algorithms don't like the images, they're likely to lock the NYT out or delete the contents of the drive.

    I'm not sure there are any vendors out there I'd trust. I'd suggest buying a large number of commodity drives (they don't have to be perfect) and place them in groups of six to make a RAID 6 drive. It's not that expensive, especially compared to rebuilding the archive if/when things go wrong.

    If this is a do-once sort of thing, once it's done it's done and if it crashes it is lost forever, then the financial industry often uses mirrored RAID 6. It's not impossible for that to die, but it's very, very unlikely.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    1. Re:Google has... issues... by Anonymous Coward · · Score: 0

      I largely agree with this. But you'll have an easier time with scale-out cloud-type storage like on Ceph, OpenStack [Swift/Cinder?], SeaweedFS or MooseFS than a set of RAID6 arrays at this scale.

  7. Is it going to be available? Didn't say. by Geoffrey.landis · · Score: 1

    Which is, uh, great to know, But is there any information about whether and where we can see the collection?

    The article didn't say anything at all about making the collection available to the public.

    It only said that they were digitizing it. They may be doing this for their own use.

    --
    http://www.geoffreylandis.com
  8. Re: failing nyt by Anonymous Coward · · Score: 0

    How does it feel being so woke, and knowing there is nothing you can do to stop us?

  9. Historical Societies by ChrisMaple · · Score: 1

    Most towns have historical societies, many of which have boxes full of fading photographs. Digitizing those photos is something they should be doing. As a bonus, a lot of fading can be undone with free software.

    --
    Contribute to civilization: ari.aynrand.org/donate
    1. Re:Historical Societies by Anonymous Coward · · Score: 0

      I'd suggest doing it with old family photos as well.

  10. Not Archived? by Anonymous Coward · · Score: 0

    Not really backed up unless they are stored in 3 places on different continents.

    One in North America, One in Europe and one in Oceania possibly New Zealand.

  11. NYT changes the views by WillAffleckUW · · Score: 1

    If it ain't film it ain't real

    --
    -- Tigger warning: This post may contain tiggers! --
  12. Re: failing nyt by Anonymous Coward · · Score: 0

    Just for once I'd like to see a technical discussion on Slashdot that doesn't immediately become a cesspit for the insane asylum of US politics.

  13. Only the altered photos... by Anonymous Coward · · Score: 0

    After all, given the NYT's ownership, who wouldn't be surprised by a bit of digital historical revisionism?