Digitizing The New York Times archive with Google Cloud


First New York City subway, 1904. And there you have it. The morgue is what makes The Times The Times. There’s six hundred cabinets, a few thousand drawers. Six to eight million photographs dating from the late 1800s on till the 1990s.

This is the Flying Hunters. This ran in 1930. George Washington Bridge. France’s biggest naval ship. American soldier greeting his mom. Christmas at Penn Station. I mean there’s pretty much anything and everything. The history of the world through the eyes of The New York Times.

Allan: I didn’t think too much about it at first. Like, “Yeah, sure, we have photo archives.” And then I learned more about it, and there were, like, millions and millions of photos down there that, except for, like, one person, nobody really knows what’s hiding down there.

For every picture that we were able to publish, many never saw the light of day. The more that I work down in the morgue the more the real value and the real importance of having access to it dawned on me.

The exciting thing about this project is The New York Times has over a hundred years of photos locked in a basement, and this project will allow people to see photos that have never been seen before and make them accessible to the newsroom at The New York Times.

Allan: When I first heard that there was a possibility of us digitizing this, I was really excited.

So Google and The New York Times have had a partnership for many years. And we think it’s actually an ideal partnership to leverage the power of Google Cloud’s technology. What Google is bringing to the table is a lot of the infrastructure that The New York Times needs as well as providing the, sort of, platform level services on the Vision APIs.

Allan: The first part of the digitization process is obviously the physical scan of the photos. Getting them out of folders, getting them out of all these boxes and physically scanning them.

Jeff: So here’s the front of the picture, but the back of the picture is just as interesting.

Allan: The stamps, handwritten notes, etc. That tells us something about the photo, who took it, etc. That is the data we need to extract.

Nancy: These markers all over the back are the clues for where the picture was used. So here it was published in the newspaper at least twice. Here we can see the captions that were taped on the back indicating publication and along this top edge there’s a number. That is the indicator of where this photograph lives inside the morgue.

Samuel: We’ll upload them into tools, which will allow the photo editors to search the archive and bring up the images they need.

Allan: So once we’re done with this, it will enable the newsroom to immediately access our entire archive from their desktop.

Jeff: Once the pictures are digitized, I mean, everything old is new again.

Cornelius: We get the sense that covering current events is talking about what just happened. But having this resource available to reporters and editors gives them the ability to draw in all the context of what preceded it, the wider world that led to this contemporary event.

Nancy: There is nothing else, no other way of reporting what goes on in the universe that can do that the way a still photograph can.

Cornelius: The idea of telling stories in pictures is how society works now.

Samuel: For Google, our job is to make the world’s information universally accessible and useful. And in this project, we’re helping The New York Times with their data to be able to do that.

24 Comments

  1. Asim Khan said:

    Google w the sauce

    November 9, 2018
  2. PAlex388 said:

    How long will it take to get through all the photos?

    November 9, 2018
  3. Richard McDonald said:

    What's the music at 3:07? With the piano and clarinet.

    November 9, 2018
  4. Aamir Bilal said:

    Is this coincidentally timed when Flickr went all pro?

    November 9, 2018
  5. dn said:

    Imagine the karma one could farm with that

    November 9, 2018
  6. Yukai said:

    What type of flatbed scanner is being used here?

    November 9, 2018
  7. Theoria Apophasis said:

    2 million pounds of paper prints turned into ONE 20 gram thumb drive. #HellYeah

    November 10, 2018
  8. Animesh Sharma said:

    #sphinx

    November 10, 2018
  9. Gabriel Teruel said:

    Looks nice but it doesn't say anything about what exactly Google does. "Type of scanner" as someone asked, what the AI is for and many more things could have been said about your role here…

    November 12, 2018
  10. Veera Dsouza said:

    Hey Google,
    can you tell me about the security of our files in Google Drive?
    I ask because I have heard that most of the files aren't secure.

    November 12, 2018
  11. Linda Carmichael said:

    Amazing, but please wear gloves to protect the originals when handling to scan!

    November 12, 2018
  12. Michael-John Jennings said:

    Looks great… I would be very interested to hear what scanning tech is being used. Are they digitising prints only, no negs?

    November 13, 2018
  13. Joseph Allen said:

    holy freakin wow

    November 13, 2018
  14. John MacLean said:

    2:19 I hope they're not scanning with the lid up on all the scans. I assume this was for effect?

    November 14, 2018
  15. Ivan Lietaert said:

    I hope NYT is aware of the fact that the paper pictures, if stored correctly, will last much, much longer than Google… These pictures, once scanned, should be stored in a stable environment, somewhere deep under the ground, in a salt mine.

    November 14, 2018
  16. Oleg Marchenko said:

    Cool! It is a great idea!

    November 14, 2018
  17. Camera Perv said:

    It's scan-tastic! I'll show myself out.

    November 14, 2018
  18. Eddie Dennis said:

    This is a cool project. I hope they make the archives available to browse.

    November 15, 2018
  19. Pieter said:

    That is brilliant. Nice pictures by the way.

    November 15, 2018
  20. Simon Greenidge said:

    I wonder; do they have any original negatives? Ideally they would be scanning those. Perhaps not the oldest (negatives on the glass) but so much of their stuff must have been on 35 mm and larger format film negatives and slides.

    November 15, 2018
  21. Runy said:

    A genuine question: does NYT not have the negatives of those pictures? Why are they scanning the prints?

    November 15, 2018
  22. Gustavo Alcántara said:

    Cool

    November 16, 2018
  23. Tenet said:

    Very cool. Currently digitizing 3 suitcases worth of negative & positive slides my grandfather took in the 50-60s during his military service. Huge work but so fun to see the photos digitally.

    November 17, 2018
  24. Sim said:

    To all the geniuses wondering why they didn't use negatives: Google hires 1 in 500 people who apply there. You really think they overlooked it?

    December 3, 2018
