• Home
  • About the unofficial footnote blog

The Unofficial Footnote Blog

An insider blog about the history website, Footnote.com

Feed on
Posts
Comments
« Free lectures for your genealogy society (using Skype)
1930 Census – WWII Collection? »

Newspapers at Footnote.com

Mar 29th, 2009 by sharbrough

Searching Newspapers

Let’s assume that you already know HOW to search within a title or a group of titles on Footnote. When you search for something in a newspaper, here’s how it works.

Search at Footnote looks at two kinds of information:

  • “Metadata”. Fields that were indexed digitally by Footnote can be searched using the way-cool searching and filtering functions. This is generally limited, in the case of newspapers, to the title, date, and place.
  • “OCR data”.  OCR is short for “optical character recognition.” I wish it produced perfect results – I wish that computers could read better than they do. A lot of OCR data is gibberish. But we do successfully read millions of words, and Footnote’s newspapers OCR is state of the art. They do old-fashioned fonts, they use several OCR schemes and take the best results, and for what they have to work with, the results are quite good. That said, OCR output is just a “text blob” for that page. It’s not broken down into fields like names, places, headlines, or even by article. You can still search this data, but only by using “keyword” search. Footnote does not routinely identify names in OCR text.

If you are looking for newspaper content, and you use any search filter other than keywords, you will find the results quite limited. On the other hand, if you want to look for the word “baseball” in the 8-April-1923 Chicago Tribune, you can do that quite easily. When the search results are displayed, the keywords are highlighted on the page – that’s pretty helpful as opposed to having to read the whole page to find your word.

When using Footnote’s viewer, if there is any OCR text for the image, you’ll see the “FIND” button in the viewer’s toolbar. That will show you any occurances of a word on that page, and is sometimes a really useful feature.

Summing it up

There are almost 3 million pages of newspapers on Footnote. They provide a great historical context for people and events. The OCR, search, and image quality are pretty impressive. If you haven’t tried them, you owe it to yourself to take 15 minutes and try it. It’ll cost you an hour, but that’s what Footnote does.

Pages: 1 2 3 4

Posted in Uncategorized

Comments are closed.

  • Recent Posts

    • Tufblog is moving
    • Enhanced Browsing available
    • Happy 3rd Birthday
    • Something special – Deseret Iron Co accounts
    • Titles updated on Footnote, week 2
    • Titles updated on Footnote, week 1
  • Categories

    • Business (4)
    • Content (11)
    • Events (5)
    • News (3)
    • tips (5)
    • Uncategorized (29)
  • Archives

    • June 2010 (1)
    • March 2010 (1)
    • February 2010 (1)
    • January 2010 (5)
    • December 2009 (11)
    • November 2009 (7)
    • October 2009 (2)
    • April 2009 (2)
    • March 2009 (3)
    • January 2009 (1)
    • November 2008 (1)
    • September 2008 (1)
    • August 2008 (1)
    • July 2008 (2)
    • March 2008 (1)
    • February 2008 (2)
    • January 2008 (1)
    • December 2007 (2)
  • About

    Profile
    The Unofficial Footnote Blog
    An insider blog about the history website, Footnote.com
    There are 45 Posts and 21 Comments so far.

  • Tags

    American History Business chicago symphony orchestra civil war enemy alien fbi case files Francis Sharbrough frederick stock Los Angeles new search Revolutionary War Southern Utah University SUU the orchestral association tips user meetings US History wwi

  • Pages

    • About the unofficial footnote blog
  • Meta

    • Log in
    • Valid XHTML
    • WordPress

The Unofficial Footnote Blog © 2010 All Rights Reserved.

MistyLook with 2 Sidebars