Digitization Standards

NDNP projects digitize newspapers primarily from preservation microfilm print negatives, which is more cost-effective than digitizing from print for large-scale digitization. The technical quality and bibliographic completeness of a title on microfilm are evaluated before a newspaper is selected for digitization. In some cases, we may choose to supplement the microfilm by digitizing missing or poorly filmed issues from print (if available).

The following is a summary of NDNP deliverables. For additional information, consult the NDNP technical specifications.


For each title:

  • Up-to-date MARC record from the CONSER OCLC database,
  • Additional title-level metadata related to the title run(s) digitized and delivered, and
  • Newspaper History Essay, which covers the scope, content, history, and significance of each title.


For each issue/edition:

  • Structural metadata for issues/editions digitized and organized by date.


For each newspaper page:

  • Page image in two raster formats:
    • Grayscale, scanned at the maximum resolution possible between 300 and 400 dpi, uncompressed TIFF 6.0
    • Same image, compressed as JPEG2000,
  • OCR text and associated bounding boxes for words, 1 file per page image,
  • PDF Image with Hidden Text, i.e., with text and image correlated,
  • Structural metadata to relate pages to title, date, and edition; to sequence pages within issue or section; and to identify associated image and OCR files, and
  • Technical metadata to support the functions of a trusted repository.
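
The per-page requirements above can be expressed as a simple checklist. The following is a minimal sketch, not an NDNP tool: the `PageDeliverable` record, the `check_page` function, and the file extensions used for the four per-page files are assumptions made for illustration, and the NDNP technical specifications remain the authority on actual file naming and validation.

```python
from __future__ import annotations

from dataclasses import dataclass

# Assumed extensions for the four per-page files (TIFF, JPEG2000, OCR, PDF).
# These names are illustrative; check the NDNP technical specifications.
REQUIRED_EXTENSIONS = {".tif", ".jp2", ".xml", ".pdf"}

# Resolution range stated in the deliverables summary.
MIN_DPI, MAX_DPI = 300, 400


@dataclass
class PageDeliverable:
    """Hypothetical record describing the files delivered for one page."""
    page_id: str
    extensions: set[str]      # extensions of the files present for this page
    dpi: int                  # scan resolution of the TIFF master
    grayscale: bool           # True if the TIFF is a grayscale image
    tiff_compressed: bool     # True if the TIFF uses any compression


def check_page(page: PageDeliverable) -> list[str]:
    """Return a list of problems; an empty list means the page passes."""
    problems = []
    missing = REQUIRED_EXTENSIONS - page.extensions
    if missing:
        problems.append("missing files: " + ", ".join(sorted(missing)))
    if not MIN_DPI <= page.dpi <= MAX_DPI:
        problems.append(
            f"resolution {page.dpi} dpi outside {MIN_DPI}-{MAX_DPI} dpi"
        )
    if not page.grayscale:
        problems.append("image is not grayscale")
    if page.tiff_compressed:
        problems.append("TIFF is compressed")
    return problems


if __name__ == "__main__":
    page = PageDeliverable("0001", {".tif", ".jp2", ".xml", ".pdf"}, 400, True, False)
    print(check_page(page) or "page passes all checks")
```

In practice the `dpi`, `grayscale`, and `tiff_compressed` values would be read from the image headers or the technical metadata rather than entered by hand; the sketch only shows how the listed requirements combine into a pass/fail check.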


For each microfilm reel digitized:

  • A second-generation (2N) duplicate silver negative microfilm, made from the camera master, to be deposited with the Library of Congress on completion of the award, and
  • Technical metadata concerning the quality characteristics of the film used for digitization, encoded in a METS object with other digital assets.