Blue Mountain Objects

Cliff Wulfman edited this page May 1, 2015 · 10 revisions

Blue Mountain is a database of digital objects: groupings of machine-readable files that together constitute a representation.

Journal Objects

A journal object will comprise the following elements:

title-level bibliography
an article-level prose description (bmtnid.tei.xml).
issues
one or more Issue Objects.
title-level metadata wrapper
encapsulates the prose description of the title as a whole together with the title-level descriptive metadata: a detailed, machine-readable description of the periodical as a whole. The descriptive metadata is initially encoded in MODS for compatibility with library systems, but is translatable into other formats (e.g., TEI).
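
To illustrate what "machine-readable" means for title-level MODS, here is a minimal sketch in Python that pulls a few fields out of a MODS record. The sample record, its title, and its identifier are invented for illustration and are not taken from Blue Mountain; only the MODS namespace and element names (titleInfo, title, identifier, originInfo, issuance) come from the MODS schema itself.

```python
import xml.etree.ElementTree as ET

# MODS version 3 namespace, as defined by the Library of Congress schema.
MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# Hypothetical title-level record: the title and identifier are invented.
SAMPLE_MODS = """\
<mods xmlns="http://www.loc.gov/mods/v3">
  <titleInfo>
    <title>An Example Periodical</title>
  </titleInfo>
  <identifier type="local">example-id</identifier>
  <originInfo>
    <issuance>serial</issuance>
  </originInfo>
</mods>
"""


def summarize_title_record(mods_xml: str) -> dict:
    """Return a few descriptive fields from a MODS record as a dict."""
    root = ET.fromstring(mods_xml)
    return {
        "title": root.findtext("mods:titleInfo/mods:title", namespaces=MODS_NS),
        "identifier": root.findtext("mods:identifier", namespaces=MODS_NS),
        "issuance": root.findtext(
            "mods:originInfo/mods:issuance", namespaces=MODS_NS
        ),
    }


if __name__ == "__main__":
    print(summarize_title_record(SAMPLE_MODS))
```

Because the fields are addressed by element name rather than by position in prose, the same record could in principle be mapped onto another format such as a TEI header.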

Issue Objects

Representations of periodical issues. Issue objects comprise the following:

preservation-quality images
high-quality TIFF files (‘master TIFFs’), produced according to local best practices and in conformance with the FADGI standards (http://www.digitizationguidelines.gov/guidelines/digitize-technical.html).
generative image derivatives
more manageable forms of the master TIFFs, meant to serve as the source for online deliverables, etc. Encoded in the JPEG2000 format, according to specifications described below.
delivery derivatives
images optimized for delivery over the World Wide Web.
issue-level descriptive metadata
a MODS document (see below).
text encodings
Initially these will be in the form of corrected OCR for each page, encoded in the ALTO schema (output by ABBYY via docWORKS). Future encodings will likely include TEI representations, derived from the ALTO documents, for detailed textual analysis.
deliverable text-under-image PDF
another ABBYY output format.
issue-level metadata wrapper
a METS document, the METS half of METS/ALTO. The structMap of this document links constituent-level items to the regions identified in the ALTO documents and to the page images. (See below for detailed specification.)
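
The linking role of the structMap can be sketched as follows. This is a simplified illustration, not the Blue Mountain profile: the fragment, its TYPE and LABEL values, and the file and block identifiers (ALTO0001, P1_TB00001, and so on) are all hypothetical. It assumes the common METS/ALTO pattern in which each area element names an ALTO file through FILEID and a region inside that file through BEGIN.

```python
import xml.etree.ElementTree as ET

# METS namespace, as defined by the Library of Congress schema.
NS = {"mets": "http://www.loc.gov/METS/"}

# Hypothetical fragment: one constituent whose text spans two ALTO pages.
SAMPLE_METS = """\
<mets xmlns="http://www.loc.gov/METS/">
  <structMap TYPE="LOGICAL">
    <div TYPE="Issue">
      <div TYPE="TextContent" LABEL="An Example Article">
        <fptr>
          <seq>
            <area FILEID="ALTO0001" BETYPE="IDREF" BEGIN="P1_TB00001"/>
            <area FILEID="ALTO0002" BETYPE="IDREF" BEGIN="P2_TB00003"/>
          </seq>
        </fptr>
      </div>
    </div>
  </structMap>
</mets>
"""


def constituent_regions(mets_xml: str) -> dict:
    """Map each labelled structMap div to its (ALTO file ID, region ID) pairs."""
    root = ET.fromstring(mets_xml)
    regions = {}
    for div in root.iterfind(".//mets:structMap//mets:div[@LABEL]", NS):
        # Only follow this div's own fptr children, so that areas belonging
        # to nested divs are not attributed to their ancestors.
        areas = div.iterfind("mets:fptr//mets:area", NS)
        regions[div.get("LABEL")] = [
            (area.get("FILEID"), area.get("BEGIN")) for area in areas
        ]
    return regions


if __name__ == "__main__":
    for label, areas in constituent_regions(SAMPLE_METS).items():
        print(label, areas)
```

Resolving each FILEID through the file section of the METS document would then yield the ALTO file, and the page image, that the region belongs to.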