How to get docWorks output into Veridian

Clifford Wulfman edited this page Jun 20, 2014 · 2 revisions

Ingesting METS/ALTO from DDD

Create a tarball of the delivery in diglibdata1/ddd/bluemountain/deliveries.

This tarball is the backup of exactly what DDD delivered. You're going to be moving files into different places next, so you want to be able to get back to the original delivery state.

For a given batch BATCH_X:

  1. Log onto diglibdata1 as the ddd user.
  2. cd /ifs/ddd/bluemountain/workflow/4_final_output_from_ddd
  3. tar czvf /ifs/ddd/bluemountain/deliveries/batch_x.tgz BATCH_X
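
The three steps above can be wrapped in a small function that also checks that the archive reads back after it is written. This is only a sketch: the helper name backup_delivery and its arguments are invented for illustration, and the lowercase archive name simply follows the batch_x.tgz pattern shown above.

```shell
# Sketch only: backup_delivery is a hypothetical helper, not an existing tool.
# Usage: backup_delivery BATCH_NAME SOURCE_DIR DEST_DIR
# e.g.   backup_delivery BATCH_X \
#            /ifs/ddd/bluemountain/workflow/4_final_output_from_ddd \
#            /ifs/ddd/bluemountain/deliveries
backup_delivery() {
    batch=$1
    src_dir=$2
    dest_dir=$3
    # Archive name follows the batch_x.tgz pattern: lowercased batch name.
    archive="$dest_dir/$(printf '%s' "$batch" | tr '[:upper:]' '[:lower:]').tgz"

    # Refuse to overwrite an existing backup of a delivery.
    if [ -e "$archive" ]; then
        echo "refusing to overwrite $archive" >&2
        return 1
    fi

    # -C switches to the source directory so the archive stores relative paths.
    tar -czf "$archive" -C "$src_dir" "$batch" || return 1

    # Listing the archive confirms that it can be read back.
    tar -tzf "$archive" > /dev/null
}
```

Refusing to overwrite is a deliberate choice here, since the tarball is meant to be the record of exactly what DDD delivered.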

Copy binary data into bluemountain repository

  1. rsync -avz /ifs/ddd/bluemountain/workflow/4_final_output_from_ddd/BATCH_X/astore/ /ifs/bluemountain/astore/
  2. rsync -avz /ifs/ddd/bluemountain/workflow/4_final_output_from_ddd/BATCH_X/pstore/ /ifs/bluemountain/pstore/
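
Because both commands differ only in the store name, they can be expressed as a loop, with an optional rsync --dry-run pass to preview what would be copied. The helper name sync_stores and its argument order are assumptions made for this sketch.

```shell
# Sketch only: sync_stores is a hypothetical helper name.
# Usage: sync_stores BATCH_DIR REPO_DIR [--dry-run]
# e.g.   sync_stores /ifs/ddd/bluemountain/workflow/4_final_output_from_ddd/BATCH_X \
#            /ifs/bluemountain --dry-run
sync_stores() {
    batch_dir=$1
    repo_dir=$2
    # Optional extra rsync flag; left unquoted below so an empty value disappears.
    extra=${3:-}
    for store in astore pstore; do
        # Trailing slashes copy the directory contents, not the directory itself.
        rsync -avz $extra "$batch_dir/$store/" "$repo_dir/$store/" || return 1
    done
}
```

Running once with --dry-run and then again without it lets you inspect the file list before anything is written to the repository.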

Copy metadata into git repository

There is no git on diglibdata1, so you'll have to do this from a different machine (perhaps libserv14).

You may need to convert all the DOS CR/LF line endings to Unix LF endings first; use the dos2unix command on Linux: find . -type f -exec dos2unix {} +
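
To find out whether conversion is needed at all, you can first list the files that contain a carriage return. This is a minimal sketch using find and grep; the helper name list_crlf_files is made up for illustration.

```shell
# Sketch only: list_crlf_files is a hypothetical helper name.
# Prints every regular file under DIR (default: current directory)
# that contains at least one carriage return.
list_crlf_files() {
    dir=${1:-.}
    cr=$(printf '\r')
    # grep -l prints only the names of matching files; .git is pruned so
    # repository internals are not scanned.
    # grep exits non-zero when nothing matches, which is not an error here,
    # so the exit status is deliberately ignored.
    find "$dir" -type d -name .git -prune -o -type f -exec grep -l "$cr" {} + || :
}
```

An empty result suggests the files already use Unix line endings.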

  1. git checkout -b batch_x
  2. (get the stuff in there, probably using rsync)
  3. git add -A
  4. git commit -m"Adds metadata for batch X to repository."
  5. git push -u origin batch_x
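
The steps above could be scripted roughly as follows. This is a sketch under several assumptions: the helper names branch_for_batch and commit_batch are invented, the metadata is copied with rsync (the procedure only says "probably"), and the destination inside the working tree defaults to the repository root because the original does not say where the files belong.

```shell
# Sketch only: branch_for_batch and commit_batch are hypothetical helpers.

# Derive the branch name from the batch directory name (BATCH_X -> batch_x).
branch_for_batch() {
    printf '%s\n' "$1" | tr '[:upper:]' '[:lower:]'
}

# Usage: commit_batch BATCH_NAME METADATA_SRC_DIR [DEST_IN_REPO]
# Run from the root of the metadata repository's working tree.
commit_batch() {
    batch=$1
    src=$2
    dest=${3:-.}   # assumption: the original does not name the target directory
    branch=$(branch_for_batch "$batch")

    git checkout -b "$branch" || return 1
    # Assumption: rsync is used to bring the metadata into the working tree.
    rsync -av "$src/" "$dest/" || return 1
    # Normalize line endings before staging; .git is pruned from the search.
    find . -type d -name .git -prune -o -type f -exec dos2unix {} + || return 1
    git add -A || return 1
    git commit -m "Adds metadata for $batch to repository." || return 1
    git push -u origin "$branch"
}
```

Each step returns early on failure so that a failed copy or conversion never reaches the commit.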

Copy data to server machine (libserv14)

Copy metadata

  1. git pull
  2. git checkout batch_x

Copy astore

As root(?)

  1. rsync -avz --log-file=/tmp/batch_x_astore.log /mnt/ddd/bluemountain/workflow/4_final_output_from_ddd/BATCH_X/astore/ /mnt/data/projects/bluemountain/astore/
  2. Change permissions.
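
Step 2 does not say which permissions are required. As a sketch, assuming the aim is to make the copied files readable by whichever account serves them, one common convention is mode 755 on directories and 644 on files. Both the modes and the helper name open_read_access are assumptions, not documented project settings.

```shell
# Sketch only: the modes below are an assumed convention, not the
# project's documented settings. open_read_access is a hypothetical name.
# Usage: open_read_access DIR
# e.g.   open_read_access /mnt/data/projects/bluemountain/astore
open_read_access() {
    dir=$1
    # Directories need the execute bit so that they can be traversed.
    find "$dir" -type d -exec chmod 755 {} + || return 1
    # Regular files: read/write for the owner, read-only for everyone else.
    find "$dir" -type f -exec chmod 644 {} +
}
```

If ownership also has to change, that would need a separate chown step with the appropriate user, which the original does not specify.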

Ingesting Locally-Produced METS/ALTO

Step 1: Copy output files from lib-docworks1 to diglibdata1 (Isilon).

One method: mount [email protected] on one of the docWorks workstations, then copy from lib-docworks1/OUT.

Then rsync them to the bluemountain/ partition:

  • rsync the astore/
  • rsync the pstore/

Step 2: Merge metadata into GitHub tree. You will probably have to convert all the DOS CR/LF line endings to Unix LF endings; use the dos2unix command on Linux:

  • find . -type f -exec dos2unix {} +
  • git branch localnew
  • git checkout localnew
  • (get the stuff in there, probably using rsync)
  • git add --all .
  • git commit
  • git push -u origin localnew

Then, on libserv14, as the dlconsulting user:

  • git pull
  • git checkout localnew

Step 3: Prepare for ingest on libserv14

  • Update the git repository (do this as the dlconsulting user)
  • Copy astore files from diglibdata1/bluemountain/astore to /usr/share/BlueMountain. You must do this as root and then change permissions. For example,
    • cd /usr/share/BlueMountain/astore/periodicals/
    • [root@libserv14 periodicals]# rsync -avz /mnt/ddd/bluemountain/deliveries/batch05/Niederrheinische_Musik-Zeitung/astore/periodicals/bmtnabi .