a digitization-project tracking system
- develop and deploy a minimal application that supports digitization project tracking and reporting
- reuse / enhance / extend existing infrastructure where possible
- develop new infrastructure where needed
- An
Archivist
generates aWork Order File
and delivers it to the appropriateDigitization Team
- The
Digital Content Manager
runs a script on theWork Order File
that creates entries in theFlow
system andUnit of Work
directories on the file system - A
Digitization Specialist
processes theUnit of Work
directory through the various digitization steps - As the
Unit of Work
is processed, aMonitoring Script
updates theUnit of Work
status in theFlow system
- The
Unit of Work
status is available via a web application: theFlow Web UI
Digital Content Managers
andProject Managers
use theFlow Web UI
to trackUnit of Work
statuses
- An
Archivist
uses the ArchivesSpace UI to select theItem
s they want digitized - The
Archivist
generates aWork Order File
using the ArchivesSpace work-order plugin - The
Archivist
delivers theWork Order File
to theDigitization Team
- The
Digital Content Manager
runs theWork Order Processor
script- for each line in the
Work Order File
the script:- creates a
Digitization ID
and stores that in aSource Entity
(se
) record inFlow
- creates a
Unit of Work
directory, using theDigitization ID
as the directory name
- creates a
- for each line in the
- The
Digital Content Manager
assigns theUnit of Work
to aDigitization Specialist
- The
Digitization Specialist
digitizes theItem
, placing the digital-object files into theUnit of Work
directory - The
Digitization Specialist
moves theUnit of Work
directory to theQC
directory - The
Digital Content Manager
performs a Quality Control check on theUnit of Work
- if the
Unit of Work
passes the Quality Control check, theDigitization Manager
moves theUnit of Work
to theToServer
directory - if the
Unit of Work
fails the Quality Control check, theDigitization Manager
moves theUnit of Work
to theDoubleCheck
directory
- if the
- A
Directory Monitoring Script
watches various directories and updatesUnit of Work
statuses using theDigitization ID
s as a key
- An
Archivist
uses the ArchivesSpace UI to select theItem
s they want digitized - The
Archivist
generates aWork Order File
using the ArchivesSpace work-order plugin- the
Work Order File
is in atab separated values
format and contains one line perItem
- the
- The
Archivist
delivers theWork Order File
to the appropriateDigitization Team
- The
Digital Content Manager
runs theWork Order Processor
script that:- gets a list of
partners
via thersbe::client
gem - asks the
Digitization Manager
to select thepartner
- gets a list of
collections
that belong to the selectedpartner
via thersbe::client
gem - asks the
Digitization Manager
to select thecollection
- processes the
Work Order File
line-by-line- for each line in the
Work Order File
theWork Order Processor
script:- parses the line to extract the relevant information, e.g., the
Component Unique Identifier (cuid)
, andArchival Object URI
[3] - generates the
Digitization ID
per the following template:<partner code>_<collection code>_<cuid> with '.' replaced with '_'>
- e.g.,
Given partner_code = 'fales' collection_code = 'gcn' cuid = '231.1234' Then digitization_id = 'fales_gcn_231_1234'
- instantiates an
se
object that belongs to the selectedcollection
and saves it toFlow
using thersbe::client
gem - checks the status of the save operation
- creates the
Unit of Work
directory on theatkins-SAN
volume using theDigitization ID
for the directory name
- parses the line to extract the relevant information, e.g., the
- for each line in the
- gets a list of
- The
Digital Content Manager
assigns theUnit of Work
to aDigitization Specialist
- The
Digitization Specialist
moves theUnit of Work
directory to theProcessing
directory and waits 5 minutes to allow theDirectory Monitoring Script
to update the status inFlow
- The
Digitization Specialist
moves theUnit of Work
directory to their local machine - The
Digitization Specialist
digitizes theItem
, placing the digital-object files into theUnit of Work
directory - The
Digitization Specialist
moves theUnit of Work
directory to theQC
directory- the
Directory Monitoring Script
runs and:- for each
Unit of Work
directory in theQC
directory- looks up
se
inFlow
using theDigitization ID
- sets the
se
status toQC
- looks up
- for each
- the
- The
Digital Content Manager
QCs theUnit of Work
-
if the
Unit of Work
passes QC, theDigitization Manager
moves theUnit of Work
to theToServer
directory- the
Upload Manager Cron Job
runs and:- for each
Unit of Work
directory in theToServer
directory- looks up the Flow
se
inFlow
using theDigitization ID
- runs automated quality control checks on the
Unit of Work
- if the automated checks fail, the
Unit of Work
is moved into theDoubleCheck
directory
- if the automated checks fail, the
- looks up the Flow
- if the automated checks pass, the
Unit of Work
is packaged using the Flowse
partner
andcollection
and uploaded toR*
- if packaging and upload pass, the
Unit of Work
directory is moved to theUploadOK
directory - if packaging and upload fail, the
Unit of Work
directory is moved to theUploadFail
directory
- if packaging and upload pass, the
- for each
- the
-
if the
Unit of Work
fails QC, theDigitization Manager
moves theUnit of Work
to theDoubleCheck
directory -
every 5 minutes the
Directory Monitoring Script
runs and checks each monitored directory, updatingFlow
based on whichUnit of Work
directories are in each monitored directory.
-
(see Pivotal Tracker project)
- add
work order uuid
to ArchivesSpace work-order plug-in