Document Type
Article
Publication Date
5-11-2017
Abstract
When faced with the prospect of manually uploading thousands of collection objects into our digital repository, I knew I needed a workflow to automate batch uploading. The resulting workflow takes a metadata spreadsheet containing thousands of rows and, using OpenRefine's templating tool, transforms it into a series of MODS XML records contained in one master file. The csplit command then splits the master file into thousands of fully formed MODS XML files. A Perl script batch renames these files to match their corresponding digital object files. The matched files can then be zipped and batch uploaded into Islandora. Each of the tools used in this process can be modified to enhance the existing workflows of any institution.
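The split, rename, and zip steps described in the abstract can be illustrated with a minimal Python sketch. It is not the author's csplit command or Perl script. It stands in for both and rests on assumptions the abstract does not state: that each record in the master file is one complete, unprefixed <mods>...</mods> element, and that each record carries the base filename of its digital object in an <identifier> element. The function names split_master and write_zip are hypothetical.

```python
import re
import zipfile

# Assumption: every record in the master file is one complete,
# unprefixed <mods>...</mods> element.
RECORD = re.compile(r"<mods\b.*?</mods>", re.DOTALL)

# Assumption: each record holds the base filename of its digital
# object in an <identifier> element.
IDENTIFIER = re.compile(r"<identifier\b[^>]*>([^<]+)</identifier>")


def split_master(master_text):
    """Return a dict mapping '<identifier>.xml' to the text of one MODS record."""
    files = {}
    for match in RECORD.finditer(master_text):
        record = match.group(0)
        found = IDENTIFIER.search(record)
        if found is None:
            raise ValueError("record has no <identifier> to name it by")
        name = found.group(1).strip() + ".xml"
        if name in files:
            raise ValueError(f"duplicate identifier: {name}")
        files[name] = record + "\n"
    return files


def write_zip(files, zip_path):
    """Write each record into a zip archive under its matched filename."""
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for name, text in files.items():
            archive.writestr(name, text)
```

Naming each file from a value inside the record, rather than from its position in the master file, is a design choice of this sketch: it keeps the metadata and object filenames matched even if the spreadsheet rows are reordered. The exact contents Islandora expects in a batch zip file depend on the local configuration and are not covered here.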
Recommended Citation
Bacon, Scott, "Business in the Front, Party in the Back: Revising Metadata Processes Up-Front to Benefit Back-End Workflows" (2017). Library Faculty Presentations. 1.
https://digitalcommons.coastal.edu/lib-fac-pres/1
Comments
Poster presented at the 2017 ALCTS Exchange.