Skip to content

Carnegie Hall Archives maintains a series of small, portable scripts to expedite batch processes for quality control on our Digital Collections.

License

Notifications You must be signed in to change notification settings

CarnegieHall/quality-control

Repository files navigation

quality-control

OVERVIEW

Carnegie Hall Archives maintains a series of small, portable scripts to expedite batch processes for quality control on our Digital Collections.

These scripts have benefitted immensely from a wide community of archiving, preservation, and programming experts who share their code and troubleshooting techniques online. We are excited by the opportunity to participate in this community and have our methods improve through open collaboration and mutual exchange.

CONTENTS

SCRIPTS

Script Name Purpose
checksumValidation.py Compares two given checksum hashes and outputs log of pass/fail/missing
qa_cksum.sh Creates formatted output of md5, md5 create timestamp, filename, mime type, last modified timestamp
copyFilesFromList.py Copies files to a target directory based on filename identified in a CSV
matchvaluesfromlists.py Compares two lists based on shared value and outputs information about that value
md5Scrape.py Scrapes all .md5 sidecar files in a given directory and outputs information into a formatted CSV
reconcileList.py Compare two lists of files, and output CSV of non-matching values
embedCopyrightMetadata.sh Script to embed hardcoded Creator and Copyright Notice metadata using ExifTool
mediaconch-xmlreport-summary.py Script to print pass/fail counts when given a MediaConch XML report

DOCUMENTATION

USAGE AND LICENSE

USAGE GUIDELINES

This code is provided “as is” and for you to use at your own risk. The information included in the contents of this repository is not necessarily complete. Carnegie Hall offers the scripts as-is and makes no representations or warranties of any kind.

LICENSE

The MIT License (MIT)

Copyright (c) 2016 Carnegie Hall

All contents are released under the terms described in the MIT License included in this repository.

FEEDBACK

We plan to update the scripts regularly. CH Archives welcomes your thoughts, questions, and recommendations on our evolving quality control strategies.

ADD OR COMMENT ON TOPICS

Anyone is welcome to start a new topic ("issue") by selecting the Issues tab in GitHub and clicking the green New Issue button in the upper right.

All existing issues, open and closed, may be reviewed or commented upon in the Issues section.

CONTACT THE ARCHIVES

Email your thoughts to the Carnegie Hall Archives at [email protected] with the subject line Digital Collections: Quality Control.

About

Carnegie Hall Archives maintains a series of small, portable scripts to expedite batch processes for quality control on our Digital Collections.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published