HTRC Workset Toolkit Header/Footer Extractor

HTRC_Logo_240px.png

URL

View the repo for this project:
https://github.com/htrc/HTRC-WorksetToolkit

Description

The Workset Toolkit Header/Footer Extractor was developed as additional functionality of the HTRC Workset Toolkit. The header/footer extractor allows users to remove running headers/footers from volumes downloaded from the HathiTrust to a HTRC data capsule. The header/footer extractor is written in Python and can be run as a flag with the Workset Toolkit in the command line.

Client

HathiTrust Research Center

Services

Text Analysis

Start Date

Aug 2020

End Date

Apr 2021