Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add shell script to verify output directory structure #52

Open
wants to merge 39 commits into
base: main
Choose a base branch
from

Conversation

thobson88
Copy link

A shell script to compare the directory structure of input & output directories, providing a partial verification that all input data has been processed by alto2txt.

Usage:

./alto2txt-verify.sh /path/to/alto2txt_input/ /path/to/alto2txt_output/

It produces a new directory named alto2txt-verify containing the results of the verification process.

@griff-rees
Copy link
Collaborator

Looks ok to me, but was hoping to have a unittest alongside. I've got a partial means of adding that but still getting my head around the existing unittests. I can add some suggestions as another commit so you can see where I've gotten if you like?

spool and others added 26 commits November 29, 2022 19:55
METS and ALTO are artefacts of the digitisation process that link metadata and transcribed text to images of the physical page; born-digital newspapers don't use the same formats. So it's a little clearer and it might also aid discoverability.
- Fix conflict in README.md
- Add $ and console for command line examples in README.md
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants