-
-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
status command should show if OCR has completed #17
Comments
I think the only reliable way of telling if OCR has completed is to call But that's quite expensive, because it also returns the first page of JSON - which could be ~1MB of data. I think the most efficient way to do this would be to check the expensive API for completion of each job, but then to update the |
Another option: add a file called Even better: if we change the design of those JSON files to all live in the |
This is actually quite difficult.
It turns out the
textract-output/JOB_ID
folder is created, empty, early on in the process. Then files called1
and2
and so-on are added to it - but they're not all added at once, so the existence of files in that folder doesn't necessarily mean that the OCR process has completed for that job ID.The text was updated successfully, but these errors were encountered: