-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
workspace download: also traverse dependent file groups? #412
Comments
To clarify: For a PAGE URL |
No, not quite (I think). After downloading a PAGE-XML, its (original and derived) image references could be relative paths (and then instead of replacing them with a URL by prepending |
The difficult part is how to download those references, if they are relative file URL (i.e. were produced by OCR-D before). We do have now support for both local and remote URL #1079 but that is not widely used yet and even if it was, it's unlikely that OCR-D users would expose the intermediary results via URL. The only way around this restriction is if the remote workspace is available as OCRD-ZIP, in which case we assume that all the referenced image should be in the workspace. AFAIK nobody except us is using So unless I'm mistaken, there is no good way to solve this. |
When I want to download a PAGE-XML from remote, it would be very helpful if core would also download all the files referenced in
/PcGts/Page/@imageFilename
and*/AlternativeImage/@filename
. Is this feasible?The text was updated successfully, but these errors were encountered: