-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Indexing issue: illegal character in path #3694
Comments
METS file cannot be read. This is another job for org.kitodo.dataformat.access.FLocatXmlElementAccess.getAndRepairUri(FileType file) |
I don't know how the |
With your change in #3698 I can even more illegal characters like normal white space. |
I assume the mistake was there before, only now you can see it for the first time. |
Sure. I don't know the reason nor the time when this illegal characters was "added". Maybe from a former migration (1.5.x to 1.6.x or so). Maybe I can fix this for our data but maybe even the application should handle this. |
@henning-gerhardt, could you make me a list of the illegal characters you found in paths and how the paths should look correct? |
There is no list and the list of illegal characters depends on many things like your used operation system, used file system and how you may interact with this kind of characters. All illegal characters ( |
Since we don't have a clear error pattern, I would answer your initial question that such errors have to be corrected locally outside the application. Should we still be able to obtain a clear error pattern in the future, which affects several installations, then we can of course also incorporate a correction function here. |
After migrating existing meta data files to new format with the provided transformation file and start indexing all the data this error appear in the
catalina.out
file:An excerpt from the meta data file of this process:
I don't know how this error is influencing the index operation. Should this fixed outside of the application or should the application handle this?
The text was updated successfully, but these errors were encountered: