I am having problems uploading a ZIP file to a MarkLogic-based XQuery application and extracting its contents. The problem concerns accented characters in the filepath of one of the files in the ZIP.
Here is a sample ZIP that demonstrates the issue. It has an e-acute in the folder name:
Note that the ZIP was prepared on a Windows system, and the XQuery code is also running on Windows, within MarkLogic 10.
My website can upload the ZIP and read the manifest, but the manifest says the character is actually "‚", and when I try to extract the specific file in that folder I get a "file not found" error.
I figured out that of course 201A is NOT an e-acute in Unicode. So I tried converting "‚" to "©", and that still gave the same error, and I even tried "é" with the same result.
I am now thinking this is completely broken, and there appears to be no way I can extract files from a ZIP when there is an accented character in the path to the file (or the filename too I expect).
Can anybody help? I do not mind having to FIX paths if needed, but as I show above I have not even been able to achieve that.
Neil.