I want to extract the embedded OLE objects in the RTF file. I prefer to implement it in Java, so I have read the doc & source code of Apache Tika RTFParser (1.25 & 2.0-ALPHA) and found that Tika just extracts text for search purpose and can't get the obj data. Perhaps it is feasible to write some code based on TextExtractor.
Then I also tried this post in C# but error on if (type != 3) // 3 is file, 1 is link
in PackagedObject.Extract.
Can anyone help me to figure out what is the simplest way (less code) to extract obj data in the RTF file? Cross-platform and server-side solution is preferred (Java or .net-core). But using Word. Application in C# is not considerable, because it depends on the Word client and sometimes unexpectedly terminated.