using uima ruta 2.7.0
DECLARE Substance;
WORDLIST EnzymeSearchList = 'enzyme.txt';
Document{-> MARKFAST(Substance, EnzymeSearchList, true)}; // true ignores case
enzyme.txt contains ~ 16.000 entries (=lines)
If I use a file containing few entries, for example 5, my further rules work without any problem. Once I provide the full list of thousands of entries, my results are incomplete.
Can be the issue caused by reaching WORDLIST limit? Or heap maybe? Nothing fails upon program execution.
I have found a thread specifically stating
There is no maximum size for the wordlists in UIMA Ruta. ... My largest wordlist consisted of about 500k entries