Questions tagged [ruta]

Apache UIMA Ruta is a scripting language for natural language processing.

Apache UIMA Ruta (formerly called TextMarker) consists of a rule-based script language and an Eclipse-based workbench.

279 questions
9
votes
1 answer

Uima Ruta Out of Memory issue in spark context

I'm running an UIMA application on apache spark. There are million of pages coming into batches to be processed by UIMA RUTA for calculation. But some time i'm facing out of memory exception.It throws exception sometime as it successfully process…
Gaurav
  • 139
  • 1
  • 16
6
votes
2 answers

Java API for running UIMA Ruta scripts

I am new to UIMA Ruta. I made some annotators using scripting language. I am able to run them within EclipseIDE. I want to write a JAVA API to automatically run scripts on the input provided. I am using the same example project provided in UIMA…
Anshul
  • 83
  • 8
5
votes
1 answer

How to create an AnalysisEngineDescriptor from an uima-ruta script to use in a SimplePipeline

I'm not able to run an uima ruta script in my simple pipeline. I'm working with the next libraries: Uimafit 2.0.0 Uima-ruta 2.0.1 ClearTK 1.4.1 Maven And I'm using a org.apache.uima.fit.pipeline.SimplePipeline with: SimplePipeline.runPipeline( …
German Attanasio
  • 22,217
  • 7
  • 47
  • 63
4
votes
1 answer

Fuziness In UIMA ruta

Is there any option of fuzziness in case of word matching, or ignoring some special cases. For ex: STRINGLIST AMIMALLIST = {"LION","TIGER","MONKEY"}; DECLARE ANIMAL; Document {-> MARKFAST(ANIMAL, AMIMALLIST, true)}; I need to match words with…
Gaurav
  • 139
  • 1
  • 16
4
votes
1 answer

Document is ambiguous, use one of the following instead : org.apache.uima.ruta.type.Document uima.tcas.DocumentAnnotation

I'm using Ruta annotation framework for annotating the input text previously I was using Ruta script from classpath. But according to client requirement we have to move out Ruta script outside the code all this need to be decouple from the system.…
Gaurav
  • 139
  • 1
  • 16
4
votes
1 answer

Maximum size for a single Wordlist-UIMA RUTA

What is the maximum size for a wordlist in Uima Ruta? Because I want to store list of countries, states and cities name.
4
votes
0 answers

How to access a string array child tag in UIMA-RUTA?

We are having trouble to retrieve a data from a XMI file. The following excerpt illustrates an example of what we are trying to do:
Cogroo
  • 41
  • 1
4
votes
1 answer

UIMA RUTA : regular expression in WORDLIST

Is there any way to have regular expressions in WORDLIST? I need to implement the same as mentioned in https://issues.apache.org/jira/browse/UIMA-3382. Or is there any alternate way to resolve it? EDIT : WORDLIST is defined as a list of text items.…
user3778893
  • 125
  • 5
3
votes
0 answers

Ruta process taking long time in Java Workspace - Uima

I tried to mark paragraph which endswith space. In Ruta workspace SpaceBeforeEnter rule ran quickly. But in Java workspace same rule taking more time to excecute. Used versions as follows, uimaj-core version =>2.10.2, ruta-core version =>2.8.1 and…
3
votes
1 answer

String Concatenation in Ruta

does somebody know what is wrong with my String Concatenation in Ruta? FOREACH (d) IngredientConcept{} { d{->CREATE(Entity, "label"="Drug", "value"= d.conceptID + "_" + d.dictCanon)}; } Caused by:…
3
votes
1 answer

UIMA ruta - Using annotations from different views

I have a document, after few annotations, i am writing it into a new view using HTMLConverter Sample Input:

[1] SJ.…

3
votes
2 answers

Whether sequencing is possible-uima ruta

Whether sequencing is possible in uima ruta. For example: Input File: some text Fig 1.1 Table 1.1 Fig 1.2 some text Pic 1.2 Table 1.2 some text Table 1.3 Pic 1.3 some text Fig 1.4 some text Table 1.4 some text Table 1.5 Fig 1.6 Box 1.1 Fig 1.5 How…
3
votes
1 answer

Wordlist -uima ruta

Actually I used some CITY names and PUBLISHERS names in Wordlist.In my understanding, Wordlist will annotate all occurrences of any list item in a document.But I found a problem,that number of occurence was increased or decreased when I changed the…
3
votes
1 answer

Html Annotator,Html Converter in Uima Ruta

Can anyone briefly explain about the Html annotator, Html converter and TEIViewWriter with some examples.I want to create annotations in the initial view. Awaiting for the Answer. Main Script: PACKAGE uima.ruta.example; SCRIPT…
3
votes
1 answer

uima ruta Score Condition

I tried a Script to mark the Journal using Score Condition. W{REGEXP("Journal",true)->MARK(ONLY_Journal)}; W{REGEXP("Retraction|Retracted")->MARK(RETRACT)}; W{REGEXP("Suppl")->MARK(SUPPLY)}; NUM {->MARK(VOLUMEISSUE,1,6)}LParen NUM …
1
2 3
18 19