Questions tagged [treetagger]

The TreeTagger is a tool for annotating text with part-of-speech and lemma information.

It was developed by Helmut Schmid in the TC project at the Institute for Computational Linguistics of the University of Stuttgart.he TreeTagger is a program developed by Helmut Schmid at the University of Stuttgart (now at the University of München), for part-of-speech tagging and lemmatization. Language models (known as “parameters”, file extension .par) are supplied on the TreeTagger webpage for using the program with texts in English, French, German, Italian, Spanish, Russian, Bulgarian, Dutch, Estonian, Finnish, Galician, Latin, Mongolian, Polish, Slovak and Swahili, and models for some other languages are available from sites linked to the TreeTagger webpage. For a language for which no model exists, it is necessary to hand-tag some text, and then run a training program (provided with the TreeTagger) to create the model.

40 questions
5
votes
1 answer

Python - error: could not create 'treetagger.egg-info': Permission denied

I tried to install a python module (treetager-python) by using the command: python3 setup.py install When trying this, I get the error: running install running bdist_egg running egg_info creating treetagger.egg-info error: could not create…
MBT
  • 21,733
  • 19
  • 84
  • 102
5
votes
2 answers

Error: inreplace failed while installling with homebrew?

I would like to install treetagger in OS X. In order to make it easier I tried to search if this is possible with Homebrew. So I look at the web and found this formula from pepijnkokke user. Next, I tried to install treetagger as…
tumbleweed
  • 4,624
  • 12
  • 50
  • 81
3
votes
4 answers

TreeTagger in R

I have downloaded TreeTaggerv3.2 for Windows and have configured it per the install.txt. I am trying to use it in R with koRpus package. I have set the kRp.env as - set.kRp.env(TT.cmd="C:\\TreeTagger\\bin\\tag-english.bat", lang="en", …
vivsingh
  • 41
  • 1
  • 5
3
votes
1 answer

On using Lingua-TreeTagger-0.06

I am very, very new to NLP and the like. Therefore, I have a very basic question. I want to POS-tag an corpus of files with TreeTagger using a Mac OSX 10.6.8. I have installed TreeTagger by using the instructions provided at…
2
votes
1 answer

UIMA Ruta run errors using DKProCore Part-of-Speech Tagger (German Novels tutorial)

When following this tutorial on UIMA Ruta for German Novels step by step, I get the following error upon running Main.ruta: *SEVERE: Exception occurred org.apache.uima.analysis_engine.AnalysisEngineProcessException (...) Caused by:…
2
votes
3 answers

Chunking with Python-Treetaggerwrapper

The Treetagger can do POS-tagging as well as text-chunking, which means extracting verbal and nominal clauses, as in this German example: $ echo 'Das ist ein Test.' | cmd/tagger-chunker-german reading parameters ... tagging ... …
1
vote
1 answer

How to use TreeTagger in Google Colab?

i want to use TreeTagger module to tag POS-information on the raw corpus. As it seems to be faster to use GPU via Google Colab, I installed TreeTagger module, but Colab codes cannot locate TreeTagger directory. The error type is like…
Arktis Son
  • 11
  • 1
1
vote
1 answer

How does TreeTagger get the lemma of a word?

I am using TreeTagger to get the lemmas of words in Spanish, but I have observed there are too much words which are not transformed as should be. I would like to know how this operations works, if it is done with techniques such as decision trees or…
1
vote
0 answers

koRpus package utilising TreeTagger within R (RStudio) on macOS

im exploring the koRpus package in R on macOS, in trying to use the treetag function on the following object: text [1] "Because I could not stop for Death -" "He kindly stopped for me -" [3] "The Carriage held but just Ourselves -" "and…
1
vote
1 answer

Using TreeTagger with Python : can't find the Treetagger bin

I'm trying to use TreeTagger with Python. I followed these steps for the installation : treetagger-python miotto TreeTagger works well when I'm using it from a command prompt but when I'm trying to launch it from Python, here's what I have…
Marine9357
  • 41
  • 1
  • 6
1
vote
1 answer

Treetag package koRpus R

First, I'm sorry for my English, I speak very bad English. So, I have a question about treetag function from koRpus package on R. I want to lemmatized a text and it's ok for me with Treetagger. I used the treetag function : treetag(file,…
1
vote
1 answer

Bash: Extract cells from output formatted as table

I am using TreeTagger (http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/) to extract nouns from a text. My problem is that the output is formatted as such: word pos lemma The DT the TreeTagger NP TreeTagger is …
Sulli
  • 763
  • 1
  • 11
  • 33
1
vote
1 answer

Batch : how to run a program for all files in a folder and how to create an output file for the result using the original filename?

I'm a noob at batch programming so excuse me for the simple question. I have a folder called FSC in C:\TTG\FSC, with a thousand and thousand of text files called like 1_A.txt 2_A.txt etc. I would like to run a programm and have its output in…
KeyPi
  • 516
  • 5
  • 20
1
vote
0 answers

string replacement on koRpus treetag

The function tokenize("C:\\test_treetag.txt", clean.raw=list("aren't"="are not")) allows me to tokenize and replace "aren't" by the tokens "are not" I'm trying to do the same in the function in the treetag function.
1
vote
1 answer

Optimizing function computation in a pandas column?

Let's assume that I have the following pandas dataframe: id |opinion 1 |Hi how are you? ... n-1|Hello! I would like to create a new pandas POS-tagged column like this: id| opinion |POS-tagged_opinions 1 |Hi how are you?|hi\tUH\thi …
tumbleweed
  • 4,624
  • 12
  • 50
  • 81
1
2 3