Questions tagged [moses]

A statistical machine translation system.

For more details see the Homepage.

53 questions
5
votes
1 answer

How to tune a Machine Translation model with huge language model?

Moses is a software to build machine translation models. And KenLM is the defacto language model software that moses uses. I have a textfile with 16GB of text and i use it to build a language model as such: bin/lmplz -o 5 text.arpa The…
alvas
  • 115,346
  • 109
  • 446
  • 738
5
votes
2 answers

Installing Moses Translation Software. Error message: "ld: library not found for -lboost_thread"

I am installing the Moses Translation Software on my Mac OS X 10.9.5 with Xcode 6.1. The instructions say that I need g++ and Boost installed. Once I do that, I git clone, "cd" into the directory, and then type ./bjam -j8. First, I verified I have…
ComputerScientist
  • 936
  • 4
  • 12
  • 20
5
votes
1 answer

cannot compile moses(undefined reference to boost:: ...)

I am trying to compile moses(machine translation tool). I specified the location of boost when compiling as follows. ./bjam --with-boost=/home/xhotsuki/bin/boost_1_56_0 -j8 But I got many errors something like this. ...failed gcc.link…
hitochan
  • 1,028
  • 18
  • 34
4
votes
1 answer

undefined reference to `gzopen' in moses

I'm currently trying to install moses script, part of moses the machine translation tool, but keep getting error: /home/moses/scripts/training/lexical-reordering/reordering_classes.cpp:349: undefined reference to…
ndriks
  • 359
  • 6
  • 16
4
votes
1 answer

Getting Error "stdbuf was not found; communication with perl may hang due to stdio buffering." when using when using MosesTokenizer with Python

problem arises when using mosestokenizer (https://github.com/luismsgomes/mosestokenizer, installed through pypi https://pypi.org/project/mosestokenizer/) on windows platform (tried multiple windows machines). Although there was no new releases…
Simon
  • 55
  • 3
4
votes
1 answer

What's the keyword mutable in the Moses source code for?

This is about the source code of the statistical machine translation system Moses. In the Factor class of the Mosesdecoder project, there is this strange use of the keyword mutable: class Factor { __SOME_OTHER_CODE__ // FactorCollection…
Yuhuan Jiang
  • 2,616
  • 2
  • 19
  • 21
4
votes
1 answer

Bad Result And Evaluation From Giza++

I have tried to work with giza++ on window (using Cygwin compiler). I used this code: //Suppose source language is French and target language is English plain2snt.out FrenchCorpus.f EnglishCorpus.e mkcls -c30 -n20 -pFrenchCorpus.f …
m-Abrontan
  • 503
  • 4
  • 7
3
votes
1 answer

What does ($pre =~ /\./ && $pre =~ /\p{IsAlpha}/) mean in the Moses Tokenizer?

Moses Tokenizer is the tokenizer widely used in machine translation and natural language processing experiments. There is a line of regex that checks for: if (($pre =~ /\./ && $pre =~ /\p{IsAlpha}/) || ($NONBREAKING_PREFIX{$pre} &&…
alvas
  • 115,346
  • 109
  • 446
  • 738
3
votes
0 answers

moses train-model.perl script error, --lm factor:order:filename requied

when I run: $MOSES/scripts/training/train-model.perl -hierarchical -ghkm -external-bin-dir /home/zhanwang/giza-pp/tools/ -root-dir . --corpus corpus/nl2mr --f mr --e nl showed: ERROR: use --lm factor:order:filename to specify at least one…
Johnny
  • 31
  • 1
3
votes
1 answer

Why such a bad performance for Moses using Europarl?

I have started playing around with Moses and tried to make what I believe would be a fairly standard baseline system. I have basically followed the steps described on the website, but instead of using news-commentary I have used Europarl v7 for…
scozy
  • 2,511
  • 17
  • 34
3
votes
1 answer

Documentation of Moses (statistical machine translation) mose.ini file format?

Is there any documentation of the moses.ini format for Moses? Running moses at the command line without arguments returns available feature names but not their available arguments. Additionally, the structure of the .ini file is not specified in the…
Kwame
  • 752
  • 7
  • 19
3
votes
1 answer

Statistical Machine Translation from Hindi to English using MOSES

I need to create a Hindi to English translation system using MOSES. I have got a parallel corpora containing about 10000 Hindi sentences and corresponding English translations. I followed the method described in the Baseline system creation page.…
AvinashK
  • 3,309
  • 8
  • 43
  • 94
3
votes
4 answers

Need to split Unicode string

I am using the moses toolkit for my translation system. I am using Assamese and English parallel corpus and trained them. But some proper nouns are not translated. This is because I have a very small corpus (parallel data set). So I want to use the…
2
votes
1 answer

What do the counts in giza++ phrase-table mean?

I have been searching for the meaning of numbers in giza++ phrase-table output within the official website (and pdf manual): http://www.statmt.org/moses/?n=FactoredTraining.ScorePhrases And this is what I've come up to. Let's say this is a line from…
2
votes
0 answers

I interrupted a clone operation. How can I clone again without errors?

I ran git clone https://github.com/moses-smt/mosesdecoder.git and then pressed Ctrl + C. I then tried to clone again, but I got the following error: fatal: destination path 'mosesdecoder' already exists and is not an empty directory How can I…
Thuy Nguyen
  • 353
  • 2
  • 10
1
2 3 4