Questions tagged [lemur]

The Lemur Project develops search technologies with the collaboration of several academic institutions. One of their most famous/popular products is the Indri search engine.

The Lemur Project develops search technologies with the collaboration of several academic institutions. One of their most famous/popular products is the Indri search engine. http://www.lemurproject.org/

17 questions
2
votes
1 answer

Using LDA in Galago search engine

I have started to use Galago for document retrieval. I want to cluster some documents (initially retrieved documents with any model) using LDA. I prefer to use a java-based implementation that can be integrated into my code using Galago. I'd…
Magen
  • 23
  • 4
2
votes
1 answer

Get vocabulary list in Galago

I am using Galago retrieval toolkit (a part of the Lemur project) and I need to have a list of all vocabulary terms in the collection (all unique terms). Actually I need a List or Set I really appreciate to let me know how can I…
boomz
  • 657
  • 5
  • 21
2
votes
1 answer

Indexing collections with stopword removal in Galago

I successfully indexed a collection using Galago. I didn't found any parameter for removing stopwords for indexing. Does galago remove stopwords automatically? If no, how can I pass the stopwords list to Galago and how I can tell Galago to remove…
John Foley
  • 957
  • 9
  • 19
1
vote
1 answer

Error when installing Galago

Hi i want to install Galago from the netbeans IDE or from the Promt command line with mvn package I installed maven But when i run build with dependency, i have this error message: [ERROR] Failed to execute goal on project tupleflow-gridengine:…
Chedi Bechikh
  • 183
  • 3
  • 13
1
vote
1 answer

Formulating Boolean Queries on Lemur Indri

I am unable to write the following boolean query, using Indri Query Language. THe query I need to write is the following: (Stana or Katic) AND (Jon or Huertas) AND (co-stars or colleagues) I need the list of all documents, containing the above. How…
darth
  • 169
  • 1
  • 16
1
vote
1 answer

What metrics can I use to validate and test RankNet in the RankLib library in the Lemur Project?

I am currently using the RankLib implementation of the RankNet algorithm (-ranker 4) with a held-out set. I am using the jar file in terminal to run this. The documentation stipulates: metric2t (e.g. NDCG, ERR, etc) only applies to list-wise…
BenObe
  • 13
  • 3
1
vote
2 answers

Galago 3.5 Indexing

Downloaded Galago 3.5 bin version and tried to index wiki-small.corpus following this guide. Strangely I get a File Not Found Exception for the .index file when trying to run the build index command. This error goes away when I explicitly use the…
Prateek
  • 685
  • 1
  • 7
  • 7
1
vote
2 answers

Why is the make command giving errors and how to fix it?

I have to use this program Lemur for some h.w but i am having troubles installing it as I have never used Linux. I downloaded cygwin and ran the ./configure which seemingly runs fine. When I try the make command I'm coming across some issues which I…
gilad
  • 41
  • 4
0
votes
1 answer

Lemur RankLib return code 1 on training

I am using RankLib for my data (shape: 218279 rows × 1504 columns) using python and getting error code 1 with none output. I am just wondering is there any documentation regarding error codes on RankLib? I am using Jupyter iPython for my project and…
0
votes
1 answer

Installation of Galago fails: JAVA_HOME is not defined correctly

I am trying to install Galago via the binary file(galago-3.12-bin) on unix. When i run the file I get the following error: SEF-IS-017660:galago-3.12-bin anton$ bin/galago Error: JAVA_HOME is not defined correctly. We cannot execute …
WaterBoy
  • 697
  • 3
  • 13
  • 35
0
votes
1 answer

JS: Drag and drop image in a search engine interface

I'm building a search engine interface. I've got some webpages raw content (including images) in Lemur indexes. I want to drag and drop images and implement image searching later. I know how to make an image draggable in HTML by setting…
Sharon Tan
  • 89
  • 1
  • 7
0
votes
1 answer

blank output on IndriRunQuery in lemur project

I'm using lemur for a nlp project, and I indexed some data succesffully I wanna run a query on index files by IndriRunQuery command parameter file: PATH-TO-INDEX-DIRECTORY 1
reza
  • 1,746
  • 6
  • 16
  • 32
0
votes
1 answer

Lemur Installation on Linux machine

I am trying to install Lemur-4.12 on Linux, but when running make I get: ../../MakeMod:48: recipe for target 'ResultFile.o' failed make[1]: *** [ResultFile.o] Error 1 make[1]: Leaving directory '/data1/myuser/lemur-4.12/retrieval/obj' Makefile:28:…
Crista23
  • 3,203
  • 9
  • 47
  • 60
0
votes
1 answer

Difference in text file saved manually and with Python codecs : Lemur Malformed document

I am using Python codecs to write some UTF-8 text to a file #-*-coding:utf-8-*- import codecs filename = 'afile' with codecs.open(filename, encoding='utf-8', mode='w') as fw : fw.write('\n') fw.write(filename) …
Avisek
  • 363
  • 1
  • 3
  • 16
0
votes
1 answer

IndriUI Index not building

I am trying to build index using Indri UI. I created parameter files and stopword lists for building the index. When I click build index, the UI keeps building for long time and the index is never built. UI hangs here, Here is my input.txt…
abtdw
  • 165
  • 2
  • 12
1
2