Questions tagged [carrot2]

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot)

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot).

It is implemented in Java and can automatically organize small collections of documents (search results but not only) into thematic categories.

107 questions
5
votes
5 answers

Searching over documents stored in Hadoop - which tool to use?

I'm lost in: Hadoop, Hbase, Lucene, Carrot2, Cloudera, Tika, ZooKeeper, Solr, Katta, Cascading, POI... When you read about the one you can be often sure that each of the others tools is going to be mentioned. I don't expect you to explain every…
garret
  • 75
  • 2
  • 6
3
votes
0 answers

Sunspot Rails and Carrot2

I'm using the sunspot_rails gem for a rails application. Is there a way to integrate the cluster engine carrot2 into a rails app? Obviously carrot2 is part of sunspot as it is commented in the solr configs. But I'd like to know wheather someone has…
ticcoid
  • 261
  • 1
  • 11
2
votes
1 answer

How does (carrot) clustering work in solr?

i have running Lucene/Solr 4 for testing different features, also "clustering". Currently, 1 million documents are indexed. Every document has the following fields: ID (unique Key) Example1: 10245 Example2: 24974 TOPIC (Keywords of…
The Bndr
  • 13,204
  • 16
  • 68
  • 107
2
votes
0 answers

carrot2 workbench and solr

I have a solr 7.3 installation on debian, and want to cluster results with carrot2 workbench 3.16.0. I used carrot2 manual chapter 4.2.5 "clustering documents from a solr index" and: - set "solr" as a source - used default values in section medium -…
2
votes
1 answer

Solr with Carrot2

I'm trying to integrate Solr with Carrot2 clustering engine.I successfully managed to do clustering via Solr following this link : Result Clustering I'm getting the same output as mentioned in the techproducts example.I'm not able to visualize…
2
votes
1 answer

I can't open Carrot2 Workbench on WIndows 10

I tried to open Carrot2 Workbench in my Windows 10 x64 with JRE 9.0.4 but there is an error like this: !SESSION 2018-02-26 22:49:13.739 ----------------------------------------------- eclipse.buildId=unknown java.version=9.0.4 java.vendor=Oracle…
Saturn
  • 23
  • 5
2
votes
2 answers

carrot2 - can I cluster documents from a folder?

I'm trying to cluster documents I have collected as part of a research project. I am trying to use Carrot2 workbench and can't find out how to point carrot at the folder containing the documents. How do I do this please? (I have a small number of…
user2682250
  • 21
  • 1
  • 2
2
votes
2 answers

How to increase Java heap size for Carrot2?

How can I increase Java heap size for Carrot2 Document Clustering Workbench?
petexx67
  • 23
  • 2
2
votes
3 answers

Comparing clustering results in Carrot2

I am using Carrot2 tool for my thesis and running different algorithms on this tool. My question is that, how can i compare the results of different algorithms scientifically? I mean, i need a proof of that the clustering results of algorithm 1 is…
emre
  • 239
  • 2
  • 5
  • 13
1
vote
1 answer

Solr ClusteringComponent Carrot2 : ClassNotFoundException

I get the following error when I try to run Solr with the Clustering Component. java.lang.NoClassDefFoundError: org/carrot2/core/ControllerFactory at…
Mark
  • 2,522
  • 5
  • 36
  • 42
1
vote
1 answer

Tweaking Lingo parameters with carrot2 (with PHP)

I'm trying to tweak the call to the Carrot2 REST API : $client = new Client(); try { $params = [ 'multipart'=> [ ['name'=> 'dcs.c2stream', 'contents' => $xml], …
Touane
  • 23
  • 4
1
vote
1 answer

How do I get carrot2 workbench running with the solr core that I have created?

I want to integrate my Solr data core with carrot2, to get a nice clustered visualization. However, I am having difficulties with getting carrot2 running in the first place as the documentation I have come across is rather vague. What is needed…
blah
  • 674
  • 3
  • 17
1
vote
1 answer

Getting Java Heap Space error while using Carrot2

I have all my search result formatted in XML format and am trying to run lingo algorithm in the Carrot2 workbench and am continuously running into Java heap space error. The XML is formatted in a way that Carrot2 uses. I am running Carrot2 workbench…
Zxxxxx
  • 397
  • 1
  • 5
  • 16
1
vote
1 answer

Carrot: different clusters for the same query

When issuing the same query with match all query (* : *) I get different clusters and scores all the time. What could be the reason? First try: label: "В Минске" score: 52.79549568196028 Second try: label: "В Минске" "score":…
D_K
  • 1,410
  • 12
  • 35
1
vote
1 answer

Modifying stop words list

I would like to tune the carrot2 clusters to avoid labels, that do not start with prepositions -- for Russian language it looks quite strange to see a word in a grammatical case (non Nominative) and not have a preposition. The clustering is done…
D_K
  • 1,410
  • 12
  • 35
1
2 3 4 5 6 7 8