Questions tagged [analyzer]

For questions related to a person or device that analyses (or analyzes) given data

An analyser, also spelt analyzer, is a person or device that analyses given data. It examines in detail the structure of the given data and tries to find patterns and relationships between parts of the data. An analyser can be a piece of hardware or a software program running on a computer.

Information is taken from Wikipedia. The following article contains more information about the topic and links to other topics about commonly used analyzer types.

642 questions
112
votes
2 answers

Comparison of Lucene Analyzers

Can someone please explain the difference between the different analyzers within Lucene? I am getting a maxClauseCount exception and I understand that I can avoid this by using a KeywordAnalyzer but I don't want to change from the StandardAnalyzer…
Journeyman
  • 10,011
  • 16
  • 81
  • 129
74
votes
1 answer

How can I make my code diagnostic syntax node action work on closed files?

I'm building a set of code diagnostics using Roslyn (in VS2015 Preview). Ideally, I'd like any errors they produce to act as persistent errors, just as if I were violating a normal language rule. There are a bunch of options, but I'm having a hard…
Jon Skeet
  • 1,421,763
  • 867
  • 9,128
  • 9,194
59
votes
9 answers

Is there a log file analyzer for log4j files?

I am looking for some kind of analyzer tool for log files generated by log4j files. I am looking something more advanced than grep? What are you using for log file analysis? I am looking for following kinds of features: The tool should tell me how…
Juha Syrjälä
  • 33,425
  • 31
  • 131
  • 183
49
votes
4 answers

How to not-analyze in ElasticSearch?

I've got a field in an ElasticSearch field which I do not want to have analyzed, i. e. it should be stored and compared verbatim. The values will contain letters, numbers, whitespace, dashes, slashes and maybe other characters. If I do not give an…
Alfe
  • 56,346
  • 20
  • 107
  • 159
48
votes
8 answers

How do I disable all Roslyn Code Analyzers?

I'm trying to work with a large opensource project that has a handful of Roslyn Code Analyzers. When I open the solution Visual Studio uses ~35% CPU for about 15 minutes. Using PerfView I've figured out that the code analyzers being run on the…
JoshVarty
  • 9,066
  • 4
  • 52
  • 80
48
votes
2 answers

Elastic search- search_analyzer vs index_analyzer

I was looking at http://euphonious-intuition.com/2012/08/more-complicated-mapping-in-elasticsearch/ which explains ElasticSearch analyzers. I did not understand the part about having different search and index analyzers. The second example of…
Pavan Kemparaju
  • 1,591
  • 3
  • 16
  • 25
46
votes
3 answers

Analyzers in elasticsearch

I'm having trouble understanding the concept of analyzers in elasticsearch with tire gem. I'm actually a newbie to these search concepts. Can someone here help me with some reference article or explain what actually the analyzers do and why they are…
Vamsi Krishna
  • 3,742
  • 4
  • 20
  • 45
39
votes
4 answers

How to use a Lucene Analyzer to tokenize a String?

Is there a simple way I could use any subclass of Lucene's Analyzer to parse/tokenize a String? Something like: String to_be_parsed = "car window seven"; Analyzer analyzer = new StandardAnalyzer(...); List tokenized_string =…
Felipe Hummel
  • 4,674
  • 5
  • 32
  • 35
30
votes
6 answers

Making a lexical Analyzer

I'm working with a Lexical Analyzer program right now and I'm using Java. I've been researching for answers on this problem but until now I failed to find any. Here's my problem: Input: System.out.println ("Hello World"); Desired…
KLoverated
  • 311
  • 1
  • 3
  • 6
29
votes
2 answers

How do I know if the statistics of a Postgres table are up to date?

In pgAdmin, whenever a table's statistics are out-of-date, it prompts: Running VACUUM recommended The estimated rowcount on the table schema.table deviates significantly from the actual rowcount. You should run VACUUM ANALYZE on this…
Beibei
  • 586
  • 1
  • 4
  • 6
29
votes
2 answers

how edge ngram token filter differs from ngram token filter?

As I am new to elastic search, I am not able to identify difference between ngram token filter and edge ngram token filter. How these two differ from each other in processing tokens?
Karunakar
  • 2,209
  • 4
  • 15
  • 20
29
votes
1 answer

ElasticSearch Analyzer and Tokenizer for Emails

I could not find a perfect solution either in Google or ES for the following situation, hope someone could help here. Suppose there are five email addresses stored under field "email": 1. {"email": "john.doe@gmail.com"} 2. {"email":…
LYu
  • 2,316
  • 4
  • 21
  • 38
26
votes
3 answers

MySql query analyzer - free solutions

Is there a good Query Analyzer for MySQL (that's either free, or has a trial), that can analyse a query and make suggestions for indexes, like the "Display estimated execution plan" in Microsoft SQL Server management studio?
frank
  • 495
  • 1
  • 7
  • 13
25
votes
1 answer

Elasticsearch : How to list each analyzer used by a specific index

I need to find out which analyzer (type, language..) is configured in a specific index. I tried http://localhost:9200/wazzup/_mapping but it only gives information about field names/types. Thanks
Spadon_
  • 495
  • 2
  • 5
  • 11
21
votes
2 answers

Elasticsearch can't update non dynamic settings

I am trying to create a test analyzer to experiment on analysis on Elastic Search. I have an index created called "mytest" which is available and searchable, but when I try to create a custom analyzer I get below error { "error":…
Bren
  • 2,148
  • 1
  • 27
  • 45
1
2 3
42 43