Questions tagged [lexicon]
69 questions
167
votes
18 answers
What is "Orthogonality"?
What does "orthogonality" mean when talking about programming languages?
What are some examples of Orthogonality?

ahmet alp balkan
- 42,679
- 38
- 138
- 214
7
votes
2 answers
C++ Most efficient way for storing, loading and looking up a lexicon
I have a dictionary that consists of words and their phonetic transcriptions.
The words are all lower case, so there is no case-sensitive search involved.
The lexicon is really huge, and I need to load it quickly when my application starts.
I…

tmighty
- 10,734
- 21
- 104
- 218
5
votes
1 answer
Russian-to-English Parallel Word Corpus?
I am looking for a simple Russian-to-English word corpus. It can be as simple as a CSV that lists a Russian word in the first column and the equivalent English word in the second. Any ideas where I can find such a thing? Does the NLTK toolkit have…

Cygorger
- 772
- 7
- 15
5
votes
7 answers
Creating Lexicon and Scanner in Python
I'm new here in the world of coding and I haven't received a very warm welcome. I've been trying to learn Python via the online tutorial http://learnpythonthehardway.org/book/. I've been able to struggle my way through the book up until exercise 48…

Zaqory
- 71
- 1
- 4
4
votes
1 answer
Example of NLTK's VADER Scoring Text
I would like someone to correct my understanding of how VADER scores text. I've read an explanation of this process here; however, I cannot match the compound score of test sentences to VADER's output when recreating the process it describes. Lets…

Laurie
- 1,189
- 1
- 12
- 28
3
votes
2 answers
Python VADER lexicon Structure for sentiment analysis
I am using the VADER sentiment lexicon in Python's nltk library to analyze text sentiment. This lexicon does not suit my domain well, and so I wanted to add my own sentiment scores to various words. So, I got my hands on the lexicon text file…

user2238328
- 259
- 2
- 6
- 16
3
votes
0 answers
Correlation between different sentiment lexicons
I am trying to build a sentiment lexicon for a specific domain. At the moment I need to compute the correlation between the sentiment scores derived from my method and those of one of the standard existing lexicons. First, can anybody help me and tell me…

user823743
- 2,152
- 3
- 21
- 31
3
votes
1 answer
English Lexicon for Search Query Correction
I'm building a spelling corrector for search engine queries by implementing the method described in "Spelling correction as an iterative process that exploits the collective knowledge of web users".
The high-level approach is as follows: for a…

danben
- 80,905
- 18
- 123
- 145
2
votes
0 answers
After creating a build from Expo, the APK needs the Metro bundler to run. What am I doing wrong?
I have created a build for Android in Expo. I got a standalone link which let me download an APK. I installed that APK on my mobile device but it does not connect to the backend (API). When I run metro bundler on my development…

Dreamer
- 112
- 1
- 14
2
votes
1 answer
building a lexer with very many tokens
I've been searching for two hours now and I don't really know what to do.
I'm trying to build an analyser that uses a lexer that can match several thousand words. Those are natural language words, which is why there are so many.
I tried first in a…

Corentin LE ROY
- 21
- 2
2
votes
1 answer
Text analysis and clustering for small text
I have a dataset of programming skills that I would like to preprocess/clean and then sort into some more general groups.
For the cleaning step, what text cleaning could I apply to the following text? Examples from the following datasets. Visual C and C are the same…

Eracog
- 225
- 1
- 2
- 8
2
votes
1 answer
Wordnet query to return example sentences
I have a use case where I have a word and I need to know the following things:
Synonyms for the word (just the synonyms are sufficient)
All senses of the word, where each sense contains - the synonyms matching that word in that sense, example…

SexyBeast
- 7,913
- 28
- 108
- 196
1
vote
3 answers
Rank in given order
I have been given a string of length n and I need to find its rank in the alphabetical ordering of all strings of length n. For example, let's say I have been given,
"ABC" then as n=4 then ordering will be as {ABCD, ABCE, ABCF ....} thus the…
user20284443
1
vote
1 answer
How to use a custom NRC-style lexicon on Syuzhet for R?
I am new to R and new to working with Syuzhet.
I am trying to make a custom NRC-style library to use with the Syuzhet package in order to categorize words. Unfortunately, although this functionality now exists within Syuzhet, it doesn't seem to…

Alexandra Hudson
- 11
- 3
1
vote
0 answers
Find the number of positive and negative words in a text using a Lexicon
I am trying to figure out how to create a list of lists where each sublist contains the number of positive words and negative words in a given text. Below I have the names of the positive and negative text files that I am working with and an…

lbh57
- 31
- 5
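
For the last question listed above (counting positive and negative words per text with a lexicon), a minimal sketch of one possible approach is shown here. The asker's actual file names and data are not visible in the excerpt, so the two word sets and the sample texts below are hypothetical stand-ins; in practice each set would be loaded from the positive and negative lexicon files. The function name `count_sentiment_words` is likewise an assumption made for illustration.

```python
import re

def count_sentiment_words(text, positive_words, negative_words):
    """Return [positive_count, negative_count] for a single text."""
    # Lowercase the text and split it into simple word tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    pos = sum(1 for t in tokens if t in positive_words)
    neg = sum(1 for t in tokens if t in negative_words)
    return [pos, neg]

# Hypothetical stand-in lexicons; real ones would be read from files.
positive_words = {"good", "great", "happy"}
negative_words = {"bad", "sad", "awful"}

# Hypothetical sample texts.
texts = ["A good day, a great day.", "Sad and awful, but not bad."]

# One [positive, negative] sublist per text.
counts = [count_sentiment_words(t, positive_words, negative_words) for t in texts]
print(counts)
```

Storing each lexicon as a set keeps every membership check at roughly constant time, which matters once the lexicon grows to thousands of entries. Note that this sketch ignores negation ("not bad" still counts as one negative word).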