Questions tagged [fuzzy]

DO NOT USE - ambiguous: see fuzzy-search, fuzzy-logic, or image-processing for more appropriate tags.

Do not use this tag. It is ambiguous: see , , or for more appropriate tags.

363 questions
32
votes
14 answers

Fuzzy date algorithm

I'm looking for a fuzzy date algorithm. I just started writing one and realised what a tedious task it is. It quickly degenerated into a lot of horrid code to cope with special cases like the difference between "yesterday", "last week" and "late…
Rog
  • 17,070
  • 9
  • 50
  • 73
32
votes
3 answers

django fuzzy string translation not showing up

Why sometimes I get a fuzzy item in django.po language file. Actually, I have checked in my project the fuzzy string item is totally unique. #: .\users\views.py:81 .\users\views.py:101 #, fuzzy msgid "username or email" msgstr "9988" It is ok to…
icn
  • 17,126
  • 39
  • 105
  • 141
24
votes
4 answers

Clang for fuzzy parsing C++

Is it at all possible to parse C++ with incomplete declarations with clang with its existing libclang API ? I.e. parse .cpp file without including all the headers, deducing declarations on the fly. so, e.g. The following text: A B::Foo(){return…
20
votes
6 answers

Fuzzy matching deduplication in less than exponential time?

I have a large database (potentially in the millions of records) with relatively short strings of text (on the order of street address, names, etc). I am looking for a strategy to remove inexact duplicates, and fuzzy matching seems to be the method…
Dave W.
  • 1,576
  • 2
  • 18
  • 29
15
votes
4 answers

String immutability in CPython violated

This is more of an 'interesting' phenomena I encountered in a Python module that I'm trying to understand, rather than a request for help (though a solution would also be useful). >>> import fuzzy >>> s = fuzzy.Soundex(4) >>> a = "apple" >>> b =…
Alex
  • 18,332
  • 10
  • 49
  • 53
13
votes
2 answers

Calculating a relative Levenshtein distance - make sense?

I am using both Daitch-Mokotoff soundexing and Damerau-Levenshtein to find out if a user entry and a value in the application are "the same". Is Levenshtein distance supposed to be used as an absolute value? If I have a 20 letter word, a distance of…
Joseph Tura
  • 6,290
  • 8
  • 47
  • 73
12
votes
1 answer

Emacs fuzzy autocompletion

I really like ido's fuzzy matching in emacs. I would like to have that with autocompletion. Preferably with auto-complete, since I have ac-python and other things setup with auto-complete. I am aware that auto-complete provides fuzzy matching if…
fread2281
  • 1,136
  • 1
  • 11
  • 31
11
votes
2 answers

Find similar ASCII character in Unicode

Does someone know a easy way to find characters in Unicode that are similar to ASCII characters. An example is the "CYRILLIC SMALL LETTER DZE (ѕ)". I'd like to do a search and replace for similar characters. By similar I mean human readable. You…
DrDol
  • 2,220
  • 2
  • 19
  • 23
10
votes
2 answers

Django's makemessages creates a lot of fuzzy entries

Each time I added some strings to a Django project, I run "django-admin.py makemessages -all" to generate .PO files for all locales. The problem is even I only added 5 news strings, the makemessages command will mark 50 strings as fuzzy in .PO files…
jack
  • 17,261
  • 37
  • 100
  • 125
9
votes
4 answers

Fuzzy Date Time Picker Control in C# .NET?

I am implementing a Fuzzy Date control in C# for a winforms application. The Fuzzy Date should be able to take fuzzy values like Last June 2 Hours ago 2 Months ago Last week Yesterday Last year and the like Are there any sample implementations…
abhilash
  • 5,605
  • 3
  • 36
  • 59
9
votes
1 answer

clustering and matlab

I'm trying to cluster some data I have from the KDD 1999 cup dataset the output from the file looks like…
G Gr
  • 6,030
  • 20
  • 91
  • 184
8
votes
1 answer

Matching fuzzy strings

I have two tables that I need to merge together in PostgreSQL, on the common variable "company name." Unfortunately many of the company names don't match exactly (i.e. MICROSOFT in one table, MICROSFT in the other). I've tried removing common…
aesir
  • 565
  • 2
  • 13
  • 23
8
votes
3 answers

Python fuzzy string matching as correlation style table/matrix

I have a file with x number of string names and their associated IDs. Essentially two columns of data. What I would like, is a correlation style table with the format x by x (having the data in question both as the x-axis and y axis), but instead of…
8
votes
2 answers

Create a unique ID by fuzzy matching of names (via agrep using R)

Using R, I am trying match on people's names in a dataset structured by year and city. Due to some spelling mistakes, exact matching is not possible, so I am trying to use agrep() to fuzzy match names. A sample chunk of the dataset is structured as…
thomasB
  • 303
  • 3
  • 11
8
votes
3 answers

Fuzzy date parsing with Java

Are there any libraries for Java that allow you to interpret dates like "Yesterday", "Next Monday", ...
carrier
  • 32,209
  • 23
  • 76
  • 99
1
2 3
24 25