7

Reading Heroku help on enabling full text search in PostgreSQL I see that it doesn't support Hebrew by default. Does anyone know how to add support for Hebrew dictionary in PostgreSQL on Heroku?

MikeMarsian
  • 608
  • 1
  • 7
  • 21
  • did you manage to get full text search running on pg? – Gady Dec 26 '13 at 15:44
  • I did, using pg_search gem https://github.com/Casecommons/pg_search The caveat is that I couldn't find any hebrew dictionaries, so I ended up using the default non-Hebrew dictionary which doesn't support all kinds of nice things you could expect Hebrew dictionary to support, like matching results that start with different letter. – MikeMarsian Dec 26 '13 at 16:09
  • so did you get hebrew searches working without the dictionary? – Gady Dec 26 '13 at 16:11
  • yep, hebrew searches work, but they are kinda basic. For example, if you look for "נשי" you'll find "נשים", but you won't find "לנשים". – MikeMarsian Dec 26 '13 at 16:13

1 Answers1

5

I work on Heroku Postgres, and would like input on this matter from those able to vend it.

I'm looking into this but so far the waters have been murky as to how Hebrew is supported in many/any open source projects, including dedicated full text searching projects like Lucene or Xapian. There are full blown toolchains for dealing with this, but their integration with PostgreSQL is not yet existent as far as I know, e.g. hebstem, hspell and libhspell, HebMorph.

If someone knows of what the current state of the art is for this in Postgres, I can try to make it work on Heroku at a time of my discretion, depending on the precise details of that implementation that I have to review somewhat carefully.

As-is my attempts to locate an ispell dictionary have been questionable, as is the efficacy of ispell style dictionaries for Hebrew given the reportedly very different stemming rules.

Related work:

Thoughts?

Community
  • 1
  • 1
fdr
  • 447
  • 2
  • 6