Re: [Virtuoso-users] Dumb strings in free text search

2009-02-04 Thread Kjetil Kjernsmo
On Tuesday 03 February 2009 17:54:29 Ivan Mikhailov wrote: > One can declare a new "language", as a named set of callbacks that split > the text into words, normalize that words and decide which words should > be indexed and which are not. OK, that sounds good! > We did not re-invent bicycles in

Re: [Virtuoso-users] Dumb strings in free text search

2009-02-03 Thread Ivan Mikhailov
Kjetil, One can declare a new "language", as a named set of callbacks that split the text into words, normalize that words and decide which words should be indexed and which are not. We did not re-invent bicycles in this area and what was sufficient for other developers in other projects is probab

[Virtuoso-users] Dumb strings in free text search

2009-02-03 Thread Kjetil Kjernsmo
All, I'd like to put one of the harder problems we're struggling with to you all: In freetext queries, our experience is that people tend to write the "dumbest" version of a string what they search. For example, they are likely to write "Gothe" or "Goethe" rather than "Göthe". The problem is no