I think if you add a field that has an analyzer that creates tokens on
alpha/digit/punctuation boundaries, that should go a long way. Use that both
at index and search time.

For example:
* 3555LHP  becomes "3555" "LHP"
 Searching for D3555 becomes "D" OR "3555", so it matches on token "3555"
from 3555LHP.

* t14240 becomes "t" "14240"
 Searching for t14240-ss  becomes "t" OR "14240" OR "ss", matching "14240"
from "t14240".

Similarly for your other examples.

If this proves to be too broad, you may need to define some stricter rules,
but you could use this for starters.

I think you will have to write your own analyzer, as it doesn't look like
any of the analyzers available in Solr/Lucene do exactly what you need. But
that's relatively straightforward. Just start with the code from one of the
existing Analyzers (e.g. KeywordAnalyzer).

Good luck,
Frans

On 5/31/07, realw5 <[EMAIL PROTECTED]> wrote:


Hey Guys,
I need some guidance in regards to a problem we are having with our solr
index. Below is a list of terms our customers search for, which are
failing
or not returning the complete set. The second side of the list is the
product id/keyword we want it to match.

Can you give me some direction on how this can (or let me know if i can't
be
done) with index/query analyzers. Any help is much appeciated!

Dan

---------------------------

Keyword Typed In / We want it to find

D3555 / 3555LHP
D460160-BN / D460160
D460160BN / D460160
Dd454557 / D454557
84200ORB / 84200
84200-ORB / 84200
T13420-SCH / T13420
t14240-ss / t14240
--
View this message in context:
http://www.nabble.com/SOLR-Indexing-Querying-tf3843221.html#a10883456
Sent from the Solr - User mailing list archive at Nabble.com.


Reply via email to