Re: length of indexed value

John Blythe Wed, 04 Oct 2017 04:25:45 -0700

interesting idea.

the field in question is one that can have a good deal of stray zeros based
on distributor skus for a product and bad entries from those entering them.
part of the matching logic for some operations look for these discrepancies
by having a simple regex that removes zeroes. so 400010 can match with
40010 (and rightly so). issues come in the form of rare cases where 41 is a
sku by the same distributor or manufacturer and thus can end up being an
erroneous match. having a means of looking at the length would help to know
that going from 6 characters to 2 is too far a leap to be counted as a
match.


--
John Blythe

On Wed, Oct 4, 2017 at 6:22 AM, alessandro.benedetti <a.benede...@sease.io>
wrote:

> Are the norms a good approximation for you ?
> If you preserve norms at indexing time ( it is a configuration that you can
> operate in the schema.xml) you can retrieve them with this specific
> function
> query :
>
> *norm(field)*
> Returns the "norm" stored in the index for the specified field. This is the
> product of the index time boost and the length normalization factor,
> according to the Similarity for the field.
> norm(fieldName)
>
> This will not be the exact length of the field, but it can be a good
> approximation though.
>
> Cheers
>
>
>
> -----
> ---------------
> Alessandro Benedetti
> Search Consultant, R&D Software Engineer, Director
> Sease Ltd. - www.sease.io
> --
> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
>

Re: length of indexed value

Reply via email to