I generally prefer index-time work to query-time work on the theory
that the index-time work is done once and the query time work is done
for each query.

That said, for a corpus this size (and presumably without a large
query rate) I doubt you'd be able to measure any difference.

So basically choose the easiest to implement IMO.

Best,
Erick

On Wed, Mar 29, 2017 at 8:43 PM, Alexandre Rafalovitch
<arafa...@gmail.com> wrote:
> I am not sure I can tell how to decide on one or another. However, I
> wanted to mention that you also have an option of doing in in the
> UpdateRequestProcessor chain. That's still within Solr (and therefore
> is consistent with multiple clients feeding into Solr) but is before
> individual field processing (so will survive - for example - a
> copyField).
>
> Regards,
>    Alex.
> ----
> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>
>
> On 29 March 2017 at 23:38, Derek Poh <d...@globalsources.com> wrote:
>> Hi
>>
>> Ineed to create afield that will be prefix and suffix with code 'z01x'.This
>> field needs to have the code in the index and during query.
>> I can either
>> 1.
>> have the source data of the field formatted with the code before indexing
>> (outside solr).
>> use a charFilter in the query stage of the field typeto add the codeduring
>> query.
>>
>> <charFilter class="solr.PatternReplaceCharFilterFactory" pattern="^(.*)$"
>> replacement="z01x $1 z01x" />
>>
>> OR
>>
>> 2.
>> use the charFilter before tokenizerclass during the index and query analyzer
>> stage of the field type.
>>
>> The collection has between 100k - 200k documents currentlybut it may
>> increase in the future.
>> Theindexing time with option 2 and current indexing time is almost the same,
>> between 2-3 minutes.
>>
>> Which option would you advice?
>>
>> Derek
>>
>> ----------------------
>> CONFIDENTIALITY NOTICE
>> This e-mail (including any attachments) may contain confidential and/or
>> privileged information. If you are not the intended recipient or have
>> received this e-mail in error, please inform the sender immediately and
>> delete this e-mail (including any attachments) from your computer, and you
>> must not use, disclose to anyone else or copy this e-mail (including any
>> attachments), whether in whole or in part.
>> This e-mail and any reply to it may be monitored for security, legal,
>> regulatory compliance and/or other appropriate reasons.

Reply via email to