Many thanks, pals. I will try some of those approaches (and return with new questions) ;)

Best regards,
Francisco

On Fri, Sep 11, 2015 at 5:41 AM, Upayavira <u...@odoko.co.uk> wrote:

> It sounds to me like you are wanting to *filter* your document to only
> include terms within that medical dictionary. Or to have a keyword field
> based upon those of your 100k terms that appear in that doc.
>
> Synonyms are your saviour, if that's the case. Create a synonyms list
> for your terms; they can be a one-to-one mapping, so:
>
> diabetes => diabetes
>
> is quite okay. Then, in your index-time analysis chain, have a
> SynonymFilterFactory followed by a TypeTokenFilterFactory configured to
> only allow SYNONYM tokens through.
>
> Then, in your index, you will have a field that contains all the terms
> from your 100k that are included in that particular document.
>
> Does that get it?
>
> Upayavira
>
> On Fri, Sep 11, 2015, at 03:21 AM, Francisco Andrés Fernández wrote:
> > Yes.
> > I have many drug product leaflets, each corresponding to one product. On
> > the other hand, we have a medical dictionary with about 10^5 terms.
> > I want to detect all the occurrences of those terms in any leaflet
> > document.
> > Could you give me a clue about the best way to do it?
> > Perhaps the best way is (as Walter suggests) to run all the queries every
> > time, as needed.
> > Regards,
> >
> > Francisco
> >
> > On Thu, Sep 10, 2015 at 11:14 AM, Alexandre Rafalovitch <
> > arafa...@gmail.com> wrote:
> >
> > > Can you tell us a bit more about the business case? Not the current
> > > technical one. Because it is entirely possible Solr can solve the
> > > higher level problem out of the box without you doing manual term
> > > comparisons. In which case, your problem scope is not quite right.
> > >
> > > Regards,
> > > Alex.
> > > ----
> > > Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> > > http://www.solr-start.com/
> > >
> > >
> > > On 10 September 2015 at 09:58, Francisco Andrés Fernández
> > > <fra...@gmail.com> wrote:
> > > > Hi all, I'm new to Solr.
> > > > I want to detect all occurrences of terms existing in a thesaurus
> > > > in one or more documents.
> > > > What's the best strategy to do it?
> > > > Doing a query for each term doesn't seem to be the best way.
> > > > Many thanks,
> > > >
> > > > Francisco
> > > >
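
For readers following the thread, the chain Upayavira describes (a one-to-one synonym mapping followed by a type filter that only lets SYNONYM tokens through) can be illustrated with a rough plain-Python simulation. This is not Solr code and does not use any Solr API: the three-term dictionary, the sample leaflet text, and the function names `tokenize`, `synonym_filter`, `type_filter`, and `dictionary_terms` are all hypothetical, and the sketch only handles single-word terms.

```python
import re

# Hypothetical stand-in for the ~100k-term medical dictionary.
DICTIONARY = {"diabetes", "insulin", "hypoglycemia"}

# One-to-one "synonym" mapping, mirroring entries such as "diabetes => diabetes".
SYNONYMS = {term: term for term in DICTIONARY}


def tokenize(text):
    """Lowercase the text and split it into (token, type) pairs typed as WORD."""
    return [(tok, "WORD") for tok in re.findall(r"[a-z0-9]+", text.lower())]


def synonym_filter(tokens, synonyms):
    """Replace mapped tokens and retype them as SYNONYM; pass others through."""
    for tok, tok_type in tokens:
        if tok in synonyms:
            yield synonyms[tok], "SYNONYM"
        else:
            yield tok, tok_type


def type_filter(tokens, allowed_types):
    """Keep only tokens whose type is in allowed_types (a whitelist)."""
    return [tok for tok, tok_type in tokens if tok_type in allowed_types]


def dictionary_terms(text):
    """Run the whole chain and return the dictionary terms found, in order."""
    return type_filter(synonym_filter(tokenize(text), SYNONYMS), {"SYNONYM"})


# Hypothetical leaflet text, for illustration only.
leaflet = "Insulin is used to treat diabetes. Monitor for hypoglycemia."
print(dictionary_terms(leaflet))  # ['insulin', 'diabetes', 'hypoglycemia']
```

In an actual Solr setup the same idea would live in the field type's index-time analyzer rather than in application code, so each indexed leaflet ends up with a field holding only the dictionary terms it contains.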