We're facing an issue with stemming in solr. Most of the cases are working correctly, for example, if we search for bidding, solr brings results for bidding, bid, bids, etc. However, with nouns ended with 'ion' suffix, stemming is not working. Even when analyzers seems to have correct stemming of the word, the results are not reflecting that. One example. If I search 'identifying', this is the output:
Analyzer (image): [cid:image002.png@01D61EEB.C2A5EC50] A clip of results: "haschildren_b":false, "isbucket_text_s":"0", "sectionbody_t":"\n\n\nIn order to identify 1st price auctions, leverage the proprietary tools available or manually pull a log file report to understand the trends and gauge auction spread overtime to assess the impact of variable auction dynamics.\n\n\n\n\n\n\n", "parsedupdatedby_s":"sitecorecarvaini", "sectionbody_t_en":"\n\n\nIn order to identify 1st price auctions, leverage the proprietary tools available or manually pull a log file report to understand the trends and gauge auction spread overtime to assess the impact of variable auction dynamics.\n\n\n\n\n\n\n", "hide_section_b":false As you can see, it has used the stemming correctly and brings results for other words based in the root, in this case "Identify". However, if I search for "Identification", this is the output: Analyzer (image): [cid:image003.png@01D61EF4.5BECD6F0] Even with proper stemming, solr is only bringing results for the word identification (or identifications) but nothing else. The queries are over the same field that has the Porter Stemming Filter applied for both, query and index. This behavior is consistent with other 'ion' ended nouns: representation, modification, etc. Solr Version: 8.1. Does anyone know why is it happening? Is it a bug? Thanks. [https://resourcesanalytics.blob.core.windows.net/email-signature-logos/sig/EMEA/IT/Prodigious/logopro.jpg] Jhonny Lopez Technical Architect Avenida Calle 26 No. 92 - 32, Edificio BTS3 APDO. 128-1255 Bogota T: +573006805461 jhonny.lo...@publicismedia.com www.prodigious.com ------------------------------------------------------------------------ Disclaimer The information in this email and any attachments may contain proprietary and confidential information that is intended for the addressee(s) only. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution, retention or use of the contents of this information is prohibited. When addressed to our clients or vendors, any information contained in this e-mail or any attachments is subject to the terms and conditions in any governing contract. If you have received this e-mail in error, please immediately contact the sender and delete the e-mail.