Re: Searching for credit card numbers

2020-07-28 Thread Walter Underwood
If you reindex, I’ve become a big fan of adding a date field with an index timestamp. That will allow you to check whether everything has been reindexed. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Jul 28, 2020, at 2:11 PM, Jörn Franke wrot

Re: Searching for credit card numbers

2020-07-28 Thread Jörn Franke
A regex search at query time would leave room for attacks (eg a regex can easily be designed to block the Solr server forever). If the field is store you can also try to use a cursor to go through all entries using a cursor and reindex the doc based on the field: https://lucene.apache.org/solr/

Re: Searching for credit card numbers

2020-07-28 Thread lstusr 5u93n4
Possible... yes. Agreed that this is the right approach. But if we already have a big index that we're searching through? Any way to "hack it"? On Tue, 28 Jul 2020 at 14:55, Walter Underwood wrote: > I’d do that at index time. Add an update request processor script that > does the regex and adds

Re: Searching for credit card numbers

2020-07-28 Thread Walter Underwood
I’d do that at index time. Add an update request processor script that does the regex and adds a field has_credit_card_number:true. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Jul 28, 2020, at 11:50 AM, lstusr 5u93n4 wrote: > > Let's say I have

Re: Searching document content and mult-valued fields

2020-07-06 Thread Emir Arnautović
Hi Shaun, If project content is relatively static, you could use nested documents or you could plain with join query parser . HTH, Emir -- Mon

Re: Searching individual pages in solr

2020-03-24 Thread Erick Erickson
Well, given the structure of an inverted index, how would you have a clue what page the hit was on? You could conceivably index enough data with payloads and the like, but that’d cause a lot more bloat than just indexing each page. Using grouping would allow you to show, say, the top three pages

Re: Searching on dates and time

2019-07-05 Thread Erick Erickson
There should be a number of these in the example schemas, although perhaps without indexed=“true” in the fieldType... DateRanges are pretty cool, but this in the “keep it simple” category, you might just be able to use plain pdates with the standard [time TO time] syntax. Although when I try

Re: Searching on dates and time

2019-07-05 Thread Steven White
Achieving the use-case is a must. So if here is an alternative to using solr.DateRangeField, I'm willing to use it. What do you mean by "pdate" and what is it? I'm reading this link on how to use DateRangeField but yet it is not working for me: https://lucene.apache.org/solr/guide/6_6/working-wi

Re: Searching on dates and time

2019-07-05 Thread Erick Erickson
I think what Mikhail is asking is whether your use-case would be satisfied by just indexing a standard pdate rather than a daterange, then querying by fq=CC_FILE_DATETIME:[some_date/MONTH TO some_maybe_other_full_date]. With regular pdates, you can use “date math” to round to whatever you want o

Re: Searching on dates and time

2019-07-05 Thread Steven White
I need both: point in time and range. In both cases, I need to be able to search between just 2 years, between year-month to year-month-day-time, etc. So getting my schema right, what and how I index right and the search syntax right are all important. This is why, in my original post, I shared

Re: Searching on dates and time

2019-07-05 Thread Mikhail Khludnev
Hold on. Do you need a range or just point in time? On Fri, Jul 5, 2019 at 6:51 PM Steven White wrote: > Thanks Mikhail. I will read those links and switch over to latest Solr. > > Just to be sure, my schema setup and the way I'm indexing the date data are > not the issue, right? > > Steven. >

Re: Searching on dates and time

2019-07-05 Thread Steven White
Thanks Mikhail. I will read those links and switch over to latest Solr. Just to be sure, my schema setup and the way I'm indexing the date data are not the issue, right? Steven. On Fri, Jul 5, 2019 at 11:05 AM Mikhail Khludnev wrote: > Hello, > > The indexed daterange value is really narrow,

Re: Searching on dates and time

2019-07-05 Thread Mikhail Khludnev
Hello, The indexed daterange value is really narrow, it might not be easy to pick per se. I'm in doubts regarding " in queries. At least TO syntax expects [ ] You can start from these baseline cases https://github.com/apache/lucene-solr/blob/master/solr/core/src/test/org/apache/solr/schema/DateRan

Re: searching only within a date range

2019-06-07 Thread Mark Fenbers - NOAA Federal
Disregard my previous response. When I reindexed, something went wrong and so my Lucene database was empty, which explains the immediate results and 0 results. I reindexed again (properly) and all is working find now. Thanks for the help. Mark On Fri, Jun 7, 2019 at 10:40 AM Erick Erickson wro

Re: searching only within a date range

2019-06-07 Thread Mark Fenbers - NOAA Federal
I added "posttime" to the schema first thing this morning, but your message reminded me that I needed to re-index the table, which I did. My schema entry: But my SQL contains "SELECT posttime as id" as so I tried both "posttime" and "id" in my setParam() function, namely, query.setParam("fq", "

Re: searching only within a date range

2019-06-07 Thread Erick Erickson
Yeah, it can be opaque… My first guess is that you may not have a field “posttime” defined in your schema and/or documents. For searching it needs “indexed=true” and for faceting/grouping/sorting it should have “docValues=true”. That’s what your original facet query was telling you, the field i

Re: searching only within a date range

2019-06-07 Thread Mark Fenbers - NOAA Federal
So, instead of addDateRangeFacet(), I used: query.setParam("fq", "posttime:[2010-01-01T00:00:00Z TO 2015-01-01T00:00:00Z]"); I didn't get any errors, but the query returned immediately with 0 results. Without this contraint, it searches 13,000 records and takes 1 to 2 minutes and returns 356 reco

Re: searching only within a date range

2019-06-07 Thread Andrea Gazzarini
Hi Mark, you are using a "range facet" which is a "query-shape" feature, it doesn't have any constraint on the results (i.e. it doesn't filter at all). You need to add a filter query [1] with a date range clause (e.g. fq=field:[ TO or *>]). Best, Andrea [1] https://lucene.apache.org/solr/gui

Re: searching is slow while adding document each time

2018-10-28 Thread Erick Erickson
bq. Do you really think running a profiler on 4.4 will be more effective than upgrading to 7.x? No but it's better than random speculation. On Sun, Oct 28, 2018 at 9:34 PM Deepak Goel wrote: > > What are your hardware utilisations (cpu, memory, disk, network)? > > I think you might have to tune l

Re: searching is slow while adding document each time

2018-10-28 Thread Deepak Goel
What are your hardware utilisations (cpu, memory, disk, network)? I think you might have to tune lucene too On Wed, 26 Sep 2018, 14:33 Mugeesh Husain, wrote: > Hi, > > We are running 3 node solr cloud(4.4) in our production infrastructure, We > recently moved our SOLR server host softlayer to d

Re: searching is slow while adding document each time

2018-10-28 Thread Walter Underwood
Do you really think running a profiler on 4.4 will be more effective than upgrading to 7.x? wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Oct 28, 2018, at 8:53 PM, Erick Erickson wrote: > > Put a profiler on it and see where the hot spots are? >

Re: searching is slow while adding document each time

2018-10-28 Thread Erick Erickson
Put a profiler on it and see where the hot spots are? On Sun, Oct 28, 2018 at 8:27 PM Walter Underwood wrote: > > Upgrade, so that indexing isn’t using as much CPU. That leaves more CPU for > search. > > Make sure you are on a recent release of Java. Run the G1 collector. > > If you need more thr

Re: searching is slow while adding document each time

2018-10-28 Thread Walter Underwood
Upgrade, so that indexing isn’t using as much CPU. That leaves more CPU for search. Make sure you are on a recent release of Java. Run the G1 collector. If you need more throughput, add more replicas or use instance with more CPUs. Has the index gotten bigger since the move? wunder Walter Unde

Re: searching is slow while adding document each time

2018-10-28 Thread Parag Shah
The original question though is about performance issue in the Searcher. How would you improve that? On Sun, Oct 28, 2018 at 4:37 PM Walter Underwood wrote: > The original question is for a three-node Solr Cloud cluster with > continuous updates. > Optimize in this configuration won’t help, it w

Re: searching is slow while adding document each time

2018-10-28 Thread Walter Underwood
The original question is for a three-node Solr Cloud cluster with continuous updates. Optimize in this configuration won’t help, it will just cause expensive merges later. I would recommend updating from Solr 4.4. that is a very early release for Solr Cloud. We saw dramatic speedups in indexing

Re: searching is slow while adding document each time

2018-10-28 Thread Erick Erickson
Well, if you optimize on the master you'll inevitably copy the entire index to each of the slaves. Consuming that much network bandwidth can be A Bad Thing. Here's the background for Walter's comment: https://lucidworks.com/2017/10/13/segment-merging-deleted-documents-optimize-may-bad/ Solr 7.5 i

Re: searching is slow while adding document each time

2018-10-28 Thread Parag Shah
What would you do if your performance is degrading? I am not suggesting doing this for a serving index. Only one at the Master, which ones optimized gets replicated. Am I missing something here? On Sun, Oct 28, 2018 at 11:05 AM Walter Underwood wrote: > Do not run optimize (force merge) unless

Re: searching is slow while adding document each time

2018-10-28 Thread Walter Underwood
Do not run optimize (force merge) unless you really understand the downside. If you are continually adding and deleting documents, you really do not want to run optimize. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Oct 28, 2018, at 9:24 AM, Parag

Re: searching is slow while adding document each time

2018-10-28 Thread Parag Shah
Hi Mugeesh, Have you tried optimizing indexes to see if performance improves? It is well known that over time as indexing goes on lucene creates more segments which will be searched over and hence take longer. Merging happens constantly but continuous indexing will still introduce smaller seg

RE: Searching by dates

2018-08-16 Thread Markus Jelsma
project in on-going development since 2010, but not open source, so if you are interested contact us off list. Regards, Markus -Original message- > From:Shawn Heisey > Sent: Thursday 16th August 2018 20:09 > To: solr-user@lucene.apache.org > Subject: Re: Searching by dates

Re: Searching by dates

2018-08-16 Thread Shawn Heisey
On 8/16/2018 9:20 AM, Christopher Schultz wrote: > Hmm. I could have sworn the documentation I read in the past (maybe as > long as 3-4 months ago) indicated that date+timestamp was necessary. > Maybe that was just for the index, while the searches can be partial. DateRangeField was introduced fou

Re: Searching by dates

2018-08-16 Thread Alexandre Rafalovitch
You could have PatternReplace in your field definition either as a CharFilter or a TokenFilter. See: http://www.solr-start.com/info/analyzers/ Regards, Alex. On 16 August 2018 at 11:20, Christopher Schultz wrote: > Shawn, > > On 8/16/18 10:37 AM, Shawn Heisey wrote: >> On 8/16/2018 7:48 AM, C

Re: Searching by dates

2018-08-16 Thread Christopher Schultz
Shawn, On 8/16/18 10:37 AM, Shawn Heisey wrote: > On 8/16/2018 7:48 AM, Christopher Schultz wrote: >> I haven't actually tried this, yes, but from the docs I'm guessing that >> I can't search for a DOB using e.g. 2018-08-16 but instead I need to >> search using 2018-08-16T00:00:00 plus maybe "Z" a

Re: Searching by dates

2018-08-16 Thread Alexandre Rafalovitch
However, you probably will still need to convert your dates into strings as well to match people's search expectation, as the date fields do not store _english_ month names internally. So, you will want to have a secondary field that expands 2018-02-31 into "February 2018" (and "Feb 2018"?) includ

Re: Searching by dates

2018-08-16 Thread Shawn Heisey
On 8/16/2018 7:48 AM, Christopher Schultz wrote: > I haven't actually tried this, yes, but from the docs I'm guessing that > I can't search for a DOB using e.g. 2018-08-16 but instead I need to > search using 2018-08-16T00:00:00 plus maybe "Z" at the end for the TZ. > > No user is ever going to do

Re: Searching for a phrase in proximity to another token in SOLR

2018-02-27 Thread Erick Erickson
Did you try the ComplexPhraseQueryParser? See: https://lucene.apache.org/solr/guide/6_6/other-parsers.html Best, Erick On Tue, Feb 27, 2018 at 7:23 AM, Deyan Yotsov wrote: > Hello, > > Is there a way to achieve something along these lines: > > "("john smith") josh"~12 > > Thank you, > > Deyan >

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-02-02 Thread Alessandro Benedetti
1) Diego's observation about IDF is absolutely correct here, but I don't think he was pointing it to be a negative aspect of your new approach. I think he just wanted to warn you about this. The way BM25 uses the IDF feature of a term is to estimate how important is the term in the context ( givin

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-02-01 Thread Luigi Caiazza
t if you index > all the documents at the beginning your scores will be different - since > idf will be computed on all the documents that you have in the collection. > > Cheers, > Diego > > > From: solr-user@lucene.apache.org At: 01/31/18 20:12:16To: > solr-user@lucene.a

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-02-01 Thread Diego Ceccarelli (BLOOMBERG/ LONDON)
different - since idf will be computed on all the documents that you have in the collection. Cheers, Diego From: solr-user@lucene.apache.org At: 01/31/18 20:12:16To: solr-user@lucene.apache.org Subject: Re: Searching for an efficient and scalable way to filter query results using non-indexed a

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-02-01 Thread Emir Arnautović
Hi, I did not check it in code, but based on earlier comments on ML, it seems that in place updates are not as it sounds - it will rewrite doc values for the segment that is updated. If you really want to avoid index changes, you can maybe use external field: https://lucene.apache.org/solr/guid

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-02-01 Thread Alessandro Benedetti
Reading from the wiki [1]: " An atomic update operation is performed using this approach only when the fields to be updated meet these three conditions: are non-indexed (indexed="false"), non-stored (stored="false"), single valued (multiValued="false") numeric docValues (docValues="true") fields

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-01-31 Thread Luigi Caiazza
Hi, first of all, thank you for your answers. @ Rick: the reason is that the set of pages that are stored into the disk represents just a static view of the Web, in order to let my experiments be fully replicable. My need is to run simulations of different crawlers on top of it, each working on t

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-01-31 Thread Rick Leir
Luigi Is there a reason for not indexing all of your on-disk pages? That seems to be the first step. But I do not understand what your goal is. Cheers -- Rick On January 30, 2018 1:33:27 PM EST, Luigi Caiazza wrote: >Hello, > >I am working on a project that simulates a selective, large-scale >cr

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-01-31 Thread Alessandro Benedetti
I am not sure I fully understood your use case, but let me suggest few different possible solutions : 1) Query Time join approach : you keep 2 collections, one static with all the pages, one that just store lighweight documents containing the crawling interaction : 1) Id, content -> Pages 2)pageId

Re: Searching With UTF-8

2017-08-29 Thread Diego Ceccarelli
Hello Lawrence, Which type did you use in the solr schema for your fields? Cheers, Diego On Tue, Aug 29, 2017 at 5:34 PM, Elitzer, Lawrence < lelit...@lgsinnovations.com> wrote: > Hello! > > > > It seems I can correctly import (with DIH) UTF-8 characters such as J but > I am unable to search on

Re: Searching under multiple field

2017-06-11 Thread Zheng Lin Edwin Yeo
Hi, I found that this edismax query works. http://localhost:8983/solr/collection1/select?defType=edismax&q=*test* AND field1_tc:"Main"* &qf=field1_s field2_s field3_s field4_s Regards, Edwin On 12 June 2017 at 11:20, Zheng Lin Edwin Yeo wrote: > Hi Ravi, > > Thanks for your suggestion. > > I

Re: Searching under multiple field

2017-06-11 Thread Zheng Lin Edwin Yeo
Hi Ravi, Thanks for your suggestion. I'm looking at the edismax query parser, but could not figure out how we can streamline the query using that. As for using copyField, it is probably not a good idea, as it will increase the index size. Regards, Edwin On 12 June 2017 at 09:34, ANNAMANENI RA

Re: Searching under multiple field

2017-06-11 Thread ANNAMANENI RAVEENDRA
Hi, Use dismay or edismax query parser Or Use copy field concept Thanks Ravi On Sun, 11 Jun 2017 at 9:32 PM, Zheng Lin Edwin Yeo wrote: > Hi, > > Currently, I'm using the following query to search for the same word under > different fields > > http://localhost:8983/solr/collection1/select?q=

Re: Searching for a term which isn't a part of an expression

2016-12-15 Thread Dean Gurvitz
I think queries would usually not contain more than one phrase per query, but there isn't a fixed list. Anyways, your solution is very very good for us. We could write a QueryParser or a SearchComponent that edits the Lucene Query object in the ResponseBuilder to include the relevant SpanNotQuery.

Re: Searching for a term which isn't a part of an expression

2016-12-15 Thread Ahmet Arslan
Hi, Span query family would be a pure query-time solution, SpanNotQuery in particular. SpanNearQuery include = new SpanTermQuery(new Term(FIELD, "world"); SpanNearQuery exclude = new SpanNearQuery(new SpanQuery[] { new SpanTermQuery(new Term(FIELD, "hello")), new SpanTermQuery(new Term(FIELD

Re: Searching for a term which isn't a part of an expression

2016-12-14 Thread Dean Gurvitz
Hi, The list of phrases wil be relatively dynamic, so changing the indexing process isn't a very good solution for us. We also considered using a PostFilter or adding a SearchComponent to filter out the "bad" results, but obviously a true query-time support would be a lot better. On Wed, Dec 14,

Re: Searching for a term which isn't a part of an expression

2016-12-14 Thread Ahmet Arslan
Hi, Do you have a common list of phrases that you want to prohibit partial match? You can index those phrases in a special way, for example, This is a new world hello_world hot_dog tap_water etc. ahmet On Wednesday, December 14, 2016 9:20 PM, deansg wrote: We would like to enable queries for

Re: Searching Special charterer in solr behaving inconsistent

2016-09-16 Thread Erick Erickson
You really have to define _how_ it's not working, provide field definitions, perhaps the result of adding &debug=query to the URL. You might review: http://wiki.apache.org/solr/UsingMailingLists At a guess and based on fragmentary information, your second query is searching against the default fi

Re: Searching Home's, Homes and Home

2016-07-22 Thread Surender
Thanks for all the responses... I have checked these options, none of the option has worked so far. The option is only giving only two results not the third one. I am checking some more options and if you can share more ideas, that would be great. Thanks, Surender Singh -- View this message in

Re: Searching Home's, Homes and Home

2016-07-12 Thread Vijaymhaskar
Hi Surender, Please go through Stemmer documentation which will give you idea on how stemmer works. I see below issues in configured field types, 1. You have added porter stemmer awa english minimal stemmer also. You can remove one of those based on your requirement. Minimal stemmer is conservati

Re: Searching Home's, Homes and Home

2016-07-12 Thread Surender
Hi, I do not want to use Synonyms.txt as this would require to a big library and that will be time consuming. Thanks, Surender Singh -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-Home-s-Homes-and-Home-tp4286341p4286897.html Sent from the Solr - User mailing lis

Re: Searching Home's, Homes and Home

2016-07-12 Thread Surender
Hi, The following is the analyzer information and let me know what I am missing.

Re: Searching Home's, Homes and Home

2016-07-12 Thread John Blythe
copy in your analyzer from your schema.xml -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Tue, Jul 12, 2016 at 8:10 AM, Surender wrote: > Hi, > > I have checked the results and I am not getting desire

Re: Searching Home's, Homes and Home

2016-07-12 Thread Surender
Hi, I have checked the results and I am not getting desired results. Please suggest. Thanks, Surender Singh -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-Home-s-Homes-and-Home-tp4286341p4286757.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Searching Home's, Homes and Home

2016-07-12 Thread Vijaymhaskar
Hi Surender, Can you share your current field configuration so that we can debug it from there.. ? Share your field and fieldtype definition from schema.xml . -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-Home-s-Homes-and-Home-tp4286341p4286768.html Sent from

Re: Searching Home's, Homes and Home

2016-07-12 Thread kostali hassan
Or you can build a file called synonym.txt in your directory config of your core. Le 11 juil. 2016 17:06, "Surender" a écrit : > Thanks... > > I am applying these filters and will share update on this issue. It will > take couple of days. > > Thanks, > Surender Singh > > > > -- > View this messag

Re: Searching Home's, Homes and Home

2016-07-11 Thread Surender
Thanks... I am applying these filters and will share update on this issue. It will take couple of days. Thanks, Surender Singh -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-Home-s-Homes-and-Home-tp4286341p4286579.html Sent from the Solr - User mailing list arch

Re: Searching Home's, Homes and Home

2016-07-08 Thread Alessandro Benedetti
I second Jamal, using a soft stemmer for your language should solve the problem. Specifically to the english language and the cases you mentioned : 1) Minimal English stemmer should be a good solution [1] 2) The english porter stemmer can be valid for your use case as well [2] 3) Not sure if the

RE: Searching Home's, Homes and Home

2016-07-08 Thread Jamal, Sarfaraz
I would start by looking at the stemming documentation - It might be of help. Sas -Original Message- From: Surender [mailto:surender.si...@rsystems.com] Sent: Friday, July 8, 2016 8:30 AM To: solr-user@lucene.apache.org Subject: Searching Home's, Homes and Home User can type keyword f

Re: Searching inside parent AND children, returning only parents.

2016-07-01 Thread Mikhail Khludnev
Yep, query syntax is a little bit challenging. Here I'm showing query nesting to answer your question and also a better syntax for boolean queries: q={!parent which="parentDoc:true" v=$childq} (+parentDoc:true +) &childq=+parentDoc:false + 30 июня 2016 г. 10:11 пользователь "Marre" написал: > I

Re: searching in two indices

2016-05-31 Thread Bernd Fehling
Hi Mikhail, I will check that out, thanks. Regards, Bernd Am 31.05.2016 um 10:53 schrieb Mikhail Khludnev: > Hello Bernd, > > I recently committed [subquery] document transformer which sounds pretty > much the same. > Find the details at > https://cwiki.apache.org/confluence/display/solr/Transf

Re: searching in two indices

2016-05-31 Thread Mikhail Khludnev
Hello Bernd, I recently committed [subquery] document transformer which sounds pretty much the same. Find the details at https://cwiki.apache.org/confluence/display/solr/Transforming+Result+Documents It's not yet released, thus I appreciate if you take a nightly build from https://builds.apache.or

Re: searching in two indices

2016-05-30 Thread Bernd Fehling
Thanks for sharing your solution and experience. I'm just thinking about to load all article data (100 mio.) and all personal data (4 mio.) into one core with a selector field "db" containing either "article" or "pdata". But still not really satisfied with this solution. Anyway, MySQL is a good h

Re: searching in two indices

2016-05-30 Thread John Blythe
We had previously done something of the sort. With some sources of truth type of cores we would do initial searches on customer transaction data before fetching the related information from those "truth" tables. We would use the various pertinent fields from results #1 to find related data in co

Re: Searching for term sequence including blank character using regex

2016-05-03 Thread Erick Erickson
Not quite sure. How is the field type defined? What is the result of adding &debug=true to the query? Have you looked at the actual terms indexed via the admin UI/schema browser? Have you looked at the admin/analysis page to see how the data is parsed by the fieldType? If this is a tokenized field

Re: Searching special characters

2016-02-12 Thread Erick Erickson
Also look at the admin/analysis page to see the effects of various filters in your analysis chain. It's very likely that the * is not even _in_ the index. Here is a partial list of elements that _may_ be in your analysis chain: https://cwiki.apache.org/confluence/display/solr/Filter+Descriptions a

Re: Searching special characters

2016-02-12 Thread Modassar Ather
These special characters can be removed if at begging or end or can be taken care by the relevant filters depending on the schema defined. E.g "Audit"/*Audit should be searched by query Audit so I see no reason of indexing "/* of the content. You can use PatternReplaceFilter for replacing these spe

Re: Searching special characters

2016-02-12 Thread Anil
Thanks for quick response. Should these be treated differently during index ? I have tried *\"Audit* which is returning results of *Audit *also which is incorrect. what do you say ? On 12 February 2016 at 15:07, Modassar Ather wrote: > You can search them by escaping with backslash. > > Best,

Re: Searching special characters

2016-02-12 Thread Modassar Ather
You can search them by escaping with backslash. Best, Modassar

Re: Searching words with spaces for word without spaces in solr

2014-08-02 Thread Umesh Prasad
But how is your analyzer going to know that "icecream" should > > index as two tokens: "ice" "cream" ? You're asking analysis to do too > much > > in this case. This is where spellcheck can bridge the gap. > > > > Of course, if you have a di

Re: Searching and highlighting ten's of fields

2014-07-31 Thread Manuel Le Normand
Right, it works! I was not aware of this functionality and being able to customize it by hl.requireFieldMatch param. Thanks

Re: Searching words with spaces for word without spaces in solr

2014-07-31 Thread sunshine glass
urse, if you have a discrete list of words you want split like this, > then you can do it with analysis using index-time synonyms. In this case, > you need to provide it with the list. See > https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory > for more informatio

RE: Searching words with spaces for word without spaces in solr

2014-07-31 Thread Dyer, James
Original Message- From: sunshine glass [mailto:sunshineglassof2...@gmail.com] Sent: Thursday, July 31, 2014 10:32 AM To: solr-user@lucene.apache.org Subject: Re: Searching words with spaces for word without spaces in solr I am not clear with this. This link is related to spell check. Can you e

Re: Searching words with spaces for word without spaces in solr

2014-07-31 Thread sunshine glass
glass [mailto:sunshineglassof2...@gmail.com] > Sent: Wednesday, July 30, 2014 9:38 AM > To: solr-user@lucene.apache.org > Subject: Re: Searching words with spaces for word without spaces in solr > > This is the new configuration: > > > positionIncrementGap="100&qu

Re: Searching and highlighting ten's of fields

2014-07-30 Thread Erick Erickson
bq: Is there a way to search the global copyField but highlight the original stored fields? That's what I was suggesting. Specify the global field for your search, but use hl.fl for fields you want to copy. And yes, storing the fields is required for highlighting. Consider stemming (or worse, ph

Re: Searching and highlighting ten's of fields

2014-07-30 Thread Manuel Le Normand
The slowdown occurs during search, not highlighting. Having a disjunctive query with 50 terms running 20 different posting lists is a hard task. Harder than searching these 50 terms on a single (larger) posting list as in the copyField case. With the edismax qf param, sure, hl.fl=* works as it sho

RE: Searching words with spaces for word without spaces in solr

2014-07-30 Thread Dyer, James
l+Checking James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: sunshine glass [mailto:sunshineglassof2...@gmail.com] Sent: Wednesday, July 30, 2014 9:38 AM To: solr-user@lucene.apache.org Subject: Re: Searching words with spaces for word without spaces in solr This

Re: Searching words with spaces for word without spaces in solr

2014-07-30 Thread sunshine glass
This is the analysis page: ​​ ​ Please help me now. On Wed, Jul 30, 2014 at 8:08 PM, sunshine glass < sunshineglassof2...@gmail.com> wrote: > This is the new configuration: > > > positionIncrementGap="100"> >> >> >> >> >> > outputUnigrams="true" tokenSeparat

Re: Searching and highlighting ten's of fields

2014-07-30 Thread Erick Erickson
Doesn't hl.fl work in this case? Or is highlighting the 10 fields the slowdown? Best, Erick On Wed, Jul 30, 2014 at 2:55 AM, Manuel Le Normand < manuel.lenorm...@gmail.com> wrote: > Current I use the classic but I can change my posting format in order to > work with another highlighting compone

Re: Searching words with spaces for word without spaces in solr

2014-07-30 Thread sunshine glass
This is the new configuration: positionIncrementGap="100"> > > > > outputUnigrams="true" tokenSeparator=""/> > generateWordParts="1" generateNumberParts="1" catenateWords="1" > catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/> > >

Re: Searching and highlighting ten's of fields

2014-07-30 Thread Manuel Le Normand
Current I use the classic but I can change my posting format in order to work with another highlighting component if that leads to any solution

Re: Searching and highlighting ten's of fields

2014-07-30 Thread aurelien . mazoyer
Hello, Do you use classic highlighter or fast vector highlighter? Aurélien On 30.07.2014 09:36, Manuel Le Normand wrote: Hello, I need to expose the search and highlighting capabilities over few tens of fields. The edismax's qf param makes it possible but the time performances for searching

Re: Searching for string having apostrophe

2014-06-17 Thread Erick Erickson
I really have to ask why you want to search for apostrophes. Usually these are considered junk characters and are best ignored. Best, Erick On Tue, Jun 17, 2014 at 6:03 AM, Shawn Heisey wrote: > On 6/16/2014 11:34 PM, Gaurav Deshpande wrote: >> I want to perform name searches in Solr on String a

Re: Searching for string having apostrophe

2014-06-17 Thread Shawn Heisey
On 6/16/2014 11:34 PM, Gaurav Deshpande wrote: > I want to perform name searches in Solr on String and text datatypes but > names contain apostrophes in it. > > Is there a way I can escape these apostrophes and perform searches ? > > Using '\' before apostrophe results in forbidden access due to

Re: Searching for string having apostrophe

2014-06-16 Thread Ahmet Arslan
Hi, Apostrophes are not part of the special query parser characters. You don't need to escape it. Can you give some examples ? On Tuesday, June 17, 2014 8:35 AM, Gaurav Deshpande wrote: Hi, I want to perform name searches in Solr on String and text datatypes but names contain apostrophes i

Re: Searching words with spaces for word without spaces in solr

2014-05-30 Thread Erick Erickson
I'd spend some time with the admin/analysis page to understand the exact tokenization going on here. For instance, sequencing the shinglefilterfactory before worddelimiterfilterfactory may produce "interesting" resutls. And then throwing the Snowball factory at it and putting synonyms in front

Re: Searching words with spaces for word without spaces in solr

2014-05-30 Thread sunshine glass
Hi Folks, Any updates ?? On Wed, May 28, 2014 at 12:13 PM, sunshine glass < sunshineglassof2...@gmail.com> wrote: > Dear Team, > > How can I handle compound word searches in solr ?. > How can i search "hand bag" if I have "handbag" in my index. While using > shingle in query analyzer, the query

Re: Searching for tokens does not return any results

2014-05-02 Thread Erick Erickson
t; Have a good weekend. > > Thanks, > Yetkin > > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Friday, May 02, 2014 11:57 AM > To: solr-user@lucene.apache.org > Subject: Re: Searching for tokens does not return any results > > bq:

RE: Searching for tokens does not return any results

2014-05-02 Thread Yetkin Ozkucur
good weekend. Thanks, Yetkin -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Friday, May 02, 2014 11:57 AM To: solr-user@lucene.apache.org Subject: Re: Searching for tokens does not return any results bq: but this index was created using a Java program

Re: Searching for tokens does not return any results

2014-05-02 Thread Erick Erickson
bq: but this index was created using a Java program using Lucene interface Elaborating a bit on Koji's comment... The fact that you used Lucene to index the doc means that the analysis page is almost, but not quite entirely, useless on the indexing side. It's looking at your field definition in

Re: Searching for tokens does not return any results

2014-05-01 Thread Koji Sekiguchi
Hi Yetkin, welcome! I think StandardAnalyzer of Lucene is the problem you are facing. Why don't you have another field using StandardAnalyzer and see how it tokenizes CRD_PROD on Solr admin GUI? I forgot in the detail but we can use Lucene's Analyzer in schema.xml something like this:

Re: Searching for tokens does not return any results

2014-05-01 Thread Ahmet Arslan
Hi Yetkin, You are on the right track by examining analysis page. How is your query analyzed using query analyzer? According to what you pasted q=CRD should return your example document. Did you change something in schema.xml and forget to re-start solr and  re-index? By the way simple letter

Re: Searching multivalue fields.

2014-04-08 Thread Vijay Kokatnur
Since Span is the only way to solve the problem, I won't mind re-indexing. It's just that I have never done it before. We've got 80G of indexed data replicated on two nodes in a cluster. Is there a preferred way to go about re-indexing? On Tue, Apr 8, 2014 at 12:17 AM, Ahmet Arslan wrote: >

Re: Searching multivalue fields.

2014-04-08 Thread Ahmet Arslan
Hi, Changing value of omitTermFreqAndPositions requires re-indexing, unfortunately. And I remembered that you don't want to reindex. It looks like we are out of options. Ahmet On Tuesday, April 8, 2014 12:45 AM, Vijay Kokatnur wrote: Yes I did restart solr, but did not re-index.  Is that

Re: Searching multivalue fields.

2014-04-07 Thread Vijay Kokatnur
Yes I did restart solr, but did not re-index. Is that necessary? We've got 80G of indexed data, is there a "preferred" way of doing it without impacting performance? On Sat, Apr 5, 2014 at 9:44 AM, Ahmet Arslan wrote: > Hi, > > Did restart solr and you re-index after schema change? >On Sa

  1   2   3   4   5   6   >