RE: How can i multiply documents after DIH?
Why the fck is the FW -Original Message- From: Harshvardhan Ojha [mailto:harshvardhan.o...@makemytrip.com] Sent: 08 January 2013 16:51 To: solr-user@lucene.apache.org Subject: FW: How can i multiply documents after DIH? All, Looking into a finding solution for Hotel searches based on the below criteria's 1.City/Hotel 2.Data Range 3.Persons We have created documents which contains all the basic needed information inclusive of per day rates. The document looks like the below = SHL 2013-01-06T18:30:00Z 2013-01-06T18:30:00Z 2008090516 2400.0 600.0 1423509483572690944 = My search requirement is like q=city AND startdate:[2013-01-06 TO 2013-01-08] or q=id: 2008090516 AND startdate:[2013-01-06 TO 2013-01-08] and this combination for dates can be anything from daterange:[x TO y]. I have close to a 100K combinations to start with based on city,date-ranges,number of nights(days of stay) . I am looking at options to create search responses or even using this set of documents as an input source for them e.g: Running some Map-Reduce jobs to get all the 100K search responses and putting into the store or cache. Looking for suggestions cum options. Regards Harshvardhan Ojha
RE: How can i multiply documents after DIH?
Apologies folks was an mistake. -Original Message- From: Shubham Srivastava [mailto:shubham.srivast...@makemytrip.com] Sent: 08 January 2013 16:58 To: solr-user@lucene.apache.org Subject: RE: How can i multiply documents after DIH? Why the fck is the FW -Original Message- From: Harshvardhan Ojha [mailto:harshvardhan.o...@makemytrip.com] Sent: 08 January 2013 16:51 To: solr-user@lucene.apache.org Subject: FW: How can i multiply documents after DIH? All, Looking into a finding solution for Hotel searches based on the below criteria's 1.City/Hotel 2.Data Range 3.Persons We have created documents which contains all the basic needed information inclusive of per day rates. The document looks like the below = SHL 2013-01-06T18:30:00Z 2013-01-06T18:30:00Z 2008090516 2400.0 600.0 1423509483572690944 = My search requirement is like q=city AND startdate:[2013-01-06 TO 2013-01-08] or q=id: 2008090516 AND startdate:[2013-01-06 TO 2013-01-08] and this combination for dates can be anything from daterange:[x TO y]. I have close to a 100K combinations to start with based on city,date-ranges,number of nights(days of stay) . I am looking at options to create search responses or even using this set of documents as an input source for them e.g: Running some Map-Reduce jobs to get all the 100K search responses and putting into the store or cache. Looking for suggestions cum options. Regards Harshvardhan Ojha
Large numbers of executeWithRetry INFO messages
Hi, I am getting the below log's Apr 5, 2012 6:27:59 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: I/O exception (org.apache.commons.httpclient.NoHttpResponseException) caught when processing request: The server 192.168.6.135 failed to respond Apr 5, 2012 6:27:59 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: Retrying request Apr 5, 2012 6:28:39 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: I/O exception (org.apache.commons.httpclient.NoHttpResponseException) caught when processing request: The server 192.168.6.135 failed to respond Apr 5, 2012 6:28:39 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: Retrying request Apr 5, 2012 6:30:39 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: I/O exception (org.apache.commons.httpclient.NoHttpResponseException) caught when processing request: The server 192.168.6.135 failed to respond Apr 5, 2012 6:30:39 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: Retrying request Apr 5, 2012 6:31:59 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: I/O exception (org.apache.commons.httpclient.NoHttpResponseException) caught when processing request: The server 192.168.6.135 failed to respond Apr 5, 2012 6:31:59 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: Retrying request Apr 5, 2012 6:32:59 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry INFO: I/O exception (org.apache.commons.httpclient.NoHttpResponseException) caught when processing request: The server 192.168.6.135 failed to respond Apr 5, 2012 6:32:59 PM org.apache.commons.httpclient.HttpMethodDirector executeWithRetry every now and then and on every slave randomly. However I haven't seen any issues with replication of Master-Slave as such , validated with Index Version and Generated numbers as well as the data. I am using solr3.5 with 5Slaves + 1Master. Polling interval being 20seconds and docs are updated(delta-import) every 60 seconds through Master.Slaves only are for read. I am running solr with tomacat 6.0.35 and below is the connection settings Heap size is 1Gb( Xms=Xmx=1024m). Any pointers what could be wrong. Regards, Shubham
Searching on fields with White Spaces
I have a custom fieldtype with the below config I have an Autocomplete configured on the same field which gives me result as expected. A new use case is to search kualalumpur or say newyork with out spaces returning Kuala Lumpur and New York which happen to be the original values. What should be the recommended solution. Regards, Shubham
Re: Index Corruption
I am using 3.5 . - Original Message - From: Lance Norskog [mailto:goks...@gmail.com] Sent: Monday, May 14, 2012 11:08 AM To: solr-user@lucene.apache.org Subject: Re: Index Corruption "Index corruption" usually means data structure problems. There is a Lucene program 'org.apache.lucene.index.CheckIndex' in the lucene core jar. If there is a problem with the data structures, this program will find it: java -cp lucene-core-XX.jar org.apache.lucene.index.CheckIndex /index/data Do you use Solr 3.1, 3.2 or 3.3? There was an index flushing bug in this series of Solr releases. Solr 3.4, 3.5 and 3.6 don't have the problem, and the trunk never had the problem. You should not (to my knowledge) ever have duplicated documents if there is a crash while indexing. If this happens, but there is no Lucene index corruption, please file a bug. Lance On Sun, May 13, 2012 at 1:51 PM, shubham wrote: > We have a problem in last couple of days when a particular Solr master was > restarted while there was an import running . This led to the corruption of > some document entities where they had multiple doc's of same unique id etc. > > Is this kind of corruption possible , by now I expected that Solr indexing > works a way where either the data is completely imported/updated or nothing > has changed, But with this there exists a third possibility which is pretty > risky. Apart from writing queries to generate alerts when some kind of > corruption occurs is there a recommended way to do the same. However why the > same happened in terms of corruption still bother's me. > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Index-Corruption-tp3983579.html > Sent from the Solr - User mailing list archive at Nabble.com. -- Lance Norskog goks...@gmail.com
RE: Problem with sorting solr docs
Just adding to the below--> If there is a field(say X) which is not populated and in the query I am not sorting on this particular field but on another field (say Y) still the result ordering would depend on X . Infact in the below problem mentioned by Harsh making X as sortMissingLast="false" sortMissingFirst="false" solved the problem while in the query he was sorting on Y. This seems a bit illogical. Regards, Shubham From: Harshvardhan Ojha [harshvardhan.o...@makemytrip.com] Sent: Tuesday, July 03, 2012 5:58 PM To: solr-user@lucene.apache.org Subject: RE: Problem with sorting solr docs Hi, I have added to my schema.xml, although I am searching on name field. It seems to be working fine. What is its default behavior? Regards Harshvardhan Ojha -Original Message- From: Rafał Kuć [mailto:r@solr.pl] Sent: Tuesday, July 03, 2012 5:35 PM To: solr-user@lucene.apache.org Subject: Re: Problem with sorting solr docs Hello! But the latlng field is not taken into account when sorting with sort defined such as in your query. You only sort on the name field and only that field. You can also define Solr behavior when there is no value in the field, but adding sortMissingLast="true" or sortMissingFirst="true" to your type definition in the schema.xml file. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch > Hi, > Thanks for reply. > I want to sort my docs on name field, it is working well only if I have all > fields populated well. > But my latlng field is optional, every doc will not have this value. > So those docs are not getting sorted. > Regards > Harshvardhan Ojha > -Original Message- > From: Rafał Kuć [mailto:r@solr.pl] > Sent: Tuesday, July 03, 2012 5:24 PM > To: solr-user@lucene.apache.org > Subject: Re: Problem with sorting solr docs > Hello! > Your query suggests that you are sorting on the 'name' field instead > of the latlng field (sort=name +asc). > The question is what you are trying to achieve ? Do you want to sort > your documents from a given geographical point ? If that's the case > you may want to look here: > http://wiki.apache.org/solr/SpatialSearch/ > and look at the possibility of sorting on the distance from a given point. > -- > Regards, > Rafał Kuć > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - > ElasticSearch > Hi, > > I have 260 docs which I want to sort on a single field latlng. > > 1 > Amphoe Khanom > 1.0,1.0 > > > My query is : > http://localhost:8080/solr/select?q=*:*&sort=name +asc > > This query sorts all documents except those which doesn’t have latlng, > and I can’t keep any default value for this field. > My question is how can I sort all docs on latlng? > > Regards > Harshvardhan Ojha | Software Developer - Technology Development > | MakeMyTrip.com, 243 SP Infocity, Udyog Vihar Phase 1, Gurgaon, > Haryana - 122 016, India > What's new?: Inspire - Discover an inspiring new way to plan and book travel > online. > Office Map > Facebook > Twitter >
HASH range calculation
The contents of this email, including the attachments, are PRIVILEGED AND CONFIDENTIAL to the intended recipient at the email address to which it has been addressed. If you receive it in error, please notify the sender immediately by return email and then permanently delete it from your system. The unauthorized use, distribution, copying or alteration of this email, including the attachments, is strictly forbidden. Please note that neither HotelTravel nor the sender accepts any responsibility for viruses and it is your responsibility to scan the email and attachments (if any). No contracts may be concluded on behalf of HotelTravel by means of email communications.