Re: Indexing using CSV

2016-03-21 Thread Zheng Lin Edwin Yeo
Are you using post.jar or other methods of indexing the CSV file? You have to ensure that the number of fields in your CSV file is the same as the number of fields in Solr. Also, each record in the CSV file must be on a separate line, and each record must have the same number of fields, with each field sep
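
A minimal sketch of the consistency check described above, assuming a comma-delimited file whose first line is a header. The function name `inconsistent_rows` and the sample data are hypothetical and only illustrate the idea of verifying that every record has the same number of fields before the file is posted to Solr.

```python
import csv
import io


def inconsistent_rows(csv_text, delimiter=","):
    """Return (line_number, field_count) for each record whose field
    count differs from the header's field count."""
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    header = next(reader)          # first record is assumed to be the header
    expected = len(header)
    bad = []
    for row in reader:
        if len(row) != expected:
            # reader.line_num is the number of source lines read so far
            bad.append((reader.line_num, len(row)))
    return bad


# Hypothetical sample: the third line is missing its "price" field.
sample = "id,name,price\n1,widget,9.99\n2,gadget\n3,gizmo,4.50\n"
print(inconsistent_rows(sample))
```

An empty result means every record matches the header; anything else lists the lines worth inspecting before indexing.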

Re: How fast indexing?

2016-03-21 Thread Shawn Heisey
On 3/21/2016 7:48 PM, Amit Jha wrote: > When I run the same sql on DB it takes only 1 sec. And 6-7 documents are > getting indexed per second. That's really slow. It seems likely that you are having extreme performance issues due to garbage collection problems, possibly from a heap that needs t

Re: How fast indexing?

2016-03-21 Thread Amit Jha
When I run the same SQL on the DB it takes only 1 sec. And 6-7 documents are getting indexed per second. As I have a 4-node SolrCloud setup, can I run 4 import handlers to index the same data? Will it not overwrite? 10-20k is very high in numbers; where can I get the actual size of a document? Rgds AJ

Re: How fast indexing?

2016-03-21 Thread Amit Jha
Yes, I do have multiple nodes in my SolrCloud setup. Rgds AJ > On 21-Mar-2016, at 22:20, fabigol wrote: > > Amit Jha, > do you have several Solr servers with SolrCloud? > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265122.ht

Re: PERFORMANCE WARNING: Overlapping onDeckSearchers=2

2016-03-21 Thread Shawn Heisey
On 3/21/2016 6:49 PM, Aswath Srinivasan (TMS) wrote: >>> Thank you for the responses. Collection crashes as in, I'm unable to open >>> the core tab in Solr console. Search is not returning. None of the page >>> opens in solr admin dashboard. >>> >>> I do understand how and why this issue occurs a

RE: PERFORMANCE WARNING: Overlapping onDeckSearchers=2

2016-03-21 Thread Aswath Srinivasan (TMS)
>>The only way that I can imagine any part of Solr *crashing* when this message >>happens is if you are also hitting an OutOfMemoryError exception. You've said that your collection crashes ... but not what actually happens -- what "crash" means for your situation. I've never heard of a colle

Re: How fast indexing?

2016-03-21 Thread Shawn Heisey
On 3/20/2016 6:11 PM, Amit Jha wrote: > In my case I am using DIH to index the data and Query is having 2 join > statements. To index 70K documents it is taking 3-4Hours. Document size would > be around 10-20KB. DB is MSSQL and using solr4.2.10 in cloud mode. My source data is in a MySQL databas

Re: PERFORMANCE WARNING: Overlapping onDeckSearchers=2

2016-03-21 Thread Shawn Heisey
On 3/21/2016 12:52 PM, Aswath Srinivasan (TMS) wrote: > Fellow developers, > > PERFORMANCE WARNING: Overlapping onDeckSearchers=2 > > I'm seeing this warning often and whenever I see this, the collection > crashes. The only way to overcome this is by deleting the data folder and > reindexing. > >

Suspicious message with attachment

2016-03-21 Thread help
The following message addressed to you was quarantined because it likely contains a virus: Subject: RE: PERFORMANCE WARNING: Overlapping onDeckSearchers=2 From: "Aswath Srinivasan (TMS)" However, if you know the sender and are expecting an attachment, please reply to this message, and we will

RE: PERFORMANCE WARNING: Overlapping onDeckSearchers=2

2016-03-21 Thread Aswath Srinivasan (TMS)
If you're seeing a crash, then that's a distinct problem from the WARN -- it might be related to the warning, but it's not identical -- Solr doesn't always (or even normally) crash in the "Overlapping onDeckSearchers" situation. That is what I hoped for. But I could see nothing else in the log. Al

RE: PERFORMANCE WARNING: Overlapping onDeckSearchers=2

2016-03-21 Thread Chris Hostetter
: What I'm wondering is, what should one do to fix this issue when it : happens. Is there a way to recover? after the WARN appears. It's just a warning that you have a sub-optimal situation from a performance standpoint -- either committing too fast, or warming too much. It's not a failure, a

RE: PERFORMANCE WARNING: Overlapping onDeckSearchers=2

2016-03-21 Thread Aswath Srinivasan (TMS)
Please note that I'm not looking to find ways to avoid this issue. There are a lot of internet articles on this topic. What I'm wondering is, what should one do to fix this issue when it happens? Is there a way to recover after the WARN appears? Thank you, Aswath NS -Original Message- F

RE: Explain score is different from score

2016-03-21 Thread Rick Sullivan
I haven't checked this thread since Friday, but here are my responses to the questions that have come up. 1. How is ranking affected? Some documents have their scores divided by an integer value in the response documents. 2. Do you see the proper ranking in the explain section? Yes, the expla

Re: Save Number of words in field

2016-03-21 Thread Jack Krupansky
You can write an Update Request Processor that would count the words in the source value for a specified field and generate that count as an integer value for another field. My old Solr 4.x Deep Dive book has an example that uses a sequence (chain) of existing update processors to count words in a
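
A minimal client-side sketch of the same idea, assuming the word count is computed before the document is sent to Solr rather than inside an actual Update Request Processor. The field names `sentence` and `word_count_i` and the function `add_word_count` are hypothetical; words are taken to be runs of word characters, so a contraction such as "don't" counts as two.

```python
import re

# Assumption: a "word" is a maximal run of letters, digits, or underscores.
WORD_RE = re.compile(r"\w+")


def add_word_count(doc, source_field="sentence", count_field="word_count_i"):
    """Return a copy of doc with count_field set to the number of
    words found in source_field (0 if the field is missing or empty)."""
    out = dict(doc)
    text = out.get(source_field) or ""
    out[count_field] = len(WORD_RE.findall(text))
    return out


doc = add_word_count({"id": "1", "sentence": "Solr counts words at index time"})
print(doc["word_count_i"])
```

Storing the count as an integer field lets later queries filter or match on it directly, which is the use described in the original question.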

Re: Seasonal searches in SOLR 5.x

2016-03-21 Thread Jack Krupansky
You can write an Update Request Processor which takes a pair of date field values and creates a season code value for a separate field, which could be multivalued for date ranges spanning seasons. Similarly you could have another generated multivalued field which listed the months when the data was
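
A minimal sketch of the derivation described above, done client-side in Python rather than in an actual Update Request Processor. It assumes meteorological seasons for the northern hemisphere (December through February is winter, and so on); the function names and the season labels are hypothetical.

```python
from datetime import date

# Assumption: meteorological seasons, northern hemisphere.
SEASON_BY_MONTH = {
    12: "winter", 1: "winter", 2: "winter",
    3: "spring", 4: "spring", 5: "spring",
    6: "summer", 7: "summer", 8: "summer",
    9: "autumn", 10: "autumn", 11: "autumn",
}


def months_in_range(start, end):
    """Yield the month number of every calendar month touched by
    the inclusive range [start, end], in chronological order."""
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        yield month
        month += 1
        if month > 12:
            month = 1
            year += 1


def season_codes(start, end):
    """Return the distinct seasons covered by the range, in order of
    first appearance (a multivalued field value)."""
    seen = []
    for month in months_in_range(start, end):
        season = SEASON_BY_MONTH[month]
        if season not in seen:
            seen.append(season)
    return seen


def month_codes(start, end):
    """Return the sorted distinct month numbers covered by the range."""
    return sorted(set(months_in_range(start, end)))


print(season_codes(date(2015, 11, 20), date(2016, 3, 5)))
print(month_codes(date(2015, 11, 20), date(2016, 3, 5)))
```

Both results would be indexed as multivalued fields alongside the original date range, so a seasonal search becomes a plain term query on the generated field.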

PERFORMANCE WARNING: Overlapping onDeckSearchers=2

2016-03-21 Thread Aswath Srinivasan (TMS)
Fellow developers, PERFORMANCE WARNING: Overlapping onDeckSearchers=2 I'm seeing this warning often and whenever I see this, the collection crashes. The only way to overcome this is by deleting the data folder and reindexing. In my observation, this WARN comes when I hit frequent hard commits o

date range faceting on the whole dataset

2016-03-21 Thread Alisa Z .
Hello, Is it possible to perform date range faceting on the whole dataset without indicating facet.range.start and facet.range.end? What if I have no clue about when my data starts and when it ends (might be some point in the future)? A sample query: http://localhost:8983/solr/enron-path/

Solr 3.6 Issues

2016-03-21 Thread Shailendra Tiwari
Hi All, I am new to Solr, and started to look at our existing Solr implementation. We have a Solr 3.6 implementation with an index size of 54 GB with a configuration of 2 Master/3 Slaves. Each machine has 32 GB of RAM. Avg. size of document is 100KB, and avg number of documents being inde

Re: SolrCloud - Fails to delete documents when some shard is down

2016-03-21 Thread Renaud Delbru
On 21/03/16 14:43, Erick Erickson wrote: Hmmm, you say "where I have many shards and can't have one problem causing no deletion of old data.". You then have a shard that, when it comes back up still has all the old data and that _is_ acceptable? Seems like that would be jarring to the users whe

Re[2]: [nested] how to specify a path for multiple nesting?

2016-03-21 Thread Alisa Z .
Thanks, Mikhail. I eventually added a distinguishing field "path" and queried unambiguously. >Thursday, 17 March 2016, 9:46 -04:00 from Mikhail Khludnev >: > >Hello, > >Please find inline > >On Wed, Mar 16, 2016 at 10:10 PM, Alisa Z. < prol...@mail.ru > wrote: >> Hi all, >>I have a deeply mult

Seasonal searches in SOLR 5.x

2016-03-21 Thread Ioannis Kirmitzoglou
Hi all, I would like to implement seasonal date searches on date ranges. I’m using SOLR 5.4.1 and have indexed date ranges using a DateRangeField (let’s call this field date_ranges). Each document in SOLR corresponds to a biological sample and each sample was collected during a date range that

Re: Save Number of words in field

2016-03-21 Thread Ahmet Arslan
Hi Rajesh, The number of words are already stored (docValues) in the index. If you don't use index time boost, length of the field can be restored (with some precision loss) from field length norm. http://lucene.apache.org/core/5_5_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.htm

Re: BCE dates on solr TrieDateField

2016-03-21 Thread Chris Hostetter
BCE dates have historically been problematic because of ambiguity in both the ISO format that we use for representing dates as well as the internal java representation, more details... https://issues.apache.org/jira/browse/SOLR-1899 ..the best work around I can suggest is to use simple numeri

Re: How fast indexing?

2016-03-21 Thread fabigol
Erick, in fact, I looked at the CPU percentage. It changed often, but sometimes it was very low (<10%) while memory usage was heavy. I think there was a problem, so I stopped the indexing. I am not indexing documents but data from a Postgres database. If I decrease the number of fields, will I save time? is it nor

BCE dates on solr TrieDateField

2016-03-21 Thread jude mwenda
Hey, I hope this email finds you well. I have a solr.TrieDateField and I am trying to send -ve dates to this field. Does the TrieDateField allow for -ve dates? when I push the date -1600-01-10 to solr i get 1600-01-10 as the date registered. Please advise. -- Regards, Jude Mwenda

Re: How fast indexing?

2016-03-21 Thread fabigol
Amit Jha, do you have several Solr servers with SolrCloud? -- View this message in context: http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265122.html Sent from the Solr - User mailing list archive at Nabble.com.

Save Number of words in field

2016-03-21 Thread G, Rajesh
Hi, When indexing sentences I want to store the number of words in the sentence in a field that I can use with other queries later for word count matching. Please let me know whether it is possible. Thanks Rajesh Corporate Executive Board India Private Limited. Registration No: U741040HR2004

solr

2016-03-21 Thread jude mwenda
Hey, I hope this email finds you well. I have a solr.TrieDateField and I am trying to send -ve dates to this field. Does the TrieDateField allow for -ve dates? When I push the date -1600-01-10 to Solr I get 1600-01-10 as the date registered. Please advise. -- Regards, Jude Mwenda

Re: Indexing using CSV

2016-03-21 Thread Paul Hoffman
On Sun, Mar 20, 2016 at 06:11:32PM -0700, Jay Potharaju wrote: > Hi, > I am trying to index some data using csv files. The data contains > description column, which can include quotes, comma, LF/CR & other special > characters. > > I have it working but run into an issue with the following error >
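
A minimal sketch of producing a CSV file whose description column contains quotes, commas, and line breaks, as in the question quoted above. It assumes the consumer accepts RFC 4180-style quoting, where fields are wrapped in double quotes and embedded quotes are doubled; whether that resolves the specific error in this thread is not known from the excerpt. The function `to_csv` and the sample row are hypothetical.

```python
import csv
import io


def to_csv(rows, fieldnames):
    """Serialize rows (a list of dicts) to CSV text, quoting every field
    so embedded commas, quotes, and newlines survive a round trip."""
    buf = io.StringIO(newline="")   # newline="" leaves line endings untranslated
    writer = csv.DictWriter(buf, fieldnames=fieldnames, quoting=csv.QUOTE_ALL)
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Hypothetical record with a quote, a comma, and an embedded line break.
rows = [{"id": "1", "description": 'He said "hi", then left.\nSecond line'}]
text = to_csv(rows, ["id", "description"])

# Round-trip check: parsing the text gives back the original values.
parsed = list(csv.DictReader(io.StringIO(text, newline="")))
print(parsed == rows)
```

Quoting every field is more verbose than quoting only where needed, but it makes the output easy to inspect when a parser rejects a particular record.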

Re: Deploy solr on glassfish

2016-03-21 Thread Daniel Collins
You have already asked this question, and there is an ongoing thread on it. To quote the previous thread, Solr is no longer a webapp that can be deployed on any servlet container, it is now a black-box application, so you should just deploy Solr as it is, and then connect to it yourself, which y

Re: Issue Running Solr

2016-03-21 Thread Erick Erickson
What does the Solr log say? That usually gives you a better idea of what the root cause is, the script really doesn't have access to the root cause. Best, Erick On Mon, Mar 21, 2016 at 5:54 AM, Salman Ansari wrote: > Hi, > > I am facing an issue in running Solr server. I tried different approac

Re: SolrCloud - Fails to delete documents when some shard is down

2016-03-21 Thread Erick Erickson
Hmmm, you say "where I have many shards and can't have one problem causing no deletion of old data.". You then have a shard that, when it comes back up still has all the old data and that _is_ acceptable? Seems like that would be jarring to the users when some portion of the docs in their collect

NPE from UpdateLog.lookup(UpdateLog.java:706) (Version 4.8.1)

2016-03-21 Thread Shay Sofer
Hi all, During daily work I got an NPE at the mentioned line; the full method is below. Does anyone know if it's a known bug? Should I open a ticket for Solr? Thanks in advance, Shay. public Object lookup(BytesRef indexedId) { 683 LogPtr entry; 684 TransactionLog lookupLog; 685 686 synchroniz

Re: Explain score is different from score

2016-03-21 Thread Ahmet Arslan
Hi Alessandro, The OP has different rankings: fl=score and explain's score would retrieve different orders. I wrote test cases using ClassicSimilarity, but it won't reproduce. This is really weird. I wonder what is triggering this. aHmet On Monday, March 21, 2016 2:08 PM, Alessandro Bened

SolrCloud - Fails to delete documents when some shard is down

2016-03-21 Thread Tali Finelt
Hi, I am using Solr 4.10.2. When one of the shards in my environment is down and fails to recover - The process of deleting documents from other shards fails as well. For example, When running: https://:8983/solr//update?stream.body= *:*&commit=true I get the following error message: No regis

RE: Explain score is different from score

2016-03-21 Thread G, Rajesh
Please find my answer inline

Issue Running Solr

2016-03-21 Thread Salman Ansari
Hi, I am facing an issue in running Solr server. I tried different approaches and still receive the following error "ERROR: Solr at http://localhost:8983/solr did not come online within 30 seconds" I tried running the following commands 1) solr -e cloud 2) solr.cmd start -cloud -p 8983 -s "C:\S

Re: Explain score is different from score

2016-03-21 Thread Alessandro Benedetti
I would like to add a question: how is the ranking affected? Do you see the proper ranking in the explain section? And what about the results? Are they ranked according to the correct score, or are they ranked by the wrong score? I got a similar issue, which I am not able to reproduce yet, but i

Deploy solr on glassfish

2016-03-21 Thread Adel Mohamed Khalifa
Hello All, I have installed solr-5.3.0 and created my core, but when I try to deploy it on GlassFish following the steps in "https://wiki.apache.org/solr/SolrGlassfish" it recommends a solr.war file, which I cannot find in my opt/solr/dist. Where can I find it? Regards, Adel K

Re: Boosts for relevancy (shopping products)

2016-03-21 Thread Alessandro Benedetti
Mmm maybe I didn't explain properly, all the fields you have in the index for the products could be used to design features. Of course my list was an example, but when processing clicks you should first take into consideration all the features you can extract that should affect your ranking algorith

Re: How fast indexing?

2016-03-21 Thread fabigol
For the indexation i use DIH. I find this link for solar Indexation. With solar it is more quick? -- View this message in context: http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265050.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How fast indexing?

2016-03-21 Thread fabigol
For the JVM I have 8 GB. -- View this message in context: http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265047.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How fast indexing?

2016-03-21 Thread fabigol
Thanks for your response. Solr runs on a server with 6 CPUs and 10 GB of RAM. We receive data all the time. Now we index 2 times per day. The database contains 5 tables (18k rows, three with 300k rows, and one with 6 million rows). The indexing took 6 hours. I didn't modify the original so

RE: Explain score is different from score

2016-03-21 Thread G, Rajesh
Hi Ahmet, I am using solr 5.5.0. I am running single instance with single core. No shards I have added to my schema as suggested by Rick Sullivan. Now the scores are same between explain and score field. But instead of previous results "Lync - Microsoft Office 365" and "Microsoft Office 365"

Solr 5.3 certificate client authentication

2016-03-21 Thread Abdul Rahim, Muzammil (Nokia - IN/Bangalore)
Hi, My basic requirement is to secure the solr admin page. I thought of securing it using certificate authentication. So I followed the below link https://cwiki.apache.org/confluence/display/solr/Enabling+SSL With the help of above link I was able to secure my admin page but I am using tomcat