Are you using post.jar or other methods of indexing the CSV file?
You have to ensure that the number of fields in your CSV file are the same
as the fields in Solr.
Also, each records in the CSV file must be on separate lines, and each
record must have the same number of fields, with each field sep
On 3/21/2016 7:48 PM, Amit Jha wrote:
> When I run the same sql on DB it takes only 1 sec. And 6-7 documents are
> getting indexed per second.
That's really slow. It seems likely that you are having extreme
performance issues due to garbage collection problems, possibly from a
heap that needs t
When I run the same sql on DB it takes only 1 sec. And 6-7 documents are
getting indexed per second.
As I've 4 node solrCloud setup, can I run 4 import handler to index the same
data? Will it not over write?
10-20k is very high in numbers, where can I get the actual size of document.
Rgds
AJ
Yes, I do have multiple modes in my solr cloud setup.
Rgds
AJ
> On 21-Mar-2016, at 22:20, fabigol wrote:
>
> Amit Jha,
> do you have several sold server with solr cloud?
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265122.ht
On 3/21/2016 6:49 PM, Aswath Srinivasan (TMS) wrote:
>>> Thank you for the responses. Collection crashes as in, I'm unable to open
>>> the core tab in Solr console. Search is not returning. None of the page
>>> opens in solr admin dashboard.
>>>
>>> I do understand how and why this issue occurs a
>>The only way that I can imagine any part of Solr *crashing* when this message
>>happens is if you are also hitting an OutOfMemoryError
exception. You've said that your collection crashes ... but not what
actually happens -- what "crash" means for your situation. I've never heard of
a colle
On 3/20/2016 6:11 PM, Amit Jha wrote:
> In my case I am using DIH to index the data and Query is having 2 join
> statements. To index 70K documents it is taking 3-4Hours. Document size would
> be around 10-20KB. DB is MSSQL and using solr4.2.10 in cloud mode.
My source data is in a MySQL databas
On 3/21/2016 12:52 PM, Aswath Srinivasan (TMS) wrote:
> Fellow developers,
>
> PERFORMANCE WARNING: Overlapping onDeckSearchers=2
>
> I'm seeing this warning often and whenever I see this, the collection
> crashes. The only way to overcome this is by deleting the data folder and
> reindexing.
>
>
The following message addressed to you was quarantined because it likely
contains a virus:
Subject: RE: PERFORMANCE WARNING: Overlapping onDeckSearchers=2
From: "Aswath Srinivasan (TMS)"
However, if you know the sender and are expecting an attachment, please reply
to this message, and we will
If you're seeing a crash, then that's a distinct problem from the WARN -- it
might be related tothe warning, but it's not identical -- Solr doesn't always
(or even normally) crash in the "Overlapping onDeckSearchers"
situation
That is what I hoped for. But I could see nothing else in the log. Al
: What I'm wondering is, what should one do to fix this issue when it
: happens. Is there a way to recover? after the WARN appears.
It's just a warning that you have a sub-optimal situation from a
performance standpoint -- either committing too fast, or warming too much.
It's not a failure, a
Please note that I'm not looking to find ways to avoid this issue. There are
lot of internet articles on this topic.
What I'm wondering is, what should one do to fix this issue when it happens. Is
there a way to recover? after the WARN appears.
Thank you,
Aswath NS
-Original Message-
F
I haven't checked this thread since Friday, but here are my responses to the
questions that have come up.
1. How is ranking affected?
Some documents have their scores divided by an integer value in the response
documents.
2. Do you see the proper ranking in the explain section?
Yes, the expla
You can write an Update Request Processor that would count the words in the
source value for a specified field and generate that count as an integer
value for another field.
My old Solr 4.x Deep Dive book has an example that uses a sequence (chain)
of existing update processors to count words in a
You can write an Update Request Processor which takes a pair of date field
value and creates a season code value for a separate field, which could be
multivalued for date ranges spanning seasons. Similarly you could have
another generated multivalued field which listed the months when the data
was
Fellow developers,
PERFORMANCE WARNING: Overlapping onDeckSearchers=2
I'm seeing this warning often and whenever I see this, the collection crashes.
The only way to overcome this is by deleting the data folder and reindexing.
In my observation, this WARN comes when I hit frequent hard commits o
Hello,
Is it possible to perform date range faceting on the whole dataset without
indicating facet.range.start and facet.range.end?
What if I have no clue about when my data starts and when it ends (might be
some point in the future)?
A sample query:
http://localhost:8983/solr/enron-path/
Hi All,
I am new to Solr, and started to look at our existing Solr implementation.
We are
We have a Solr 3.6 implementation with an index size of 54 GB with a
configuration of 2 Master/3 Slaves. Each Machine has a RAM of 32 GB. Avg.
Size of document is 100KB, and avg number of documents being inde
On 21/03/16 14:43, Erick Erickson wrote:
Hmmm, you say "where I have many shards and
can't have one problem causing no deletion of old data.".
You then have a shard that, when it comes back up still
has all the old data and that _is_ acceptable? Seems like
that would be jarring to the users whe
Thanks, Mikhail.
I eventually added a distinguishing field "path" and queried unambiguously.
>Четверг, 17 марта 2016, 9:46 -04:00 от Mikhail Khludnev
>:
>
>Hello,
>
>Please find inline
>
>On Wed, Mar 16, 2016 at 10:10 PM, Alisa Z. < prol...@mail.ru > wrote:
>> Hi all,
>>I have a deeply mult
Hi all,
I would like to implement seasonal date searches on date ranges. I’m using SOLR
5.4.1 and have indexed date ranges using a DateRangeField (let’s call this
field date_ranges).
Each document in SOLR corresponds to a biological sample and each sample was
collected during a date range that
Hi Rajesh,
The number of words are already stored (docValues) in the index.
If you don't use index time boost, length of the field can be restored (with
some precision loss) from field length norm.
http://lucene.apache.org/core/5_5_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.htm
BCE dates have historically been problematic because of ambiguity in both
the ISO format that we use for representing dates as well as the internal
java representation, more details...
https://issues.apache.org/jira/browse/SOLR-1899
..the best work around I can suggest is to use simple numeri
Erick,
in fact, i looked % of cpu. The % often changed but sometime it was very low
(<10%) but the memory was heavy. I think what there was a problem and i cut
the indexation.
I don't index documents but some data of postgres database.
If i decrease the number of fields, do you win time?
is it nor
Hey,
I hope this email finds you well. I have a solr.TrieDateField and I am
trying to send -ve dates to this field. Does the TrieDateField allow for
-ve dates? when I push the date -1600-01-10 to solr i get 1600-01-10 as the
date registered. Please advise.
--
Regards,
Jude Mwenda
Amit Jha,
do you have several sold server with solr cloud?
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265122.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi,
When indexing sentences I want to store the number of words in the sentence in
a fields that I can use to with other query later for word count match. Please
let me know whether it is possible?
Thanks
Rajesh
Corporate Executive Board India Private Limited. Registration No:
U741040HR2004
Hey,
I hope this email finds you well. I have a solr.TrieDateField and I am
trying to send -ve dates to this fields. Does the TrieDateField allow for
-ve dates? when I push the date -1600-01-10 to solr i get 1600-01-10 as the
date registered. Please advise.
--
Regards,
Jude Mwenda
On Sun, Mar 20, 2016 at 06:11:32PM -0700, Jay Potharaju wrote:
> Hi,
> I am trying to index some data using csv files. The data contains
> description column, which can include quotes, comma, LF/CR & other special
> characters.
>
> I have it working but run into an issue with the following error
>
You have already asked this question, and there is a thread on going on
this?
To quote the previous thread, Solr is no longer a webapp that can be
deployed on any servlet container, it is now a black-box application, so
you should just deploy Solr as it is, and then connect to it yourself,
which y
What does the Solr log say? That usually gives you a better
idea of what the root cause is, the script really doesn't have
access to the root cause.
Best,
Erick
On Mon, Mar 21, 2016 at 5:54 AM, Salman Ansari
wrote:
> Hi,
>
> I am facing an issue in running Solr server. I tried different approac
Hmmm, you say "where I have many shards and
can't have one problem causing no deletion of old data.".
You then have a shard that, when it comes back up still
has all the old data and that _is_ acceptable? Seems like
that would be jarring to the users when some portion of the
docs in their collect
Hi all,
During daily work I got NPE at mention line. full method below.
Anyone know if it's a known bug ? should I open a ticket for Solr ?
Thanks in advance,
Shay.
public Object More ...lookup(BytesRef indexedId) {
683 LogPtr entry;
684 TransactionLog lookupLog;
685
686 synchroniz
Hi Alessandro,
OP have different ranking: fl=score and explain's score would have retrieve
different orders.
I wrote test cases using ClassicSimilarity, but it won't re-produce.
This is really weird. I wonder what is triggering this.
aHmet
On Monday, March 21, 2016 2:08 PM, Alessandro Bened
Hi,
I am using Solr 4.10.2.
When one of the shards in my environment is down and fails to recover -
The process of deleting documents from other shards fails as well.
For example,
When running:
https://:8983/solr//update?stream.body=
*:*&commit=true
I get the following error message:
No regis
Please find my answer inline
Corporate Executive Board India Private Limited. Registration No:
U741040HR2004PTC035324. Registered office: 6th Floor, Tower B, DLF Building
No.10 DLF Cyber City, Gurgaon, Haryana-122002, India.
This e-mail and/or its attachments are intended only for the use of
Hi,
I am facing an issue in running Solr server. I tried different approaches
and still receive the following error
"ERROR: Solr at http://localhost:8983/solr did not come online within 30
seconds"
I tried running the following commands
1) solr -e cloud
2) solr.cmd start -cloud -p 8983 -s
"C:\S
I would like to add a question, how the ranking is affected ?
Do you see the proper ranking in the explain section ?
And what about the results ? Are they ranked accordingly the correct score,
or they are ranked by the wrong score ?
I got a similar issue, which I am not able to reproduce yet, but i
Hello All,
I have a solr-5.3.0 and installed it and make my core but when I try to
deploy it on glassfish and follow steps in
"https://wiki.apache.org/solr/SolrGlassfish " it's recommended a solr.war
file which isn't exactly found on my opt/solr/dist
Where I can find it.
Regards,
Adel K
Mmm maybe I didn't explain properly, all the fields you have in the index
for the products could be used to design features .
Of course my list was an example, but when processing clicks you should
first take in consideration all the features you can extract that should
affect your ranking algorith
For the indexation i use DIH.
I find this link for solar Indexation.
With solar it is more quick?
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265050.html
Sent from the Solr - User mailing list archive at Nabble.com.
for the jvm i have 8GO
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-fast-indexing-tp4264994p4265047.html
Sent from the Solr - User mailing list archive at Nabble.com.
thank for your response.
soir worked on a server with 6CPU and 10 Go Memory Ram
We receive some data evrytime. Now, we do indexing 2 times per day
the database contains 5 tables (18k row, 3times 300k row and one of 6
millions rows)
The indexation was done in 6 hours.
I didn't modify the original so
Hi Ahmet,
I am using solr 5.5.0. I am running single instance with single core. No shards
I have added to my schema as
suggested by Rick Sullivan. Now the scores are same between explain and score
field.
But instead of previous results "Lync - Microsoft Office 365" and "Microsoft
Office 365"
Hi,
My basic requirement is to secure the solr admin page. I thought of securing it
using certificate authentication. So I followed the below link
https://cwiki.apache.org/confluence/display/solr/Enabling+SSL
With the help of above link I was able to secure my admin page but I am using
tomcat
45 matches
Mail list logo