Hello All,
I am using Solr to index my data. My client is written in C++, so I send curl
requests to the Solr server for indexing.
Now I want to index in SolrCloud mode, using ZooKeeper for HA. I
read the SolrCloud wiki page (http://wiki.apache.org/solr/SolrCloud).
What I understand from wiki t
Hello Dear,
I have 1000 GB of data that I want to index.
Assume I have enough space to store the index on a single machine.
I would like to get an idea of Solr's performance when searching for an item
in a huge data set.
Do I need to use shards for improving the Solr search efficiency or it i
Hi,
I want to write a custom updateRequestHandler.
Can you please guide me through the steps I need to perform for that?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Writing-a-customize-updateRequestHandler-tp4115059.html
Sent from the Solr - User mailing list archive at Nabble.co
Hi Jack,
As you are a Solr expert, I am hoping for your suggestion on this post.
Actually, the best approach for the kind of problem I have would be to use
"ExtractingUpdateProcessor", but unfortunately it has not been released yet,
and work on it has not even started.
If you have something on top of your mind
Hi All,
In my application, I have a requirement that can be solved by
"ExtractingUpdateProcessor". May I know when it will be released (JIRA
ticket: SOLR-1763)?
Can you please tell me where I can get the "ExtractingUpdateProcessor" code
base?
Thanks, Jack, for sharing your idea. Let me describe my problem in more detail.
I have an email that I need to index. The email contains text content
(email headers, email body) and binary data (email attachments in PDF, DOC,
etc. formats). I want to index the text and binary content separately.
So I a
Hello All,
I have a problem as described below and would like to have your opinion:
I have multiple documents with the same unique id (by setting the overwrite
field to false). Let's say I have three documents (Doc1, Doc2, Doc3), and all
have the same unique id. I can search one of any of the three docume
Hi,
I am using ExtractingRequestHandler to extract text from binary data and
then index the text, but I am getting this error:
org.apache.solr.update.DirectUpdateHandler2; No uncommitted changes.
Skipping IW.commit.
solrconfig.xml:
true
attachment
ignored_
Please find my responses inline:
Assuming that your binary fields are mime attachments to email messages,
they will probably already be encoded as base 64. Why not just leave
them that way in solr too? You can't do much with them other than store
them right? Or do you have some kind of image pr
Hi,
Please find my responses inline:
That said, the obvious alternative is to use /update/extract instead of
/update – this gives you a way of handling up to one binary stream in
addition to any number of fields that can be represented as text. In that
case, you need to construct a POST request that
Thanks, everybody, for sharing your ideas.
I have learned that XML cannot carry arbitrary binary data, so I will
encode the data in base64.
Yes, I can write a custom URP that converts the base64-encoded fields to
binary fields. Now I have binary fields in my document. My question is
th
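For the base64 step mentioned in this post, a minimal C++ sketch of a standard (RFC 4648) encoder might look like the following. The function name base64_encode is hypothetical and not part of Solr or libcurl; it only shows how an attachment's raw bytes could be turned into text that is safe to place inside an XML field.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: encode raw bytes as standard base64 (RFC 4648),
// with '=' padding. The result contains only characters that are safe
// inside XML text.
std::string base64_encode(const std::string& in) {
    static const char tbl[] =
        "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";
    std::string out;
    out.reserve(((in.size() + 2) / 3) * 4);

    auto byte = [&in](std::size_t k) {
        return static_cast<unsigned>(static_cast<unsigned char>(in[k]));
    };

    std::size_t i = 0;
    // Full 3-byte groups become 4 output characters each.
    while (i + 2 < in.size()) {
        unsigned n = (byte(i) << 16) | (byte(i + 1) << 8) | byte(i + 2);
        out += tbl[(n >> 18) & 63];
        out += tbl[(n >> 12) & 63];
        out += tbl[(n >> 6) & 63];
        out += tbl[n & 63];
        i += 3;
    }

    // One or two leftover bytes are padded with '='.
    std::size_t rem = in.size() - i;
    if (rem == 1) {
        unsigned n = byte(i) << 16;
        out += tbl[(n >> 18) & 63];
        out += tbl[(n >> 12) & 63];
        out += "==";
    } else if (rem == 2) {
        unsigned n = (byte(i) << 16) | (byte(i + 1) << 8);
        out += tbl[(n >> 18) & 63];
        out += tbl[(n >> 12) & 63];
        out += tbl[(n >> 6) & 63];
        out += '=';
    }
    return out;
}
```

Using std::string as the byte container is just a convenience here; a std::vector<unsigned char> would work equally well.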
Hi Alexandre,
Thanks very much for responding to my post. Please find my responses inline:
1) For your email address fields, you are escaping the brackets, right?
Not just "solr solr
<[hidden email]>" as you show, but the < and > escaped, right? Otherwise,
those email addresses become part of XML
Hi,
I am using Solr to search my email data. My application is in C++, so I am
using the curl library to POST the data to Solr for indexing. I am posting data
in XML format, and some of the XML fields are in plain text and some of the
fields are in binary format. I want to know what should I do so th
Thank you all for responding. I was pulled away by other work and am now
back on this.
I tried to use ExtractingUpdateProcessorFactory, but it seems that it is not
supported in Solr 4.5 (which I am using), nor in any released Solr
version. Please find the
Hi guys,
I am using Solr 4.5.0 and getting the errors I mentioned in my
previous post.
Any help is highly appreciated.
Hi,
I am using ExtractingUpdateProcessorFactory in my application to extract
text from binary data with Tika and index it.
I added the following configuration to solrconfig.xml:
attachmentchain
binary_content
I cannot convert the base64-encoded data to text in my application, as that
would impact my core application processing. I want this task to be done on
the Solr side. Can I use Apache Tika for this on the Solr side?
But the data I am sending to Solr is in XML format, with some fields in
plain text and some
Hi,
I am using Solr to search my email data. My application is in C++, so I am
using the curl library to POST the data to Solr for indexing. I am posting data
in XML format, and some of the XML fields are in plain text and some are
base64 encoded. I want to know what should I do so that Solr can in
Hi,
I am integrating the Solr search engine with my email clients. I am sending
POST requests to Solr using REST.
I can successfully post an email's To, From, Subject, etc. headers to Solr
for indexing.
Since an email can have MIME bodies and attachments, I am not able to
understand how to pos
Thanks for giving me your valuable thoughts.
I used CDATA to escape the special characters ('<', '>', '&', etc.) in the XML
file.
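The CDATA approach described here could be sketched in C++ roughly as follows. Both function names (wrap_cdata, xml_escape) are hypothetical helpers made up for illustration, not part of Solr or libcurl. The first wraps a field value in a CDATA section and handles the one sequence CDATA cannot contain ("]]>"); the second shows the alternative of escaping with the predefined XML entities.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: wrap text in a CDATA section. The sequence "]]>"
// cannot appear inside CDATA, so each occurrence is split across two
// adjacent CDATA sections.
std::string wrap_cdata(const std::string& in) {
    std::string out = "<![CDATA[";
    std::size_t pos = 0;
    for (;;) {
        std::size_t hit = in.find("]]>", pos);
        if (hit == std::string::npos) {
            out.append(in, pos, std::string::npos);
            break;
        }
        out.append(in, pos, hit - pos);
        out += "]]]]><![CDATA[>";  // ends one section after "]]", reopens before ">"
        pos = hit + 3;
    }
    out += "]]>";
    return out;
}

// Hypothetical alternative: escape the five XML special characters with
// the predefined entities instead of using CDATA.
std::string xml_escape(const std::string& in) {
    std::string out;
    out.reserve(in.size());
    for (char c : in) {
        switch (c) {
            case '&':  out += "&amp;";  break;
            case '<':  out += "&lt;";   break;
            case '>':  out += "&gt;";   break;
            case '"':  out += "&quot;"; break;
            case '\'': out += "&apos;"; break;
            default:   out += c;        break;
        }
    }
    return out;
}
```

Neither helper removes characters that are illegal in XML 1.0 altogether (most control characters), so those would still need separate handling.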
Hi,
I am a new Solr user. I need to integrate Solr with my email application for
searching.
My code is in C++, so I am making REST requests to post the data to Solr for
indexing.
I have fields such as from, to, subject, body, etc., and they can contain
characters that need to be URL-encoded.
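As a rough illustration of the URL-encoding step, here is a minimal C++ sketch of percent-encoding following the RFC 3986 unreserved set. The name url_encode is hypothetical; when libcurl is already in use, its curl_easy_escape function performs the same job. This only applies to values sent in a URL query string or a form-encoded body; values placed inside an XML document need XML escaping instead.

```cpp
#include <string>

// Hypothetical helper: percent-encode every byte that is not an RFC 3986
// "unreserved" character (letters, digits, '-', '.', '_', '~').
std::string url_encode(const std::string& in) {
    static const char hex[] = "0123456789ABCDEF";
    std::string out;
    out.reserve(in.size());
    for (unsigned char c : in) {
        bool unreserved = (c >= 'A' && c <= 'Z') || (c >= 'a' && c <= 'z') ||
                          (c >= '0' && c <= '9') ||
                          c == '-' || c == '.' || c == '_' || c == '~';
        if (unreserved) {
            out += static_cast<char>(c);
        } else {
            out += '%';
            out += hex[c >> 4];    // high nibble
            out += hex[c & 0x0F];  // low nibble
        }
    }
    return out;
}
```

For example, a subject value of "a b&c=d" would be sent as "a%20b%26c%3Dd".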
21 matches