update in SolrCloud through C++ client

2014-02-14 Thread neerajp
Hello All, I am using Solr for indexing my data. My client is in C++, so I make curl requests to the Solr server for indexing. Now I want to index in SolrCloud mode, using ZooKeeper for HA. I read the SolrCloud wiki page (http://wiki.apache.org/solr/SolrCloud). What I understand from wiki t

Solr performance on a very huge data set

2014-02-11 Thread neerajp
Hello Dear, I have 1000 GB of data that I want to index. Assume I have enough space to store the indexes on a single machine. I would like to get an idea of Solr's performance when searching for an item in a huge data set. Do I need to use shards to improve Solr search efficiency, or it i

Writing a customize updateRequestHandler

2014-02-03 Thread neerajp
Hi, I want to write a custom updateRequestHandler. Can you please guide me through the steps I need to perform for that?

Re: Cross referencing of solr documents

2014-02-02 Thread neerajp
Hi Jack, As you are a Solr expert, I am hoping for your suggestions on this post. Actually, the best approach for the kind of problem I have would be "ExtractingUpdateProcessor", but unfortunately it has not been released yet, and work on it has not even started. If you have something on top of your mind

ExtractingUpdateProcessor release date

2014-01-29 Thread neerajp
Hi All, In my application, I have a requirement that can be solved by "ExtractingUpdateProcessor". May I know when it will be released (JIRA ticket: SOLR-1763)? Can you please tell me where I can get the "ExtractingUpdateProcessor" code base?

Re: Cross referencing of solr documents

2013-12-20 Thread neerajp
Thanks, Jack, for sharing your idea. Let me describe my problem in more detail. I have an email that I have to index. The email contains some text content (email headers, email body) and binary data (email attachments in PDF, DOC, etc. formats). I want to index text and binary contents separately. So I a

Cross referencing of solr documents

2013-12-19 Thread neerajp
Hello All, I have a problem as described below and would like to have your opinion: I have multiple documents with the same unique id (by setting the overwrite field to false). Let's say I have three documents (Doc1, Doc2, Doc3) and all have the same unique id. I can search one of any of the three docume

org.apache.solr.update.DirectUpdateHandler2; No uncommitted changes. Skipping IW.commit.

2013-12-18 Thread neerajp
Hi, I am using ExtractingRequestHandler to extract text from binary data and then index the text but getting *error: org.apache.solr.update.DirectUpdateHandler2; No uncommitted changes. Skipping IW.commit.* *solrconfig.xml:* true attachment ignored_

Re: Indexing on plain text and binary data in a single HTTP POST request

2013-12-09 Thread neerajp
Pls. find my response in-line: Assuming that your binary fields are mime attachments to email messages, they will probably already be encoded as base 64. Why not just leave them that way in solr too? You can't do much with them other than store them right? Or do you have some kind of image pr

Re: Indexing on plain text and binary data in a single HTTP POST request

2013-12-09 Thread neerajp
Hi, Pls. find my response in-line: That said, the obvious alternative is to use /update/extract instead of /update – this gives you a way of handling up to one binary stream in addition to any number of fields that can be represented as text. In that case, you need to construct a POST request that

Re: Indexing on plain text and binary data in a single HTTP POST request

2013-12-09 Thread neerajp
Thanks, everybody, for sharing your ideas. So, I came to know that XML cannot carry arbitrary binary data, so I will encode the data in base64 format. Yes, I can write a custom URP that converts the base64-encoded fields to binary fields. Now, I have binary fields in my document. My question is th

Re: Indexing on plain text and binary data in a single HTTP POST request

2013-12-09 Thread neerajp
Hi Alexandre, Thanks very much for responding to my post. Pls. find my response in-line: 1) For your email address fields, you are escaping the brackets, right? Not just "solr solr <[hidden email]>" as you show, but the < and > escaped, right? Otherwise, those email addresses become part of XML

Indexing on plain text and binary data in a single HTTP POST request

2013-12-09 Thread neerajp
Hi, I am using Solr for searching my email data. My application is in C++, so I am using the cURL library to POST the data to Solr for indexing. I am posting data in XML format; some of the XML fields are in plain text and some are in binary format. I want to know what should I do so th

Re: Indexing on plain text data and base64 encode data in a single HTTP POST request

2013-12-05 Thread neerajp
Thank you all for responding. I was pulled away by other work and am now back on this. I tried to use ExtractingUpdateProcessorFactory, but it seems that it is not supported in Solr 4.5 (which I am using), nor in any released Solr version. Pls. find the

Re: Error while using ExtractingUpdateProcessorFactory

2013-12-05 Thread neerajp
Hi guys, I am using Solr 4.5.0 and getting the errors that I mentioned in my previous post. Any help is highly appreciated.

Error while using ExtractingUpdateProcessorFactory

2013-12-05 Thread neerajp
Hi, I am using ExtractingUpdateProcessorFactory in my application to extract binary data and index it using Tika. I made the following configuration in solrconfig.xml: attachmentchain binary_content

Re: Indexing on plain text data and base64 encode data in a single HTTP POST request

2013-10-26 Thread neerajp
I cannot convert the base64-encoded data to text in my application, as that would impact my core application processing. I want this task to be done on the Solr side. Can I use Apache Tika for this on the Solr side? The format I am sending to Solr is XML, with some fields in plain text and some

Indexing on plain text data and base64 encode data in a single HTTP POST request

2013-10-26 Thread neerajp
Hi, I am using Solr for searching my email data. My application is in C++, so I am using the cURL library to POST the data to Solr for indexing. I am posting data in XML format; some of the XML fields are in plain text and some are base64 encoded. I want to know what should I do so that Solr can in

Solr indexing on email mime body and attachment

2013-10-24 Thread neerajp
Hi, I am integrating the Solr search engine with my email clients. I send POST requests to Solr over REST. I can successfully post an email's To, From, Subject, and other headers to Solr for indexing. Since an email can have MIME bodies and attachments, I do not understand how to pos

Re: Solr indexing on urlencoded fields

2013-10-22 Thread neerajp
Thanks for giving me your valuable thoughts. I used CDATA for escaping the special characters ('<', '>', '&', etc.) in the XML file.
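
The CDATA approach mentioned in this message might look roughly like the following in a C++ client. This is only a sketch, not code from the thread: the helper names wrapInCdata and solrField are hypothetical, and the field name passed in is assumed to be a plain identifier that needs no escaping of its own. The one subtlety it handles is that the sequence "]]>" cannot appear inside a CDATA section, so it is split across two adjacent sections.

```cpp
#include <string>

// Hypothetical helper: wrap arbitrary text in a CDATA section so that
// '<', '>' and '&' need no entity escaping. Any "]]>" in the input is
// split across two adjacent CDATA sections, since it would otherwise
// terminate the section early.
std::string wrapInCdata(const std::string& text) {
    std::string out = "<![CDATA[";
    std::string::size_type pos = 0;
    for (;;) {
        std::string::size_type hit = text.find("]]>", pos);
        if (hit == std::string::npos) {
            out.append(text, pos, std::string::npos);
            break;
        }
        // Copy everything before "]]>", then close the section after
        // "]]" and reopen a new one before ">".
        out.append(text, pos, hit - pos);
        out += "]]]]><![CDATA[>";
        pos = hit + 3;
    }
    out += "]]>";
    return out;
}

// Hypothetical helper: build one <field> element of a Solr XML update
// document. Assumes `name` contains no characters that need escaping.
std::string solrField(const std::string& name, const std::string& value) {
    return "<field name=\"" + name + "\">" + wrapInCdata(value) + "</field>";
}
```

For example, solrField("from", "solr <user@example.com>") would yield a field element whose value survives XML parsing unchanged. CDATA does not help with characters that are not legal in XML at all (most control characters), so binary content would still need an encoding such as base64.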

Solr indexing on urlencoded fields

2013-10-22 Thread neerajp
Hi, I am a new Solr user. I need to integrate Solr with my email application for searching. My code is in C++, so I make REST requests to post data to Solr for indexing. I have fields like from, to, subject, body, etc., and they can contain characters that need to be URL-encoded.