first time user
Hi. I tried setting up and running Solr on a PC. Then I tried to index a document that is semicolon-delimited, although it has a .csv file extension, and got the following:

C:\Users\Owner\Downloads\SOLR\solr-4.10.2>java -classpath dist/solr-core-4.10.2.jar -Dauto org.apache.solr.util.SimplePostTool example/exampledocs/tweets_01-01-12_12-00-02.csv
SimplePostTool version 1.5
Posting files to base url http://localhost:8983/solr/update..
Entering auto mode. File endings considered are xml,json,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log
POSTing file tweets_01-01-12_12-00-02.csv (text/csv)
SimplePostTool: WARNING: Solr returned an error #400 (Bad Request) for url: http://localhost:8983/solr/update
SimplePostTool: WARNING: Response: 4000CSVLoader: input=null, line=1,expected 1 values but got 2 values={'662641699;11:55:03;01;01;2012;EMILYWRIGHT9;RT @alandavidharris: Great bargains on our 8ft tall LED outdoor cherry tree. Now half price at £149.00 http://t.co/PJHQTUnV;DEVON',' ENGLAND;0;0',}400
SimplePostTool: WARNING: IOException while reading response: java.io.IOException: Server returned HTTP response code: 400 for URL: http://localhost:8983/solr/update
1 files indexed.
COMMITting Solr index changes to http://localhost:8983/solr/update..
Time spent: 0:00:00.078

C:\Users\Owner\Downloads\SOLR\solr-4.10.2>

Then I tried a query for 'eyebrows', which I saw was in the .csv file, at http://localhost:8983/solr/#/collection1/query like this:

http://localhost:8983/solr/collection1/select?q=eyebrows&wt=json&indent=true

but I got:

{
  "responseHeader": {
    "status": 0,
    "QTime": 0,
    "params": {
      "q": "eyebrows",
      "indent": "true",
      "wt": "json",
      "_": "1418362651114"
    }
  },
  "response": {
    "numFound": 0,
    "start": 0,
    "docs": []
  }
}

Can anyone suggest what I have done wrong and how I can index this type of file? Thank you.
-- View this message in context: http://lucene.472066.n3.nabble.com/first-time-user-tp4174121.html Sent from the Solr - User mailing list archive at Nabble.com.
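A note on the error above: "expected 1 values but got 2" is consistent with the CSV loader splitting on a comma, its default separator, so the row breaks in two at the comma in "DEVON, ENGLAND" instead of at the semicolons. The Python sketch below only illustrates that splitting behaviour on the row from the error output; it does not talk to Solr. The `separator` request parameter it builds at the end is the CSV handler's option for changing the delimiter, but using it on this file is an untested assumption, and the helper name `split_row` is made up for the illustration.

```python
import csv
import io
from urllib.parse import urlencode

# Data row copied from the error output in the message above.
ROW = ("662641699;11:55:03;01;01;2012;EMILYWRIGHT9;"
       "RT @alandavidharris: Great bargains on our 8ft tall LED outdoor "
       "cherry tree. Now half price at £149.00 http://t.co/PJHQTUnV;"
       "DEVON, ENGLAND;0;0")


def split_row(line, delimiter):
    """Parse a single CSV line using the given delimiter (hypothetical helper)."""
    return next(csv.reader(io.StringIO(line), delimiter=delimiter))


by_comma = split_row(ROW, ",")
by_semicolon = split_row(ROW, ";")
print(len(by_comma))      # number of values when splitting on commas
print(len(by_semicolon))  # number of values when splitting on semicolons

# Assumption: pass the delimiter to Solr's CSV handler as a URL-encoded
# request parameter. urlencode turns ';' into '%3B'.
params = urlencode({"separator": ";", "commit": "true"})
update_url = "http://localhost:8983/solr/update?" + params
print(update_url)
```

With SimplePostTool, the same parameter can probably be supplied through its -Dparams system property (for example -Dparams="separator=%3B"); check the tool's help output before relying on that.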
Re: first time user
Hi Alex, thank you for the response and information. My data is stored in semicolon-delimited files, and some of the text fields occasionally contain semicolons themselves. In your opinion, will Solr be able to index the data properly by itself, or will I have to use an outside scripting language like Python to enclose all text in quotation marks? Thanks again.
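For what it's worth, whether a stray semicolon can be recovered depends on the data. If the number of columns is fixed and only one column is free text, the surplus pieces can be rejoined and every field wrapped in double quotes before posting. Below is a minimal Python sketch of that idea under two assumptions that the thread does not confirm: every row has 10 columns (the count in the sample row from the error output) and the tweet text at index 6 is the only column that may contain semicolons. The function name `requote_line` and the sample row are made up for illustration. If a second column, such as the location, can also contain semicolons, this approach cannot tell them apart.

```python
import csv
import io

EXPECTED_COLUMNS = 10  # assumption: count taken from the sample row in the error output
TEXT_INDEX = 6         # assumption: only the tweet text may contain ';'


def requote_line(line, expected=EXPECTED_COLUMNS, text_index=TEXT_INDEX):
    """Rejoin a text field split by stray semicolons, then quote every field."""
    parts = line.rstrip("\r\n").split(";")
    extra = len(parts) - expected
    if extra < 0:
        raise ValueError("too few fields: %r" % line)
    # The text field covers its own slot plus any surplus pieces.
    text = ";".join(parts[text_index:text_index + extra + 1])
    fields = parts[:text_index] + [text] + parts[text_index + extra + 1:]
    out = io.StringIO()
    writer = csv.writer(out, delimiter=";", quoting=csv.QUOTE_ALL)
    writer.writerow(fields)
    return out.getvalue().rstrip("\r\n")


# Made-up row with one stray semicolon inside the text column.
sample = "1;11:55:03;01;01;2012;SOMEUSER;first part; second part;DEVON, ENGLAND;0;0"
fixed = requote_line(sample)
print(fixed)
```

Double quotes are the CSV handler's default encapsulator, so a file rewritten this way and posted with a semicolon separator should keep the quoted semicolons inside the text field, but that is worth confirming on a two-line test file first.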
Re: first time user
Thanks Jack. Can I ask, does it give a solution to my problem of the semicolons in the text and as delimiters?

Bill

On Tue, Dec 16, 2014 at 9:19 PM, Jack Krupansky-2 [via Lucene] <ml-node+s472066n4174529...@n3.nabble.com> wrote:

> My Solr Deep Dive e-book has full details and lots of examples for CSV indexing:
>
> http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html
>
> -- Jack Krupansky
>
> -----Original Message-----
> From: Alexandre Rafalovitch
> Sent: Tuesday, December 16, 2014 12:12 AM
> To: solr-user
> Subject: Re: first time user
>
> Just test a manual example yourself. Much easier. I am pretty sure Solr can't read your mind which particular semicolon is which use case.
>
> Worse though, I can't remember how smart it is about quotes either.
>
> Easier to test than to guess.
>
> Regards,
> Alex