Re: Question on Batch process

2011-04-27 Thread Charles Wardell
fast last night, but today the commits are taking so long. Is this to be expected? -- Best Regards, Charles Wardell Blue Chips Technology, Inc. www.bcsolution.com On Wednesday, April 27, 2011 at 6:15 PM, Otis Gospodnetic wrote: > Hi Charles, > > Yes, the threads I was referring

Re: Question on Batch process

2011-04-26 Thread Charles Wardell
queueSize, int threadCount) throws MalformedURLException Thanks again, Charlie -- Best Regards, Charles Wardell Blue Chips Technology, Inc. www.bcsolution.com On Tuesday, April 26, 2011 at 5:12 PM, Otis Gospodnetic wrote: > Charlie, > > How's this: > * -Xmx2g > *

Question on Batch process

2011-04-26 Thread Charles Wardell
I am sure that this question has been asked a few times, but I can't seem to find the sweetspot for indexing. I have about 100,000 files each containing 1,000 xml documents ready to be posted to Solr. My desire is to have it index as quickly as possible and then once completed the daily stream

Query performance

2011-04-16 Thread Charles Wardell
Hi All, I have an index with about 30M documents. For the most part queries are very fast. However, when I add a wildcard to a search field. +title:h*twitter it can take a few minutes. 8GB 1 quad core CENTOS false 100 512 10 1 1000 1 native

unindexible Chars?

2011-04-06 Thread Charles Wardell
Once and awhile, my post.jar seems to fail on commit. Durring the commit process, I have gotten a few errors. One is that EOF character found, and another is that semicolon expected after &the. I also have come across a > was expected. So my question is what characters do I need to strip out o

Unexpected character '0' (code 48) in prolog;

2011-03-30 Thread Charles Wardell
Can someone translate this error for me. My data looks pretty clean, so I am not sure what is going on here. Mar 30, 2011 5:21:52 AM org.apache.solr.common.SolrException log SEVERE: Error processing "legacy" update command:com.ctc.wstx.exc.WstxUnexpectedCharException: Unexpected character '0' (

Re: Exporting to CSV

2011-03-29 Thread Charles Wardell
Hi Koji, Do you mean that adding &wt=csv to my http request will give me a csv? The only downloads that I see on the SOLR site is for 1.4.x Is there a 3.1 beta? On Mar 29, 2011, at 10:32 PM, Koji Sekiguchi wrote: > (11/03/30 10:59), Charles Wardell wrote: >> Is there an easy way t

Exporting to CSV

2011-03-29 Thread Charles Wardell
Is there an easy way to get queried data exported from solr in a csv format? Hoping there is a handler or library for this. Regards, charlie

Fields not being indexed?

2011-03-28 Thread Charles Wardell
Sorry for inadvertently Hijacking the last thread. Can someone take a look at this and let me know what I am doing wrong. According to luke, only guid, tags, and aquiDate are available. Schema is below as well. http://twitter.com/AshleyxArsenic/statuses/52164920388763648 []

Fields not being indexed?

2011-03-28 Thread Charles Wardell
Can someone take a look at this and let me know what I am doing wrong. According to luke, only guid, tags, and aquiDate are available. Schema is below as well. http://twitter.com/AshleyxArsenic/statuses/52164920388763648 []

Re: problems indexing web content

2011-03-28 Thread Charles Wardell
ot;" >> >> [] >> >> >> >> >> On Mar 28, 2011, at 1:02 PM, Jan Høydahl wrote: >>> Hi, >>> >>> I assume you try to post HTML files from post.jar, and use >>> HTMLStripCharFilter to sanitize the HTML. >>> &

Re: problems indexing web content

2011-03-28 Thread Charles Wardell
simplest case first, > get rid of mappingCharFilter, StopFilter, WordDelimFilter and Stemmer - just > do the most basic stuff you can and go from there. > > -- > Jan Høydahl, search solution architect > Cominvent AS - www.cominvent.com > > On 28. mars 2011, at 18.52, Charles Wa

problems indexing web content

2011-03-28 Thread Charles Wardell
Hi Everyone, I setup a server and began to index my data. I have two questions I am hoping someone can help me with. Many of my files seem to index without any problems. Others, I get a host of different errors. I am indexing primarily web based content and have identified my text field as foll