Question mark glyphs in indexed content

2009-08-10 Thread Rupert Fiasco
Hello, I am using the latest Solr4j to index content. When I look at that content in the Solr Admin web utility I see weird characters like this: http://brockwine.com/images/solrglyphs.png When I look at the text in the MySQL DB those chars appear to just be plain hyphens. The MySQL table charact

Responses getting truncated

2009-08-24 Thread Rupert Fiasco
I am seeing our responses getting truncated if and only if I search on our main text field. E.g. I just do some basic like title_t:arthritis Then I get a valid document back. But if I add in our larger text field: title_t:arthritis OR text_t:arthritis then the resultant document is NOT valid X

Re: Responses getting truncated

2009-08-25 Thread Rupert Fiasco
n very well be an issue with the data itself. For example, if the data > contains un-escaped characters which invalidates the response. I don't know > much about ruby, but what do you get with wt=json? > > Rupert Fiasco wrote: >> >> I am seeing our responses getting

Re: Responses getting truncated

2009-08-25 Thread Rupert Fiasco
t; Avlesh > > On Tue, Aug 25, 2009 at 10:01 PM, Rupert Fiasco wrote: > >> Using wt=json also yields an invalid document. So after more >> investigation it appears that I can always "break" the response by >> pulling back a specific field via the "fl&quo

Re: Responses getting truncated

2009-08-25 Thread Rupert Fiasco
nyone else have a better idea, I'm actually very curious to hear about > it. > > Uri > > Rupert Fiasco wrote: >> >> The text file at: >> >> http://brockwine.com/solr.txt >> >> Represents one of these truncated responses (this one in XML). It >

Re: Responses getting truncated

2009-08-25 Thread Rupert Fiasco
> 1. Exactly which version of Solr / SolrJ are you using? Solr Specification Version: 1.3.0 Solr Implementation Version: 1.3.0 694707 - grantingersoll - 2008-09-12 11:06:47 Latest SolrJ that I downloaded a couple of days ago. > Can you put the orriginal (pre solr, pre solrj, raw untouched, etc..

Re: Responses getting truncated

2009-08-28 Thread Rupert Fiasco
Firstly, to everyone who has been helping me, thank you very much. All this feedback is helping me narrow down these issues. I deleted the index and re-indexed all the data from scratch and for a couple of days we were OK, but now it seems to be erring again. It happens on different input documen

Re: Responses getting truncated

2009-08-28 Thread Rupert Fiasco
at might > have had some effect, it wouldnt hurt to try though > > > On 08/28/2009 02:04 PM, Rupert Fiasco wrote: >> >> Firstly, to everyone who has been helping me, thank you very much. All >> this feedback is helping me narrow down these issues. >> >>

Re: Responses getting truncated

2009-08-28 Thread Rupert Fiasco
Aug 24, 2009 at 6:30 PM, Rupert Fiasco wrote: >> If I run these through curl on the command its >> truncated and if I run the search through the web-based admin panel >> then I get an XML parse error. > > Are you running curl directly against the solr server, or going > t

Re: Responses getting truncated

2009-08-28 Thread Rupert Fiasco
on an initial re-index things work OK then a couple of days later they break. Keep y'all posted. Thanks -Rupert On Fri, Aug 28, 2009 at 3:12 PM, Rupert Fiasco wrote: > Yes, I am hitting the Solr server directly (medsolr1.colo:9007) > > Versions / architectures: > > Jetty

Re: Responses getting truncated

2009-09-03 Thread Rupert Fiasco
ponse starting off with the tail of another >>> response). That doesn't look like it could be a Solr bug... but >>> rather smells like a thread safety bug in the servlet container. Thanks for everyones help and input. LucidWorks For The Win. -Rupert On Fri, Aug 28, 2009

Specifying multiple documents in DataImportHandler dataConfig

2009-09-08 Thread Rupert Fiasco
I am using the DataImportHandler with a JDBC datasource. From my understanding of DIH, for each of my "content types" e.g. Blog posts, Mesh Categories, etc I would construct a series of document/entity sets, like

Re: Specifying multiple documents in DataImportHandler dataConfig

2009-09-08 Thread Rupert Fiasco
, Rupert Fiasco wrote: > I am using the DataImportHandler with a JDBC datasource. From my > understanding of DIH, for each of my "content types" e.g. Blog posts, > Mesh Categories, etc I would construct a series of document/

Understanding prefix query searching

2008-10-21 Thread Rupert Fiasco
So I tried to look on google for an answer to this before I posted here. Basically I am trying to understand how prefix searching works. I have a dynamic text field (indexed and stored) "full_name_t" I have some data in my index, specifically a record with full_name_t = "Robert P Page" A search

Spell checking not returning "full" terms

2009-02-04 Thread Rupert Fiasco
We are using Solr 1.3 and trying to get spell checking functionality. FYI, our index contains a lot of medical terms (which might or might not make a difference as they are not English-y words, if that makes any sense?) If I specify a spellcheck query of "spellcheck.q=diabtes" I get suggestions

Re: Spell checking not returning "full" terms

2009-02-04 Thread Rupert Fiasco
kenization and probably downcasing. See > http://wiki.apache.org/solr/SpellCheckComponent and > http://wiki.apache.org/solr/SpellCheckerRequestHandler for tips on how to > handle analysis for spelling. > > On Feb 4, 2009, at 2:33 PM, Rupert Fiasco wrote: > >> We are using Sol

Issuing just a spell check query

2009-02-06 Thread Rupert Fiasco
The docs for the SpellCheckComponent say "The SpellCheckComponent is designed to provide inline spell checking of queries without having to issue separate requests." I would like to issue just a spell check query, I dont care about it being inline and piggy-backing off a normal search query. How

Re: Issuing just a spell check query

2009-02-06 Thread Rupert Fiasco
But its deprecated (??) -Rupert On Fri, Feb 6, 2009 at 11:51 AM, Otis Gospodnetic wrote: > Rupert, > > You could use the SpellCheck*Handler* to achieve this. > > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > > ____

search returns matches for non-starting wildcard prefix queries

2009-02-09 Thread Rupert Fiasco
(I think I have a horrible subject line but I wasnt sure how to properly explain myself). I have a text field that I store last names in (and everything is lowercased prior to insertion, not sure if that matters). The field is described as:

Indexing issue with XML control characters

2009-07-20 Thread Rupert Fiasco
During indexing I will often get this error: SEVERE: com.ctc.wstx.exc.WstxUnexpectedCharException: Illegal character ((CTRL-CHAR, code 3)) at [row,col {unknown-source}]: [2,1] at com.ctc.wstx.sr.StreamScanner.throwInvalidSpace(StreamScanner.java:675) By looking at this list and elsewhe