Re: Solr Cell. Seems to be only indexing the first N bytes of a text file.

2010-03-20 Thread Ross
Thanks Erick. That was it. All looking good now. Cheers Ross On Sat, Mar 20, 2010 at 9:29 PM, Erick Erickson wrote: > Does our solarconfig file have a line like... > 1 > ? > > Try upping the 1... > > HTH > Erick > > On Sat, Mar 20, 2010 at 8:40 PM, Ross wrote: > >> Hi all >> >> I'm tr

Re: Solr Cell. Seems to be only indexing the first N bytes of a text file.

2010-03-20 Thread Erick Erickson
Does our solarconfig file have a line like... 1 ? Try upping the 1... HTH Erick On Sat, Mar 20, 2010 at 8:40 PM, Ross wrote: > Hi all > > I'm trying to index some text files using Solr Cell. I'm using the > schema from Avi Rappoport's tutorial about indexing html and text > files altho

MLT question

2010-03-20 Thread blargy
Im playing around with MLT and I am getting back decent results when searching against a particular document. My question is how can I paginate the results of this query? For example instead of setting rows you must specify mlt.count in the params. But how can I set the offset? mlt.offset? Thank

Solr Cell. Seems to be only indexing the first N bytes of a text file.

2010-03-20 Thread Ross
Hi all I'm trying to index some text files using Solr Cell. I'm using the schema from Avi Rappoport's tutorial about indexing html and text files although I also had the same problem with the example/solr setup. My problem is that words past or "below" a certain point in a file are not being inde

Re: dynamic categorization & transactional data

2010-03-20 Thread caman
@Grant Less than a minute. If we go with the meta-retrieval from the index, we will have to keep the index updated down to seconds. But that may not scale well. Probably a hybrid approach? I will look into classifier. thanks Grant Ingersoll-6 wrote: > > > On Mar 18, 2010, at 2:44 PM, cama

64 bit integers (MySQL bigint) and SOLR

2010-03-20 Thread Shawn Heisey
The primary key for my database is a BIGINT, basically a 64 bit integer. The value is well below the 32 bit maximum (about 230 million right now) but someday in the future that might not be the case. In the schema, we have it mapped to a "tint" field type as defined in the example schema. I

GSOC: Solr/Mahout integration

2010-03-20 Thread Grant Ingersoll
For any students out there interested in doing a Google Summer of Code on Open Source and wanting to contribute to Solr, check out https://issues.apache.org/jira/browse/MAHOUT-343

Re: Features not present in Solr

2010-03-20 Thread Israel Ekpo
One feature that is not available in Solr is any licensing fees and fine print. Also you should not expect to pay in order to use Solr. On Fri, Mar 19, 2010 at 11:16 PM, Srikanth B wrote: > Hello > > We are in the process of researching on Solr features. I am looking for two > things >1

Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-20 Thread stocki
thx, but i had already implement this way, but w are not so happy with the solutions that this way returns =( is there any possibility to group suggests ? Erick Erickson wrote: > > This link might help. Although it talks about getting the data in with > DIH, > you can skip that part > >

Re: PDFBox/Tika Performance Issues

2010-03-20 Thread Grant Ingersoll
What's your configuration look like for the ExtractReqHandler? On Mar 19, 2010, at 2:42 PM, Giovanni Fernandez-Kincade wrote: > Yeah I've been trying that - I keep getting this error when indexing a PDF > with a trunk-build: > > Apache Tomcat/5.5.27 - Error report > HTTP Status 500

Re: Stop words

2010-03-20 Thread Ahmet Arslan
> if i hide the class="solr.StopFilterFactory"  ignoreCase="true" > words="stopwords.txt" enablePositionIncrements="true"/> > it will work. then > how can i get the "this is it" with out hide > solr.StopFilterFactory ... any > solution One solution can be using CommonGramsFilterFactory [1] inste

Re: related search

2010-03-20 Thread Ahmet Arslan
> > How can i make related search in solr.if i search ipod > i need to get > > answer like ipodsuffle,ipodnano,ipone with out using > morelikethis option First of all you need to construct these list. Either automatic or manually. You can use SynonymFilterFactory [1] at index time with this setti

Re: AutoSuggest

2010-03-20 Thread Suram
Suram wrote: > > > > Shalin Shekhar Mangar wrote: >> >> On Sat, Mar 13, 2010 at 9:30 AM, Suram wrote: >> >>> >>> Erick Erickson wrote: >>> > >>> > Did you commit your changes? >>> > >>> > Erick >>> > >>> > On Fri, Mar 12, 2010 at 7:38 AM, Suram wrote: >>> > >>> >> >>> >> Can set my index

Re: related search

2010-03-20 Thread Suram
Suram wrote: > > How can i make related search in solr.if i search ipod i need to get > answer like ipodsuffle,ipodnano,ipone with out using morelikethis option > any solution please kindly tell me -- View this message in context: http://old.nabble.com/related-search-tp27933778p27951060.h

Stop words

2010-03-20 Thread Suram
Hi all, My index file field name="this is it", but am trying search,it would not found if i hide the it will work. then how can i get the "this is it" with out hide solr.StopFilterFactory ... any solution -- View this message in context: http://old.nabble.com/Stop-words-tp27951059p

Re: [ANN] Zoie Solr Plugin - Zoie Solr Plugin enables real-time update functionality for Apache Solr 1.4+

2010-03-20 Thread Janne Majaranta
To my understanding it adds a in-memory index which holds the recent commits and which is flushed to the main index based on the config options. Not sure if it helps to get solr near real time. I am evaluating it currently, and I am really not sure if it adds anything because of the cache r