Re: Solr, Multiple processes running

2007-12-12 Thread Chris Hostetter
: Subject: Solr, Multiple processes running : References: <[EMAIL PROTECTED]> : <[EMAIL PROTECTED]> : <[EMAIL PROTECTED]> ... http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply

Re: does solr handle hierarchical facets?

2007-12-12 Thread Chris Hostetter
: > such that if you search for category, you get all those documents that have : > been tagged with the category AND any sub categories. If this is possible I : > think I'll investigate using solr in place of some existing code we have : > that deals with indexing and searching of such data. : :

Re: criteria for using the property stored="true" and indexed="true"

2007-12-12 Thread Chris Hostetter
: http://wiki.apache.org/solr/SchemaXml#head-af67aefdc51d18cd8556de164606030446f56554 : : indexed means searchable (facet and sort also need this), stored instead : is needed only when you need the original text (i.e. not : tokenized/analyzed) to be returned. : When stored and indexed are not pr

RE: Two Solr Webapps, one folder for the index data?

2007-12-12 Thread Chris Hostetter
: I asked a question similar to this back in : http://mail-archives.apache.org/mod_mbox/lucene-solr-user/200709.mbox/[EMAIL PROTECTED] : SolrDispatchFilter and stored in the global Config). This way, I can : have a multiple instances of Solr up and running with the exact same : configuration

Re: SOLR X FAST

2007-12-12 Thread Chris Hostetter
: Why use FAST and not use SOLR ? For example. : What will FAST offer that will justify the investment ? Am I the only one that finds these questions incredibly hilarious? particularly on this list? You should also email FAST customer service and ask them "Why use Solr and not use FAST ?" :)

Re: Solr 1.3 expected release date

2007-12-12 Thread Venkatraman S
On Dec 13, 2007 1:38 AM, Ryan McKinley <[EMAIL PROTECTED]> wrote: > > I think the last *major* change going into 1.3 is SOLR-303 (Distributed > Search over HTTP) -- this will require some reworking of new features > like SearchComponents and solrj. After that, changes will mostly be for > stabili

Re: Solr and Flex

2007-12-12 Thread Venkatraman S
I presume you understand the difference between Solr and Flex, and I am not sure what you need the code for. Do you want an AS3 script implementation/wrapper for Solr, or are you expecting an application which uses Solr (to index the docs) and retrieves the docs using some web services and present

RE: Solr 1.3 expected release date

2007-12-12 Thread Norskog, Lance
... SOLR-303 (Distributed Search over HTTP)... Woo-hoo! -Original Message- From: Ryan McKinley [mailto:[EMAIL PROTECTED] Sent: Wednesday, December 12, 2007 12:09 PM To: solr-user@lucene.apache.org Subject: Re: Solr 1.3 expected release date Owens, Martin wrote: > What date or year do

Re: Autocommit

2007-12-12 Thread Michael Thessel
> > I already posted a while ago a problem that one of the solr threads > > starts using 100% of one of the processor cores on a 4 core > > system. > > This sounds like warming / autowarming. > The other possibility is garbage collection. What can I do here? Decrease the autowarmcount? My curren

Re: Autocommit

2007-12-12 Thread Yonik Seeley
On Dec 12, 2007 6:15 PM, Michael Thessel <[EMAIL PROTECTED]> wrote: > I already posted a while ago a problem that one of the solr threads > starts using 100% of one of the processor cores on a 4 core > system. This sounds like warming / autowarming. The other possibility is garbage collection. >

Autocommit

2007-12-12 Thread Michael Thessel
Hello UG, I already posted a while ago a problem that one of the solr threads starts using 100% of one of the processor cores on a 4 core system. This doesn't happen right after the start; it slightly increases for about a week until the process runs constantly at 100%. I couldn't figure out a soluti

Re: Solr, search result format

2007-12-12 Thread Mike Klaas
On 12-Dec-07, at 11:58 AM, Owens, Martin wrote: Hello everyone, Hi Martin, It is usually preferable to not reply to an existing message in the group when starting a new thread. Some people (like me) use clients that properly track the Followup-To header that gets added, so multiple t

Re: Solr, search result format

2007-12-12 Thread Walter Underwood
Fetch your 70,000 results in 70 chunks of 1000 results. Parse each chunk and add it to your internal list. If you are allowed to parse Python results, why can't you use a different XML parser? What sort of "more work" are you doing? I've implemented lots of stuff on top of a paged model, includin
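The chunked fetch described above could be sketched roughly as follows. This is only an illustration, not code from the thread: `fetch_all`, `fetch_page`, and `fake_fetch_page` are hypothetical names, and the stand-in fetcher generates documents locally instead of querying Solr. In a real client, `fetch_page` would presumably issue a request using Solr's `start` and `rows` query parameters.

```python
from typing import Callable, Dict, List


def fetch_all(fetch_page: Callable[[int, int], List[Dict]],
              total: int, rows: int = 1000) -> List[Dict]:
    """Collect up to `total` results by requesting `rows` results per call."""
    results: List[Dict] = []
    for start in range(0, total, rows):
        page = fetch_page(start, rows)  # one chunk, e.g. results 0..999
        if not page:
            break  # the server returned fewer results than expected
        results.extend(page)
    return results


def fake_fetch_page(start: int, rows: int) -> List[Dict]:
    """Hypothetical stand-in for one paged request against a 70,000-hit query."""
    total_hits = 70000
    return [{"id": i} for i in range(start, min(start + rows, total_hits))]


if __name__ == "__main__":
    docs = fetch_all(fake_fetch_page, total=70000, rows=1000)
    print(len(docs))
```

Passing the fetcher in as a function keeps the paging loop independent of the HTTP client and response format, so each chunk can be parsed and discarded before the next one is requested.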

RE: Solr, search result format

2007-12-12 Thread Owens, Martin
>> I think your biggest problem is requesting 70,000 records from Solr. >> That is not going to be fast. I know it, but the limits on the development don't lend themselves to putting all of the fields into lucene so a proper search can be conducted. We need to return them all because more wor

Re: Solr, search result format

2007-12-12 Thread Ryan McKinley
Owens, Martin wrote: Hello everyone, I'm looking for a better solution than the XML output we're currently getting; if you return more than 70k records the webserver can no longer cope with parsing the XML and the machine falls over out of memory. Ideally what we'd like is for the sea

Re: Solr, search result format

2007-12-12 Thread Walter Underwood
I think your biggest problem is requesting 70,000 records from Solr. That is not going to be fast. Two suggestions: 1. Use paging. Get the results in chunks, 10, 25, 100, whatever. 2. Since you are running out of memory parsing XML, I'm guessing that you're using a DOM-style parser. Don't do tha
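As a rough illustration of the alternative to a DOM-style parser, an incremental parser can handle one `<doc>` element at a time instead of building the whole response tree in memory. The sketch below is an assumption rather than the poster's recommendation: it uses Python's standard `xml.etree.ElementTree.iterparse`, the names `iter_docs` and `SAMPLE` are hypothetical, and it only handles flat fields (multi-valued `<arr>` fields are not unpacked).

```python
import io
import xml.etree.ElementTree as ET

# Small hand-written sample in the shape of a Solr XML response.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<response>
<result name="response" numFound="2" start="0">
<doc><str name="id">1</str><str name="title">first</str></doc>
<doc><str name="id">2</str><str name="title">second</str></doc>
</result>
</response>"""


def iter_docs(xml_stream):
    """Yield one dict per <doc> element as soon as it has been fully read."""
    for _event, elem in ET.iterparse(xml_stream, events=("end",)):
        if elem.tag == "doc":
            # Map each field's name attribute to its text content.
            yield {child.get("name"): child.text for child in elem}
            # Drop the parsed fields so they are not kept in memory.
            elem.clear()


if __name__ == "__main__":
    for doc in iter_docs(io.BytesIO(SAMPLE)):
        print(doc)
```

Because `iter_docs` is a generator, the caller can process each document and let it go before the next one is parsed, which is the property that matters when the response is large.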

Re: Solr 1.3 expected release date

2007-12-12 Thread Ryan McKinley
Owens, Martin wrote: What date or year do we believe Solr 1.3 will be released? Regards, Martin Owens 2008 for sure. It will be after Lucene 2.3, and that is a month (more?) away. My honest guess is late Jan to mid Feb. I think the last *major* change going into 1.3 is SOLR-303 (Distributed

Solr 1.3 expected release date

2007-12-12 Thread Owens, Martin
What date or year do we believe Solr 1.3 will be released? Regards, Martin Owens

Re: Solr and word frequencies?

2007-12-12 Thread Otis Gospodnetic
Recono, This would be easier to do with Lucene. Solr uses Lucene under the hood, so just write an app that opens appropriate indices and makes use of various docFreq methods in the Lucene API. Look at TermDocs, IndexReader, TermEnum, etc. Otis -- Sematext -- http://sematext.com/ -- Lucene -

Re: Creating document schema at runtime

2007-12-12 Thread Ryan McKinley
Shalin Shekhar Mangar wrote: Hi, I'm looking for some tips on how to create a new document schema and add it to a solr core at runtime. The use case that I'm trying to solve is: 1. Using a custom configuration tool, the user creates a solr schema 2. The schema is added (uploaded) to a solr instance (o

Re: display tokens

2007-12-12 Thread Ryan McKinley
Chris Hostetter wrote: : Subject: display tokens : : How can I retrieve the "analyzed tokens" (e.g. the stemmed values) of a : specific field? for a field by name independent of documents? the LukeRequestHandler can give you the top N terms for a field ... but if you mean "i did a search, i

Re: Leading WildCard in Query

2007-12-12 Thread Michael Kimsal
Please vote for SOLR-218. I'm not aware of any other way to accomplish the leading wildcard functionality that would be convenient. SOLR-218 is not asking that it be enabled by default, only that it be functionality that is exposed to SOLR admins via config.xml. On Dec 12, 2007 6:29 AM, Eswar K

Leading WildCard in Query

2007-12-12 Thread Eswar K
Hi All, I understand that a leading wildcard search is not allowed as it is a very costly operation. There is an issue logged for it (http://issues.apache.org/jira/browse/SOLR-218). Is there any other way of enabling leading wildcards apart from doing it in code by calling * QueryParser.setAl

Re: criteria for using the property stored="true" and indexed="true"

2007-12-12 Thread Walter Ferrara
See: http://wiki.apache.org/solr/SchemaXml#head-af67aefdc51d18cd8556de164606030446f56554 indexed means searchable (facet and sort also need this), stored instead is needed only when you need the original text (i.e. not tokenized/analyzed) to be returned. When stored and indexed are not present, I

Re: SOLR X FAST

2007-12-12 Thread Svein Parnas
On Dec 12, 2007, at 2:50 AM, Nuno Leitao wrote: FAST uses two pipelines - an ingestion pipeline (for document feeding) and a query pipeline which are fully programmable (i.e., you can customize it fully). At ingestion time you typically prepare documents for indexing (tokenize, character

Solr and word frequencies?

2007-12-12 Thread Recono
Hi, I am working on the following task. I have a big Solr index "B" (roughly 2 million forum-post entries) and 50 sub-indices "S1-50" (sub-forum entries) which are also included in "B". Now I want Solr to compare the word frequency of each word in "S1-50" to the word frequency of the whole b

criteria for using the property stored="true" and indexed="true"

2007-12-12 Thread Dilip.TS
Hi, I would be some clarifications on which fields should we assign the property stored="true" and indexed="true" What is the criteria for these property assignments? What would be the impact if no field is assigned with this property? Thanks in Advance, Regards, Dilip TS Starmark Services Pvt.