Re: Error when indexing XML files

2009-10-15 Thread Fergus McMenemie
>Hi, > >Please find the schema file attached. Please let me know what I am doing wrong. > >Regards >Chaitali > >--- On Wed, 10/14/09, Fergus McMenemie wrote: > > >From: Fergus McMenemie >Subject: Re: Error when indexing XML files >To: solr-user@lucene.apache.org >Date: Wednesday, October 14, 2009

Re: Error when indexing XML files

2009-10-15 Thread Fergus McMenemie
Hi, Please find the schema file attached. Please let me know what I am doing wrong. Regards Chaitali --- On Wed, 10/14/09, Fergus McMenemie wrote: From: Fergus McMenemie Subject: Re: Error when indexing XML files To: solr-user@lucene.apache.org Date: Wednesday, October 14, 2009, 2:25 AM >Hi

RE: hadoop configuarions for SOLR-1301 patch

2009-10-15 Thread Pravin Karne
Hi, Patch (SOLR-1301) provides distributed indexing (using Hadoop). I now have a Hadoop cluster with 1 master and 2 slaves. I have also applied the above patch to Solr and built Solr. How do I integrate the resulting Solr executables with the Hadoop cluster? Can you please tell me what the steps are? Shall I

Re: Using DIH's special commands....Help needed

2009-10-15 Thread Noble Paul നോബിള്‍ नोब्ळ्
Use LogTransformer to see if the value is indeed set. This should print out the entire row after the transformations. On Fri, Oct 16, 2009 at 3:04 AM, William Pierce wrote: > Thanks for your reply! I tried your suggestion. No luck. I have verified > that I have version 1.6.0_05-b13 of jav

Re: hadoop configuarions for SOLR-1301 patch

2009-10-15 Thread Jason Rutherglen
Hi Pravin, You'll need to set up a Hadoop cluster which is independent of SOLR-1301. 1301 is for building Solr indexes only, so there isn't a master and slave. After building the indexes one needs to provision the indexes to Solr servers. In my case I only have slaves because I'm not incrementally

Re: advice on failover setup

2009-10-15 Thread Jason Rutherglen
Don, I neglected to mention the Solr Katta integration patch, SOLR-1395. That's a great place to start coding-wise! -J On Wed, Oct 14, 2009 at 4:20 PM, Don Clore wrote: > I'm sorry, for clarification, is it the *wiki* pages that are under > development, or the features (I'm guessing the latter)?

Re: Customizing solr search: SpanQueries (revisited)

2009-10-15 Thread Chris Hostetter
: with (in my overridden process() method): : String[] selectFields = {"id", "fileName"}; // the subset of fields : I am interested in : TopDocs results = searcher.search(cmd.getQuery(), 10); // : custom spanquery, and many/all hits : /* save hit info (doc & score) *

RE: Right place to put my Tokenizer jars

2009-10-15 Thread Chris Hostetter
: Actually, I meant to say I have my Tokenizer jars in solr/lib. : I have the jars that my Tokenizer jars depend in lib/ext, : as I wanted them to be loaded only once per container : due to their internal description. Bad idea? unless there is something *really* hinky about those dependencies, i

Re: Using DIH's special commands....Help needed

2009-10-15 Thread William Pierce
Thanks for your reply! I tried your suggestion. No luck. I have verified that I have version 1.6.0_05-b13 of java installed. I am running with the nightly bits of October 7. I am pretty much out of ideas at the present time. I'd appreciate any tips/pointers. Thanks, - Bill --

Re: Using mincount with date facet in Solr 1.4

2009-10-15 Thread Chris Hostetter
: But I was getting facets even with count 0. So I tried following : combinations of mincount parameters, as none was specified in the : wiki, : for date faceting. mincount is not a date faceting option -- it only applies to field value faceti

Re: Solr/Lucene keeps eating up memory while idling

2009-10-15 Thread Yonik Seeley
I just did some allocation profiling on the stock Solr example... it's not completely idle when no requests are being made. There's only one thing allocating memory: org.mortbay.util.Scanner.scanFiles() That must be Jetty looking to see if any of the files under webapps has changed. It's really n

Re: Solr/Lucene keeps eating up memory while idling

2009-10-15 Thread nonrenewable
Here is exactly half an hour from roughly the beginning of logging. There's nothing to see really because no requests are sent, you just see the GC behaviour: [Full GC 211987K->208493K(432448K), 0.6273480 secs] [GC 276333K->212269K(438720K), 0.0929710 secs] [GC 289133K->216269K(439936K), 0.1019780
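
Editor's sketch: GC lines in the format quoted above can be turned into numbers to see whether the heap that survives each collection keeps growing. This is a minimal Python sketch, assuming the log uses exactly the `[GC before->after(total), secs]` layout shown in the message; the names `GC_ENTRY` and `parse_gc_log` are illustrative and not part of any Solr or JVM tooling.

```python
import re

# Matches entries such as "[Full GC 211987K->208493K(432448K), 0.6273480 secs]".
GC_ENTRY = re.compile(
    r"\[(?P<kind>Full GC|GC) (?P<before>\d+)K->(?P<after>\d+)K"
    r"\((?P<total>\d+)K\), (?P<secs>[\d.]+) secs\]"
)

def parse_gc_log(text):
    """Return one dict per complete GC entry found in text."""
    entries = []
    for m in GC_ENTRY.finditer(text):
        entries.append({
            "kind": m.group("kind"),
            "before_kb": int(m.group("before")),
            "after_kb": int(m.group("after")),
            "heap_kb": int(m.group("total")),
            "secs": float(m.group("secs")),
        })
    return entries

# The two complete entries quoted in the message above.
sample = (
    "[Full GC 211987K->208493K(432448K), 0.6273480 secs] "
    "[GC 276333K->212269K(438720K), 0.0929710 secs]"
)
entries = parse_gc_log(sample)
for e in entries:
    print(e["kind"], "live after collection:", e["after_kb"], "KB")
```

Comparing `after_kb` across successive entries is what shows whether memory is actually being retained or merely allocated and collected.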

Re: Using DIH's special commands....Help needed

2009-10-15 Thread Fergus McMenemie
Hi, For example, my data-import.conf has the following. It allows me to specify a parameter "single=pathname" on the url used to invoke DIH. It allows a doc to be deleted from the index by, in my case its pathname, which is stored in the field fileAbsolutePath.

Re: Facet query help

2009-10-15 Thread Chris Hostetter
: the original pastie(http://pastie.org/650932). I tried the fq query body with : quotes and without quotes. the entire fq param shouldn't be in quotes ... just the value that you want to query on (since it's a string field and you want the whole field treated as a single string... fq =
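
Editor's sketch: the advice above (quote only the value, not the whole fq parameter) can be illustrated with a small Python helper. The field name `category` and the value `Home Theater` are hypothetical placeholders, not taken from the original pastie, and `string_field_filter` is an illustrative name rather than a Solr API.

```python
from urllib.parse import urlencode

def string_field_filter(field, value):
    """Build an fq value that treats the whole string as a single term."""
    # Escape backslashes and double quotes so the value stays inside the quotes.
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'{field}:"{escaped}"'

# Hypothetical string field and value, for illustration only.
fq = string_field_filter("category", "Home Theater")
print(fq)

# URL-encode the parameters before appending them to the select URL.
print(urlencode({"q": "*:*", "fq": fq}))
```

Only the value after the colon is wrapped in quotes; the field name and the colon stay outside, which is the distinction the reply draws.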

Re: (Solr 1.4 dev) Why solr.common.* packages are in solrj-*.jar ?

2009-10-15 Thread Chris Hostetter
: BTW, is there some sort of transition guide for Solr 1.4? : I see there are changes in how classes are divided into JARs : like above, and there are some incompatible API changes. : It'll be great if such information can be part of CHANGES.txt. CHANGES.txt contains an "Upgrading from Solr 1.3" se

Re: Using DIH's special commands....Help needed

2009-10-15 Thread Shalin Shekhar Mangar
On Fri, Oct 16, 2009 at 12:46 AM, William Pierce wrote: > Thanks for your help. Here is my DIH config file. I'd appreciate any > help/pointers you may give me. No matter what I do the documents are not > getting deleted from the index. My db has rows whose 'IndexingStatus' field > has values

Re: Solr/Lucene keeps eating up memory while idling

2009-10-15 Thread Grant Ingersoll
Please send a log covering at least the 2.5 minutes you discuss, but upwards of 5 minutes would be good. On Oct 15, 2009, at 1:26 PM, nonrenewable wrote: Did I read that right? 330K docs == 12 GB index. Ops, missed the dot - 1.2GB, but i don't think that should really make the differen

Re: Conditional copyField

2009-10-15 Thread Grant Ingersoll
Nice find, Ahmet. I'd love to see this formalized in the Solr schema syntax, as it is something I've often wanted. Max Chars is OK, too, but I would like to see max tokens as well. On Oct 12, 2009, at 6:31 PM, AHMET ARSLAN wrote: Hi, I am pushing data to solr from two different sources n

Re: Using DIH's special commands....Help needed

2009-10-15 Thread William Pierce
Thanks for your help. Here is my DIH config file. I'd appreciate any help/pointers you may give me. No matter what I do the documents are not getting deleted from the index. My db has rows whose 'IndexingStatus' field has values of either 1 (which means add it to solr), or 4 (which means d

Re: Using DIH's special commands....Help needed

2009-10-15 Thread Shalin Shekhar Mangar
On Thu, Oct 15, 2009 at 10:42 PM, William Pierce wrote: > Thanks, Shalin. I am sorry if I phrased it incorrectly. Yes, I want to > know how to delete documents in the solr index using the $deleteDocByQuery > special command. I looked in the wiki doc and could not find out how to do > this...

Re: Solr/Lucene keeps eating up memory while idling

2009-10-15 Thread nonrenewable
>Did I read that right? 330K docs == 12 GB index. Ops, missed the dot - 1.2GB, but i don't think that should really make the difference in this case. Even if it was 12 GB it would just have some really juicy documents, right? :) >Can you share the Solr logs and/or your config? Is this happenin

Re: Using DIH's special commands....Help needed

2009-10-15 Thread William Pierce
Thanks, Shalin. I am sorry if I phrased it incorrectly. Yes, I want to know how to delete documents in the solr index using the $deleteDocByQuery special command. I looked in the wiki doc and could not find out how to do this. Sorry if this is self-evident... Cheers, - Bill ---

Re: Using DIH's special commands....Help needed

2009-10-15 Thread Shalin Shekhar Mangar
On Thu, Oct 15, 2009 at 6:25 PM, William Pierce wrote: > Folks: > > I see in the DIH wiki that there are special commands which according to > the wiki > > "Special commands can be given to DIH by adding certain variables to the > row returned by any of the components . " > > In my use case, my d

Re: Solr/Lucene keeps eating up memory while idling

2009-10-15 Thread Grant Ingersoll
On Oct 14, 2009, at 12:26 PM, nonrenewable wrote: I'm curious why this is occurring and whether I can prevent it. This is my scenario: Locally I have an idle running solr 1.3 service using lucene 2.4.1 which has an index of ~330K documents containing ~10 fields each (total size ~12GB).

Re: Boosting of words

2009-10-15 Thread AHMET ARSLAN
> Hi, >   > I am able to see the results when i pass the values in the > query browser. >   > When i pass the below query i am able to see the difference > in output. >   > http://localhost:8983/solr/select/?q=java^100%20technology^1 >   > Each time user cannot pass the values in the query browser

Re: Boosting of words

2009-10-15 Thread Michel Bottan
Hi Bhaskar, The parameter you're looking for is the Boost Query. Remember using Dismax Query Handler. http://wiki.apache.org/solr/DisMaxRequestHandler#bq_.28Boost_Query.29 http://localhost:8983/solr/select/?q=video&qt=dismax&bq=cat:electronics^5.0 Michel On Thu, Oct 15, 2009 at 6:04 AM, bhas
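
Editor's sketch: the boost-query URL above can be assembled programmatically so that the `^` and `:` characters are escaped correctly. This is a minimal Python sketch that reuses the host, query, and boost values from the message; the helper name `dismax_url` is illustrative.

```python
from urllib.parse import urlencode

def dismax_url(base, query, boost_query):
    """Build a select URL that uses the dismax handler with a boost query."""
    params = {"q": query, "qt": "dismax", "bq": boost_query}
    # urlencode percent-escapes ':' and '^' in the boost query.
    return base + "?" + urlencode(params)

url = dismax_url(
    "http://localhost:8983/solr/select/",
    "video",
    "cat:electronics^5.0",
)
print(url)
```

Keeping the boost in `bq` rather than in `q` means the user's own query text does not need to carry the `^` syntax.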

Limit occurences per page of items with same category

2009-10-15 Thread javier_uru
I was reading about field collapsing but I think it is not what I'm looking for. I have to resolve this problem. After a search, I need to show, for example, 3 items per page which have the same Category. I will display 10 items per page. Suppose the search returns 15 items in this order after pri

Re: 'Down' boosting shorter docs

2009-10-15 Thread Walter Underwood
Another approach is to change the document length normalization formula. See Similarity.lengthNorm() in Lucene. wunder On Oct 15, 2009, at 12:45 AM, Andrea D'Ippolito wrote: I've read (correct me if I'm wrong) that a solution to achieve that is to overboost all the other fields. But I guess this
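
Editor's sketch: Lucene's DefaultSimilarity computes the length norm as 1 / sqrt(numTerms), which is why very short fields score high. The Python below reproduces that arithmetic next to one possible replacement. The floored variant and its `min_terms` parameter are assumptions made for illustration, not something proposed in the thread; a real change would be a Java Similarity subclass, and because norms are computed at index time it would require reindexing.

```python
import math

def default_length_norm(num_terms):
    """Lucene DefaultSimilarity formula: 1 / sqrt(numTerms)."""
    return 1.0 / math.sqrt(num_terms)

def floored_length_norm(num_terms, min_terms=10):
    """Hypothetical variant: score fields shorter than min_terms as if they had min_terms."""
    return 1.0 / math.sqrt(max(num_terms, min_terms))

# Compare both formulas for a one-word body, a short body, and a long body.
for n in (1, 10, 100):
    print(n, round(default_length_norm(n), 3), round(floored_length_norm(n), 3))
```

With the floor in place, a single-word body no longer receives a larger norm than a ten-word body, while longer documents are scored exactly as before.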

Using DIH's special commands....Help needed

2009-10-15 Thread William Pierce
Folks: I see in the DIH wiki that there are special commands which according to the wiki "Special commands can be given to DIH by adding certain variables to the row returned by any of the components . " In my use case, my db contains rows that are marked "PendingDelete". How do I use the
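
Editor's sketch: the wiki sentence quoted above says special commands are given by adding variables to the row. The Python below only models that idea for the $deleteDocByQuery command discussed later in this thread. It is a stand-in, not DIH code: real transformers are written in Java or as a script in the DIH config, and the field names `status` and `id` are hypothetical.

```python
# Python stand-in for a DIH row transformer. Field names are hypothetical.
PENDING_DELETE = "PendingDelete"  # status value mentioned in the question

def mark_pending_deletes(row, status_field="status", id_field="id"):
    """Return a copy of row, adding a delete command if it is pending delete."""
    out = dict(row)
    if out.get(status_field) == PENDING_DELETE:
        # The value must be a Solr query; here it selects the row's own document.
        out["$deleteDocByQuery"] = f"{id_field}:{out[id_field]}"
    return out

rows = [
    {"id": "42", "status": "PendingDelete"},
    {"id": "43", "status": "Active"},
]
for row in rows:
    print(mark_pending_deletes(row))
```

The point of the model is that the command travels as an extra key on the row itself, so the decision to delete is made per row during the import.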

Re: browse terms of index

2009-10-15 Thread Grant Ingersoll
Have a look at http://wiki.apache.org/solr/TermsComponent On Oct 15, 2009, at 5:43 AM, jfmel...@free.fr wrote: Hi, I use a sample embedded Apache Solr to create a Lucene index with a few documents for test purposes. Documents have text string, sint, sfloat, bool, and date fields, each of them

browse terms of index

2009-10-15 Thread jfmelian
Hi, I use a sample embedded Apache Solr to create a Lucene index with a few documents for test purposes. Documents have text string, sint, sfloat, bool, and date fields, each of which is indexed. At this time they are also stored, but only the document ids will be stored in the end. I want to

Re: Adding callback url to data import handler...Is this possible?

2009-10-15 Thread Noble Paul നോബിള്‍ नोब्ळ्
It is not yet implemented. You may open an issue for the same. --Noble On Thu, Oct 15, 2009 at 12:14 PM, William Pierce wrote: > If the JavaScript support enables me to invoke a URL, it's really OK with > me. > > Cheers, > > - Bill > > -- > From: "

Re: Boosting of words

2009-10-15 Thread bhaskar chandrasekar
Hi,   I am able to see the results when i pass the values in the query browser.   When i pass the below query i am able to see the difference in output.   http://localhost:8983/solr/select/?q=java^100%20technology^1   Each time user cannot pass the values in the query browser to see the output.   B

Filtered search for subset of ids

2009-10-15 Thread Andrea D'Ippolito
Hi everybody, I'm new here, and this is my last chance to find a solution for my problem. I'm using acts_as_solr for Ruby on Rails. I need to submit a query to a subset of documents whose ids belong to an array of ids that I want to pass as a parameter. For instance, something like: find_by_solr(qu

Re: 'Down' boosting shorter docs

2009-10-15 Thread Andrea D'Ippolito
I've read (correct me if I'm wrong) that a solution to achieve that is to overboost all the other fields. But I guess this works easily only if you have few fields indexed ;) bye 2009/10/15 Simon Wistow > Our index has some items in it which basically contain a title and a > single word body. > > If

Re: Sorting on Multiple fields

2009-10-15 Thread Neil Lunn
On Thu, Oct 15, 2009 at 12:55 AM, Avlesh Singh wrote: > > > > Do we attempt to raise some sort of functional query to find the least > > amount of the requested price id's? This would seem to imply some playing > > around in the query handler to allow a function of this sort. > > > Unless I am mi