Re: Date Facet Giving Count more than actual

2009-11-05 Thread Aakash Dharmadhikari
Thanks Hoss, the problem is resolved. The real problem was my query parameter. I was storing daysForFilter with offset of 1 sec, and date in query parameter "facet.date.start" also had same offset. This was causing the overlaps, as in the facet value of 2009-10-23T18:30:01 was matching both 2009-1

ERROR: multiple values encountered for non multiValued copy field

2009-11-05 Thread Christian López Espínola
Hi, I'm using solr with solrj and when I specify a field to copy in my schema it stops working with the exception: org.apache.solr.client.solrj.SolrServerException: org.apache.solr.client.solrj.SolrServerException: org.apache.solr.common.SolrException: ERROR: multiple values encountered for non

Re: leading and trailing wildcard query

2009-11-05 Thread Otis Gospodnetic
> Please elaborate. What do you mean by *desrever* string? Try reading in reverse ;). Otis -- Sematext is hiring -- http://sematext.com/about/jobs.html?mls Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR - Original Message > From: A. Steven Anderson > To: solr-user@luce

Re: CPU Max Utilization

2009-11-05 Thread Otis Gospodnetic
You may also want to share some sample queries, your fields definitions, and tell us how long a core remains 100% utilized. Otis -- Sematext is hiring -- http://sematext.com/about/jobs.html?mls Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR - Original Message > From: b

Re: solr query help alpha numeric and not

2009-11-05 Thread Joel Nylund
Avlesh, thanks those worked, for somre reason I never got your mail, found it in one of the list archives though. thanks again Joel On Nov 5, 2009, at 9:08 PM, Avlesh Singh wrote: Didn't the queries in my reply work? Cheers Avlesh On Fri, Nov 6, 2009 at 4:16 AM, Joel Nylund wrote: Hi yes

Re: DIH timezone offset

2009-11-05 Thread Noble Paul നോബിള്‍ नोब्ळ्
anyone to add this here http://wiki.apache.org/solr/DataImportHandlerFaq On Thu, Nov 5, 2009 at 8:35 PM, wrote: > """ > DIH relies on the driver to get the date. It does not do any automatic > conversion. Is it possible for the driver to give the date with the > right offset? > """ > > I have re

Re: specify multiple files in for DataImportHandler

2009-11-05 Thread Jay Hill
You can set up multiple request handlers each with their own configuration file. For example, in addition to the config you listed you could add something like this: data-two-config.xml and so on with as many handlers as you need. -Jay http://www.lucidimagination.com On Thu, Nov 5, 2009 a

Re: CPU Max Utilization

2009-11-05 Thread Walter Underwood
Are you requesting results by relevance or are you sorting by a field? How many results are you requesting? Are you using real user queries (with repetition) or a flat distrubution of queries? wunder On Nov 5, 2009, at 6:20 PM, ba ba wrote: Greetings, I'm running a solr instance with 100

CPU Max Utilization

2009-11-05 Thread ba ba
Greetings, I'm running a solr instance with 100 million documents in it. The index is 18 GB. The strange behavior I'm seeing is CPU utilization gets maxed out. I'm running on an 8 core machine with 32 GB or ram. Every concurrent query I run on it uses up one of the cores. So, if I am running 1 co

Re: solr query help alpha numeric and not

2009-11-05 Thread Avlesh Singh
Didn't the queries in my reply work? Cheers Avlesh On Fri, Nov 6, 2009 at 4:16 AM, Joel Nylund wrote: > Hi yes its a string, in the case of a title, it can be anything, a letter a > number, a symbol or a multibyte char etc. > > Any ideas if I wanted a query that was not a letter a-z or a number

Re: StreamingUpdateSolrServer - indexing process stops in a couple of hours

2009-11-05 Thread Yonik Seeley
Seems fixed. https://issues.apache.org/jira/browse/SOLR-1543 -Yonik http://www.lucidimagination.com On Mon, Nov 2, 2009 at 6:05 AM, Shalin Shekhar Mangar wrote: > I'm able to reproduce this issue consistently using JDK 1.6.0_16 > > After an optimize is called, only one thread keeps adding doc

Re: leading and trailing wildcard query

2009-11-05 Thread Andrzej Bialecki
A. Steven Anderson wrote: No thoughts on this? Really!? I would hate to admit to my Oracle DBE that Solr can't be customized to do a common query that a relational database can do. :-( On Wed, Nov 4, 2009 at 6:01 PM, A. Steven Anderson < a.steven.ander...@gmail.com> wrote: I've scoured the a

Re: field queries seem slow

2009-11-05 Thread Lance Norskog
Restarting Solr clears out all caching. Doing a commit used to drop all of the caches for new requests, but it no longer does this. On Linux you can clear the kernel's disk buffer cache with a special hook. You echo '1' into a /proc/something and this tells the kernel to drop its caches. Sorry, d

Re: Newb Question about the TemplateTransformer

2009-11-05 Thread Lance Norskog
I think you need custom code for this. You can write plugins in Java, or (in Java 1.6) any of the Java-based scripting languages like JavaScript. http://wiki.apache.org/solr/DataImportHandler#ScriptTransformer On Thu, Nov 5, 2009 at 8:54 AM, Mark Ellul wrote: > Hi Noble, > > Thanks for the respo

Re: DIH full-import with fetchSize(Integer.MIN_VALUE) taking long time to start processing rows

2009-11-05 Thread Lance Norskog
Right, a view will not help here. It is just and SQL query embedded as a virtual table, and is used to lift SQL syntax out of the DIH. InnoDB is row-level except for auto-increment operations. Ow. You could drop the indexes on the table. Each insert batch has to recalculate all indexes, so this wi

Re: Regarding to ramBufferSizeMB and mergeFactor

2009-11-05 Thread Attachot Tuangphon
Hi, Jeff Newburn Thank you for you good explanations. That helps me a lot. Attachot Tuangphon On 09/11/06 0:36, "Jeff Newburn" wrote: > If I am correct the two are related but not dependent on each other. Merge > factor is used to determine how many segment files exist on disk where as > the

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
> Not sure what version it was supported from, but we're on 1.3. Really!? Great answer! Thanks! -- A. Steven Anderson

RE: leading and trailing wildcard query

2009-11-05 Thread Bernadette Houghton
Not sure what version it was supported from, but we're on 1.3. bern -Original Message- From: A. Steven Anderson [mailto:a.steven.ander...@gmail.com] Sent: Friday, 6 November 2009 10:25 AM To: solr-user@lucene.apache.org Subject: Re: leading and trailing wildcard query > Hi Steve, a query

Re: Set MMap in Solr

2009-11-05 Thread ba ba
Thanks for the help. -Brad Anderson 2009/11/5 Otis Gospodnetic > To use MMapDirectory, invoke Java with the System property > org.apache.lucene.FSDirectory.class set to > org.apache.lucene.store.MMapDirectory. This will cause > FSDirectory.getDirectory(File,boolean) to return instances of this

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
> Note that N-grams are limited to specific string lengths. I presume that > you need to search for arbitrary strings, not just three-letter ones. > Understood, but that is a limitation that we can live with. Thanks! -- A. Steven Anderson

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
> Ah. With that restriction, it is impossible. > If it is OK to pay Lucid to make a one-line change, you might be able to do > it. Otherwise, get ready to spend a lot of money for a search engine. > Well, now that Lucid is getting In-Q-Tel $$$, they will soon learn that officially releases are all

Re: leading and trailing wildcard query

2009-11-05 Thread Walter Underwood
Note that N-grams are limited to specific string lengths. I presume that you need to search for arbitrary strings, not just three-letter ones. wunder On Nov 5, 2009, at 3:23 PM, Bernadette Houghton wrote: Hi Steve, a query such as *abc* would need the NGramFilterFactor, hence the doubleed

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
> Hi Steve, a query such as *abc* would need the NGramFilterFactor, hence the > doubleedgytext, and would be retrievable by a query such as contains:abc. > Note that you can set the max and minimum size of strings that get indexed. > Excellent! Just to clarify though, NGramFilterFactor is a Solr

Re: leading and trailing wildcard query

2009-11-05 Thread Walter Underwood
Ah. With that restriction, it is impossible. If it is OK to pay Lucid to make a one-line change, you might be able to do it. Otherwise, get ready to spend a lot of money for a search engine. wunder On Nov 5, 2009, at 3:18 PM, A. Steven Anderson wrote: Unfortunately, we can only use offici

RE: leading and trailing wildcard query

2009-11-05 Thread Bernadette Houghton
Hi Steve, a query such as *abc* would need the NGramFilterFactor, hence the doubleedgytext, and would be retrievable by a query such as contains:abc. Note that you can set the max and minimum size of strings that get indexed. bern -Original Message- From: A. Steven Anderson [mailto:a.st

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
> Doesn't it work to call SolrQueryParser.setAllowLeadingWildcard? Good question. Anyone? > It can be really slow, what an RDBMS person would call a full table scan. Understood. > There is an open bug to make that settable in a config file, but this is a > pretty tiny change to the source.

Re: leading and trailing wildcard query

2009-11-05 Thread Erick Erickson
Because that is the semantics of Solr/Lucene wildcard syntax. * stands for "any number of any character". Basically, it enumerates all the terms in the field for all the documents and assembles a list of all of them that contain the substring "abc" and uses that as one of the clauses of your search

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
Thanks for the solution, but could you elaborate on how it would find something like *abc* in a field that contains abc. Steve On Thu, Nov 5, 2009 at 5:25 PM, Bernadette Houghton < bernadette.hough...@deakin.edu.au> wrote: > I've just set up something similar (much thanks to Avesh!)- > >

Re: solr query help alpha numeric and not

2009-11-05 Thread Joel Nylund
Hi yes its a string, in the case of a title, it can be anything, a letter a number, a symbol or a multibyte char etc. Any ideas if I wanted a query that was not a letter a-z or a number 0-9, given that its a string? thanks Joel On Nov 4, 2009, at 9:10 AM, Jonathan Hendler wrote: Hi Joel,

MoreLikeThis and filtering/restricting on "target" fields

2009-11-05 Thread Cody Caughlan
I am trying to use MoreLikeThis (both the component and handler, trying combinations) and I would like to give it an input document reference which has a "source" field to analyze and then get back other documents which have a given field that is used by MLT. My dataset is composed of documents li

Re: leading and trailing wildcard query

2009-11-05 Thread Walter Underwood
Doesn't it work to call SolrQueryParser.setAllowLeadingWildcard? It can be really slow, what an RDBMS person would call a full table scan. There is an open bug to make that settable in a config file, but this is a pretty tiny change to the source. http://issues.apache.org/jira/browse/S

RE: leading and trailing wildcard query

2009-11-05 Thread Bernadette Houghton
I've just set up something similar (much thanks to Avesh!)- . . . . bern -Original Message- From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] Sent: Friday, 6 November 2009

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
> > The guilt trick is not the best thing to try on public mailing lists. :) > Point taken, although not my intention. I guess I have been spoiled by quick replies and was getting to think it was a stupid question. Plus, I'm literally gonna get trash talk from my Oracle DBE if I can't make this

Re: Set MMap in Solr

2009-11-05 Thread Otis Gospodnetic
To use MMapDirectory, invoke Java with the System property org.apache.lucene.FSDirectory.class set to org.apache.lucene.store.MMapDirectory. This will cause FSDirectory.getDirectory(File,boolean) to return instances of this class. So, start your servlet container with -Dorg.apache.lucene.FSDi

Re: leading and trailing wildcard query

2009-11-05 Thread Otis Gospodnetic
The guilt trick is not the best thing to try on public mailing lists. :) The first thing that popped to my mind is to use 2 fields, where the second one contains the desrever string of the first one. The second idea is to use n-grams (if it's OK to tokenize), more specifically edge n-grams. Oti

Re: how to use ajax-solr - example?

2009-11-05 Thread Lance Norskog
google "applying a diff patch" http://www.linuxjournal.com/article/1237 looks like a good start. On Thu, Nov 5, 2009 at 6:39 AM, Joel Nylund wrote: > this is exactly what I was looking for, any directions how to install? I > dont really understand how to use a .patch file. > > thanks > Joel >

Re: Sending file to Solr via HTTP POST

2009-11-05 Thread Jay Hill
Here is a brief example of how to use SolrJ with the ExtractingRequestHandler: ContentStreamUpdateRequest req = new ContentStreamUpdateRequest("/update/extract"); req.addFile(fileToIndex); req.setParam("literal.id", getId(fileToIndex)); req.setParam("literal

Re: leading and trailing wildcard query

2009-11-05 Thread A. Steven Anderson
No thoughts on this? Really!? I would hate to admit to my Oracle DBE that Solr can't be customized to do a common query that a relational database can do. :-( On Wed, Nov 4, 2009 at 6:01 PM, A. Steven Anderson < a.steven.ander...@gmail.com> wrote: > I've scoured the archives and JIRA , but the

Set MMap in Solr

2009-11-05 Thread ba ba
Hi, I'm trying to set my default directory to MMap. I saw that this is done by specifying here A DirectoryProvider plugin can be configured in solrconfig.xml with the following XML: in solrconfig.xml. This did not work for me when I put in the MMapDirectory class name. I got this informatio