getting data from only one database

2011-08-26 Thread mss.mss
Hi, we set up a Solr instance that is connected to two databases, and we created a jQuery autocomplete. Both databases contain keywords, and that is the default search. Beside the search button we are creating one more drop-down list naming the two databases; when the user clicks on one database and en

Re: Shingle and Query Performance

2011-08-26 Thread Erik Hatcher
On Aug 26, 2011, at 17:49 , Lord Khan Han wrote: > We are indexing news documents from various sites. Currently we have > 200K docs indexed. Total index size is 36 GB. There are also attachments to > the news (PDFs, docs, etc.), so document size could be high (e.g. 10 MB). > > We are using some com

auto suggestion with text_en field

2011-08-26 Thread Paul
Sorry if this has been asked before, but I couldn't seem to find it... I've got a fairly simple index, and I'm searching on a field of type text_en, and the results are good: I search for "computer" and I get back hits for "computer", "computation", "computational", "computing". I also want to cr

Shingle and Query Performance

2011-08-26 Thread Lord Khan Han
Hi, We are indexing news documents from various sites. Currently we have 200K docs indexed. Total index size is 36 GB. There are also attachments to the news (PDFs, docs, etc.), so document size could be high (e.g. 10 MB). We are using some complex queries which include around 30-40 terms per qu

Viewing the complete document from within the index

2011-08-26 Thread karthik
Hi Everyone, I am trying to see what's the best way to view the entire document as it's indexed within Solr/Lucene. I have tried to use Luke, but it's still showing me only the fields that I have configured to be returned [i.e., stored=true], unless I am not enabling some option in the tool. Is there

synonyms vs replacements

2011-08-26 Thread Robert Petersen
Hello all, Which is better? Say you add an index time synonym between nunchuck and nunchuk and then both words will be in the document and both will be searchable. I can get the same exact behavior by putting an index time replacement of nunchuck => nunchuk and a search time replacement of

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread Gora Mohanty
On Fri, Aug 26, 2011 at 6:51 PM, nagarjuna wrote: > Thank you very much for your reply, Erick Erickson. > I am using Solr version 3.3.0 > and I have no idea about cron jobs; I thought they would be for Unix > but I am using Windows, > and I would like to integrate my scheduling task with my solr

Re: Highlight on alternateField

2011-08-26 Thread Val Minyaylo
Thanks a lot Koji. On 8/25/2011 5:04 PM, Koji Sekiguchi wrote: (11/08/26 2:32), Val Minyaylo wrote: Hi there, I am trying to utilize highlighting alternateField and can't get highlights on the results from targeted fields. Is this expected behavior or am I understanding alternateFields wrong?

Re: Solr Geodist

2011-08-26 Thread solrnovice
Eric, thanks for the quick response. I left out the "d" value; yes, when you perform a spatial query, we should have a distance of d>0, sorry about that. What is the setting of your "store" value, I mean in the schema, was it marked as LatLong? For some reason I don't see the geodist() being retur

Re: Solr Geodist

2011-08-26 Thread Erick Erickson
When I try this from the stock Solr example (using "store" rather than "coordinate" for the field), your first example gives me an error of "d must be > 0" or some such. When I add a "d" value to the query, both your first and second queries work just fine and the second returns a "geodist" value in
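A minimal sketch of the filter parameters discussed in this thread, including the "d" value whose absence caused the error. The parameter names (q, fq, sfield, pt, d) come from the query quoted in the thread; the helper names and the coordinates in the usage note are hypothetical, since the original point value is truncated in this listing.

```python
from urllib.parse import urlencode


def geofilt_params(sfield, lat, lon, d_km):
    """Build the parameters for a {!geofilt} filter query.

    Rejects a missing or non-positive distance up front, mirroring
    the "d must be > 0" error reported in the thread.
    """
    if d_km is None or d_km <= 0:
        raise ValueError("d must be > 0")
    return {
        "q": "*:*",
        "fq": "{!geofilt}",
        "sfield": sfield,        # name of the spatial field in the schema
        "pt": f"{lat},{lon}",    # centre point as "lat,lon"
        "d": str(d_km),          # search radius
    }


def geofilt_query_string(sfield, lat, lon, d_km):
    """URL-encode the parameters so they can be appended to a /select URL."""
    return urlencode(geofilt_params(sfield, lat, lon, d_km))
```

For example, geofilt_query_string("coordinates", 45.15, -93.85, 5) yields a query string that can be appended to the select handler URL; how the distance itself is returned depends on the Solr version in use.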

Solr Geodist

2011-08-26 Thread solrnovice
Hi, I am trying to return distance in the Solr query by passing in "fl=geodist()", and I don't see distance being returned. We have a field called coordinates which is configured as latlong, and when I perform the following search, I do see results: q=*:*&fq={!geofilt}&sfield=coordinates&pt=31.22

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread simon
The built-in DIH scheduling was never implemented as far as I know - the Wiki section is just a design proposal and explicitly says "Hasn't been committed to SVN (published only here) " On Windows, you can use the Task Scheduler to do the kinds of things that cron does on Unix/Linux. -Simon On F

Re: syntax for functions used in the fq parameter

2011-08-26 Thread Erick Erickson
Why doesn't " AND text:foo" fill this requirement? Best Erick On Fri, Aug 26, 2011 at 2:27 PM, Jason Toy wrote: > I'm trying to limit my data to only docs that have the word 'foo' appear at > least once. > I am trying to use: > fq=termfreqdata,'foo'):[1+TO+*] > > but I get the syntax error: > Ca

syntax for functions used in the fq parameter

2011-08-26 Thread Jason Toy
I'm trying to limit my data to only docs that have the word 'foo' appear at least once. I am trying to use: fq=termfreqdata,'foo'):[1+TO+*] but I get the syntax error: Caused by: org.apache.lucene.queryparser.classic.ParseException: Encountered " ":" ": "" at line 1, column 33. Was expecting one o
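One commonly used way to filter on a function value is the function range parser ({!frange}) rather than range syntax after a function call. The sketch below only builds the filter string; the field name "text" and the helper names are assumptions for illustration, and whether termfreq() is available depends on the Solr version. The simpler alternative suggested in the reply, a plain term filter, is shown alongside it.

```python
def min_termfreq_filter(field, term, minimum=1):
    """Build an fq value that keeps docs where termfreq(field, term) >= minimum.

    Uses the {!frange} parser with a lower bound; the term is assumed
    not to contain single quotes.
    """
    return f"{{!frange l={minimum}}}termfreq({field},'{term}')"


def term_filter(field, term):
    """Build the plain term filter, which already requires one occurrence."""
    return f"{field}:{term}"
```

For "appears at least once", term_filter("text", "foo") is sufficient; the frange form only becomes useful when the minimum is greater than one.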

Re: New IndexSearcher and autowarming

2011-08-26 Thread Mike Austin
Hi Erick, It might work. I've only worked with solr having one index on one server over a year ago so I might need to just research more about the replication. I am using windows and I remember that replication on windows had some issues with scripts and hard links, however it looks like we have

Re: New IndexSearcher and autowarming

2011-08-26 Thread Erick Erickson
Why doesn't standard replication with auto-warming work for you? You can control how often replication gets triggered by controlling your commit points and/or your replication interval. This seems easier than maintaining cores like your problem statement indicates. Best Erick On Fri, Aug 26, 2011

Re: New IndexSearcher and autowarming

2011-08-26 Thread simon
The multicore API (see http://wiki.apache.org/solr/CoreAdmin ) allows you to swap, unload, reload cores. That should allow you to do what you want, -Simon On Fri, Aug 26, 2011 at 11:13 AM, Mike Austin wrote: > I would like to have the ability to keep requests from being slowed from > new > docum
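As a rough illustration of the CoreAdmin approach, the sketch below builds a SWAP request URL. The action, core and other parameters are those of the CoreAdmin API referenced above; the base URL and the core names "live" and "staging" are hypothetical.

```python
from urllib.parse import urlencode


def core_admin_url(base_url, action, **params):
    """Build a CoreAdmin request URL such as .../admin/cores?action=SWAP&..."""
    query = urlencode({"action": action, **params})
    return f"{base_url.rstrip('/')}/admin/cores?{query}"


# Swap the freshly updated core into the place of the one serving queries.
swap_url = core_admin_url(
    "http://localhost:8983/solr", "SWAP", core="live", other="staging"
)
```

Issuing an HTTP GET against swap_url once the update core has finished committing would exchange the two cores, so searches keep hitting a core that is not being written to.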

Re: DIH importing

2011-08-26 Thread simon
It sounds as though you are optimizing the index after the delta import. If you don't do that, then only new segments will be replicated and syncing will be much faster. On Fri, Aug 26, 2011 at 12:08 PM, Mark wrote: > We are currently delta-importing using DIH after which all of our servers > h

Re: Paging over mutlivalued field results?

2011-08-26 Thread darren
Yeah, I've resigned myself to this being the most practical workaround. But it also means a 100-to-1 explosion in my index size. For every book document, there will now be 100 (for example) sentence documents from it. What's the best way to submit a feature request for Solr? Many thanks. On Fri, 26 Aug

DIH importing

2011-08-26 Thread Mark
We are currently delta-importing using DIH, after which all of our servers have to download the full index (16G). This obviously puts quite a strain on our slaves while they are syncing over the index. Is there any way not to sync over the whole index, but rather just the parts that have changed?

RE: commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
Alexi, Yes but no difference. This is apparently an issue introduced in 3.*. Thanks for your help. -Gary -Original Message- From: Alexei Martchenko [mailto:ale...@superdownloads.com.br] Sent: Friday, August 26, 2011 10:45 AM To: solr-user@lucene.apache.org Subject: Re: commas in synony

RE: commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
Thanks, Yonik. Gary -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Friday, August 26, 2011 11:25 AM To: solr-user@lucene.apache.org Subject: Re: commas in synonyms.txt are not escaping On Fri, Aug 26, 2011 at 11:16 AM, Yonik Seeley w

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Yonik Seeley
On Fri, Aug 26, 2011 at 11:16 AM, Yonik Seeley wrote: > On Fri, Aug 26, 2011 at 10:17 AM, Moore, Gary wrote: >> >> I have a number of chemical names containing commas which I'm mapping in >> index_synonyms.txt thusly: >> >> 2\,4-D-butotyl=>Aqua-Kleen,BRN 1996617,Bladex-B,Brush killer 64,Butoxy-D

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Yonik Seeley
On Fri, Aug 26, 2011 at 10:17 AM, Moore, Gary wrote: > > I have a number of chemical names containing commas which I'm mapping in > index_synonyms.txt thusly: > > 2\,4-D-butotyl=>Aqua-Kleen,BRN 1996617,Bladex-B,Brush killer 64,Butoxy-D > 3,CCRIS 8562 > > According to the sample synonyms.txt, the

New IndexSearcher and autowarming

2011-08-26 Thread Mike Austin
I would like to have the ability to keep requests from being slowed from new document adds and commits by having a separate index that gets updated. Basically a read-only and an updatable index. After the update index has finished updating with new adds and commits, I'd like to switch the update to

core creation and instanceDir parameter

2011-08-26 Thread Gérard Dupont
Hi all, Playing with multicore and dynamic creation of new cores, I found out that there is one mandatory parameter, "instanceDir", which is needed to find the location of solrconfig.xml and schema.xml. Since all my cores share the same configuration (found relative to the $SOLR_HOME defined

what is scheduling ? why should we do this?how to achieve this ?

2011-08-26 Thread nagarjuna
I don't know exactly what scheduling means in Solr, why I should do it, or how I can achieve it. Please help me with this. I have already seen http://wiki.apache.org/solr/DataImportHandler?highlight=%28scheduling%29#Scheduling so, apart from that one, please send me any sample code or links

Re: Solr Implementations

2011-08-26 Thread zarni aung
Thank you so much for your response Erik. On Fri, Aug 26, 2011 at 8:30 AM, Erick Erickson wrote: > See below > > On Thu, Aug 25, 2011 at 4:22 PM, zarni aung wrote: > > First, I would like to apologize if this is a repeat question but can't > seem > > to get the right answer anywhere. > > > > -

Re: Solr and client app on same Jetty?

2011-08-26 Thread Gérard Dupont
Hi, On 26 August 2011 16:23, Arcadius Ahouansou wrote: > Hello. > > I have Solr running on Jetty and I also have a web client application > running on another jetty instance on the same box. > > The question is: wouldn't it be better to run the client and solr on the > very same jetty instance?

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Alexei Martchenko
Gary, isn't your wordDelimiter removing your commas at query time? Have you tried it in the analyzer? 2011/8/26 Moore, Gary > Here you go -- I'm just hacking the text field at the moment. Thanks, > Gary > > > > > synonyms="index_synonyms.txt" > tokenizerFactory="solr.K

RE: commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
Here you go -- I'm just hacking the text field at the moment. Thanks, Gary -Original Message- From: Alexei Martchenko [m

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Alexei Martchenko
Gary, please post the entire field declaration so I can try to reproduce here 2011/8/26 Moore, Gary > > I have a number of chemical names containing commas which I'm mapping in > index_synonyms.txt thusly: > > 2\,4-D-butotyl=>Aqua-Kleen,BRN 1996617,Bladex-B,Brush killer 64,Butoxy-D > 3,CCRIS 856

Solr and client app on same Jetty?

2011-08-26 Thread Arcadius Ahouansou
Hello. I have Solr running on Jetty and I also have a web client application running on another jetty instance on the same box. The question is: wouldn't it be better to run the client and solr on the very same jetty instance? I came across http://wiki.apache.org/solr/Solrj#EmbeddedSolrServer as

commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
I have a number of chemical names containing commas which I'm mapping in index_synonyms.txt thusly: 2\,4-D-butotyl=>Aqua-Kleen,BRN 1996617,Bladex-B,Brush killer 64,Butoxy-D 3,CCRIS 8562 According to the sample synonyms.txt, the comma above should be escaped, i.e. a\,a=>b\,b. The problem is that ac
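To make the intended behaviour concrete, here is a small sketch of how a rule with backslash-escaped commas would be expected to split. This is not Solr's own parser, just an illustration of the escaping convention from the sample synonyms.txt; the function names are hypothetical, and an escaped "=>" is not handled.

```python
def split_unescaped(text, sep=","):
    """Split on sep, treating backslash-escaped characters as literals."""
    parts, current = [], []
    i = 0
    while i < len(text):
        ch = text[i]
        if ch == "\\" and i + 1 < len(text):
            # Keep the escaped character and drop the backslash.
            current.append(text[i + 1])
            i += 2
            continue
        if ch == sep:
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
        i += 1
    parts.append("".join(current).strip())
    return parts


def parse_mapping(line):
    """Parse an explicit 'lhs=>rhs' synonym rule into two token lists."""
    lhs, rhs = line.split("=>", 1)
    return split_unescaped(lhs), split_unescaped(rhs)
```

With this convention, the left-hand side 2\,4-D-butotyl stays a single term containing a comma, while the unescaped commas on the right-hand side separate the individual replacement terms.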

Re: Paging over mutlivalued field results?

2011-08-26 Thread Erik Hatcher
The way folks have addressed this situation to date is to model the "multivalued fields" as additional documents too. On Aug 26, 2011, at 09:32 , wrote: > > Many thanks Erick. > > I think a good feature to add to Solr to address this is > to allow a query to return either the "document" as

Re: Paging over mutlivalued field results?

2011-08-26 Thread darren
Many thanks Erick. I think a good feature to add to Solr to address this is to allow a query to return either the "document" as a result or the matching (multivalued) fields of a document as individual results (subject to paging too). Because sometimes the field value (only) is the desired result

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread nagarjuna
Thank you very much for your reply, Erick Erickson. I am using Solr version 3.3.0 and I have no idea about cron jobs; I thought they would be for Unix, but I am using Windows, and I would like to integrate my scheduling task with my Solr server. Please give me a suggestion. -- View this messa

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread Igor MILOVANOVIC
Easiest way is to do it via cron job. 2011/8/26 nagarjuna > hi everybody... > i dont know about how to perform DIH scheduling for fullimport in > solri got little bit information from > > http://stackoverflow.com/questions/3206171/how-can-i-schedule-data-imports-in-solr > here > bu

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread vighnesh
Thanks for the response. I am unable to configure this; please provide any sample code showing how to use the TemplateTransformer. -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-differentiate-multiple-datasources-when-building-solr-query-tp3286309p3286816.html Sent from the

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread Erick Erickson
Although I'd really recommend using underscore rather than hyphen, since '-' is a query operator and it'll mess you up *sometime* ... Best Erick On Fri, Aug 26, 2011 at 8:43 AM, Erik Hatcher wrote: > Vighnesh - > > What you're looking for is DataImportHandler's TemplateTransformer.  Docs > her

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread Erick Erickson
Actually, the easiest thing to do would be to make a cron job on *nix or use task scheduler on windows to fire off a delta-import request to your solr server on a schedule you'd like. The code you reference appears to be for Solr 1.2, which is way old... Best Erick On Fri, Aug 26, 2011 at 7:45 A
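A small script along these lines could be what the scheduler runs. It is only a sketch: the base URL and the "/dataimport" handler path are assumptions that depend on how the DataImportHandler is registered in solrconfig.xml, and the function names are hypothetical.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def dih_command_url(base_url, command="delta-import", handler="/dataimport"):
    """Build the URL that asks the DataImportHandler to run a command."""
    query = urlencode({"command": command})
    return f"{base_url.rstrip('/')}{handler}?{query}"


def trigger_import(base_url, command="delta-import", timeout=30):
    """Send the request and return the HTTP status code."""
    with urlopen(dih_command_url(base_url, command), timeout=timeout) as resp:
        return resp.status


# Intended to be called from cron or the Windows Task Scheduler, e.g.:
# trigger_import("http://localhost:8983/solr")
```

The scheduler itself stays outside Solr: cron on *nix or the Task Scheduler on Windows simply runs the script at the desired interval.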

Re: Paging over mutlivalued field results?

2011-08-26 Thread Erick Erickson
OK, I think I have it. It's a problem, indeed. And no, there's no way I know of to make a doc fetch only bring back some range of values in a multivalued field. So you're stuck with either getting the whole book back and peeling out the pages (how do you know which sentences are on which page any

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread Erik Hatcher
Vighnesh - What you're looking for is DataImportHandler's TemplateTransformer. Docs here: Basically just enable the TemplateTransformer in each of your DIH configs then set a literal field value like this differently for each

Re: Solr Implementations

2011-08-26 Thread Erick Erickson
See below On Thu, Aug 25, 2011 at 4:22 PM, zarni aung wrote: > First, I would like to apologize if this is a repeat question but can't seem > to get the right answer anywhere. > >   - What happens to pending documents when the server dies abruptly?  I >   understand that when the server shuts dow

Re: SolrServer instances

2011-08-26 Thread François Schiettecatte
Sounds to me that you are looking for HTTP Persistent Connections (connection keep-alive as opposed to close), and a singleton object. This would be outside SOLR per se. A few caveats though, I am not sure if tomcat supports keep-alive, and I am not sure how SOLR deals with multiple requests co

Re: missing field in schema browser on solr admin

2011-08-26 Thread Erik Hatcher
Is the field stored? Do you see it on documents when you do a q=*:* search? How is that field defined and populated? (exact config/code needed here) Erik On Aug 25, 2011, at 23:07 , deniz wrote: > hi all... > > i have added a new field to index... but now when i check solr admin, i s

Re: solr indexing exception

2011-08-26 Thread Gora Mohanty
On Fri, Aug 26, 2011 at 1:47 PM, abhijit bashetti wrote: > Hi, > > I am using DIH for indexing 50K documents . > > I am using 64-bit machine with 4GB RAM How much memory is allocated to Solr? What is the approximate size of the data being indexed into Solr? Regards, Gora

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread vighnesh
Yes, those are the names of the two data sources. How can I get data from only datasource-1, from only datasource-2, or from both? -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-differentiate-multiple-datasources-when-building-solr-query-tp3286309p3286325.html Sent from the Solr -

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread Lance Norskog
Did you mean datasource-1 and datasource-2? On Fri, Aug 26, 2011 at 2:41 AM, vighnesh wrote: > Hi all, > > I have two data sources in the data-config file and I need data from the first > data source, the second data source, and from both. How can I achieve this in a Solr > query? > > Example: first dataso

how to differentiate multiple datasources when building solr query....

2011-08-26 Thread vighnesh
Hi all, I have two data sources in the data-config file and I need data from the first data source, the second data source, and from both. How can I achieve this in a Solr query? Example: first datasource: http://localhost:8983/solr/db/select/?q=newthread&version=2.2&start=0&rows=200&indent=on&datasource=da

solr indexing exception

2011-08-26 Thread abhijit bashetti
Hi, I am using DIH for indexing 50K documents. I am using a 64-bit machine with 4GB RAM. I got the following exception: org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.OutOfMemoryError: Java heap space at org.apache.solr.handler.dataimport.DocBuilder.buildDocumen