Re: Search and Entity structure

2012-10-25 Thread v vijith
Hi, Dear All, Apologies for the lengthy email. SOLR Version: 4. I'm a newbie to SOLR and have gone through the tutorial but could not find a solution. The requirement below doesn't seem to be impossible, but I think I'm missing the obvious. In my RDBMS, there is a Qualification table and an Employee tab

Re: DIH update?

2012-10-25 Thread Gora Mohanty
On 26 October 2012 08:51, Billy Newman wrote: > Any plans on adding update functionality to DIH? What do you mean by "update functionality"? Re-running an import with changed values for a document with an existing ID will update values in the Solr index. If you mean adding new documents, please

DIH update?

2012-10-25 Thread Billy Newman
Any plans on adding update functionality to DIH?

Re: Best way to commit data to Solr

2012-10-25 Thread Tomás Fernández Löbbe
If you don't need them searchable, you don't use Solr4 with transaction log and it is a relatively fast process (so it's not a pain to run it all again in case of some failure), I would just commit at the end. In any other case, I would use autocommit. Anyway, I'm not sure if you'll see much differ

Re: Field Grouping problem/question

2012-10-25 Thread Tomás Fernández Löbbe
By default you get the top result from each group (if you are not sorting on a field specifically, then you should be getting the document with the highest score per group). This can be changed with the "group.limit" parameter. See http://wiki.apache.org/solr/FieldCollapsing#Request_Parameters To
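The effect of "group.limit" can be sketched by building the request parameters. This is a minimal sketch, not the poster's actual request: the query text "laptop" and the field name "category" are hypothetical, and only the group, group.field, and group.limit parameter names come from the FieldCollapsing page linked above.

```python
from urllib.parse import urlencode

# Hypothetical query and grouping field, chosen only for illustration.
params = {
    "q": "laptop",
    "group": "true",            # turn on result grouping
    "group.field": "category",  # hypothetical field to group on
    "group.limit": 3,           # return up to 3 documents per group instead of 1
}

query_string = urlencode(params)
print("/select?" + query_string)
```

Leaving out group.limit falls back to the default of one document per group, which is the behavior described in the reply.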

Re: throttle segment merging

2012-10-25 Thread Tomás Fernández Löbbe
> > Is there way to set-up logging to output something when segment merging > runs? > I think segment merging is logged when you enable infoStream logging (you should see it commented in the solrconfig.xml) > > Can be segment merges throttled? > You can change when and how segments are merged wit

throttle segment merging

2012-10-25 Thread Radim Kolar
I have problems with very low indexing speed as soon as core size grows over 15 GB. I suspect that it may be due to I/O-intensive segment merging. Is there a way to set up logging to output something when segment merging runs? Can segment merges be throttled?

Re: How to boost exact match?

2012-10-25 Thread Samuel García Martínez
Whoops! Jack got it faster :D Sorry for the "double" posting. On Thu, Oct 25, 2012 at 11:41 PM, Jack Krupansky wrote: > You may want to do a copyfield and have one field that is stemmed for > casual matches at a lower boost while one is unstemmed and boosted higher. > > You could also do a copyfield

Re: How to boost exact match?

2012-10-25 Thread Samuel García Martínez
First of all, if you stem at index time but not at query time, your queries won't find anything valuable. This is an easy approach. If you want full-term (unstemmed) matches to be boosted, you have to query like: ?q=data management&defType=edismax&qf=*name_without_stemming^200* name^100 text ui
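The query above can be assembled as follows. This is a minimal sketch under two assumptions: the asterisks around the first field are taken to be email emphasis rather than part of the field name, and the trailing fields (text, uid^0.3) are taken from the original poster's query elsewhere in this thread because this snippet is cut off.

```python
from urllib.parse import urlencode, parse_qs

# The unstemmed copy of the name field gets the highest boost so that
# exact (unstemmed) matches outrank stemmed ones.
params = {
    "q": "data management",
    "defType": "edismax",
    "qf": "name_without_stemming^200 name^100 text uid^0.3",
}

query_string = urlencode(params)
print("?" + query_string)

# Decoding the string gives back the field list exactly as written.
decoded_qf = parse_qs(query_string)["qf"][0]
```

Building the string with urlencode also takes care of escaping the spaces and the ^ boost markers.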

Re: How to boost exact match?

2012-10-25 Thread Jack Krupansky
You may want to do a copyfield and have one field that is stemmed for casual matches at a lower boost while one is unstemmed and boosted higher. You could also do a copyfield to a non-tokenized field (TextField with KeywordTokenizer and lower case and trim filters) and give a match in that fie

Re: How to boost exact match?

2012-10-25 Thread bbarani
Thanks a lot for your reply. Consider the query below: ?q=data management&defType=edismax&qf=name^100 text uid^0.3&fl=name I am trying to boost the name as much as possible, but even then the results come back in the order below: - MANAGER - MANAGER - MANAGERS - ...AA DATA MANAGEMENT

Re: SolrCloud and distributed search

2012-10-25 Thread Tomás Fernández Löbbe
It also provides high availability for indexing and searching. On Thu, Oct 25, 2012 at 4:43 PM, Bill Au wrote: > So I guess one would use SolrCloud for the same reasons as distributed > search: > > When an index becomes too large to fit on a single system, or when a single > query takes too long

Re: SolrCloud and distributed search

2012-10-25 Thread Bill Au
So I guess one would use SolrCloud for the same reasons as distributed search: When an index becomes too large to fit on a single system, or when a single query takes too long to execute. Bill On Thu, Oct 25, 2012 at 3:38 PM, Shawn Heisey wrote: > On 10/25/2012 1:29 PM, Bill Au wrote: > >> Is

Re: SolrCloud and distributed search

2012-10-25 Thread Shawn Heisey
On 10/25/2012 1:29 PM, Bill Au wrote: Is SolrCloud using distributed search behind the scene? Does it have the same limitations (for example, doesn't support MoreLikeThis) distributed search has? Yes and yes.

Search and Entity structure

2012-10-25 Thread v vijith
Dear All, Apologies for the lengthy email. SOLR Version: 4. I'm a newbie to SOLR and have gone through the tutorial but could not find a solution. The requirement below doesn't seem to be impossible, but I think I'm missing the obvious. In my RDBMS, there is a Qualification table and an Employee table. A

solr -autosuggest

2012-10-25 Thread Sujatha Arun
Hi, A few questions on Solr autosuggest below. Q1) I tried using the index-based Suggest functionality with Solr 3.6.1. Can I combine this with file-based boosting? Currently, when I specify the index field and the sourcelocation, the file in the source location is not considered. Is there any

Re: Field in DataConfig not in Solr Schema

2012-10-25 Thread Shawn Heisey
On 10/25/2012 10:52 AM, Billy Newman wrote: This is OK by me since I don't really want to index that field anyway, but I am wondering if I am doing something wrong. Is there a way to mark a column in the Data Config as 'not indexed'? Or should I put a field in my Solr Schema and mark it as indexe

Field in DataConfig not in Solr Schema

2012-10-25 Thread Billy Newman
I am using columns in my DataConfig specifically to grab them later as a template. i.e. Where stuff is the field I want to index, but allStuff is just a temporary column used so that I can get at it via the template transformer. 'stuff' is in my schema; 'allStuff' is not. I get the followi

Re: How to boost exact match?

2012-10-25 Thread Jack Krupansky
You neglected to show us your example queries and how you wanted to boost them. The simple answer is to supply multiple query terms with increasing level of detail (e.g., more terms in a quoted phrase) and apply a query boost that is higher for the more exact matches. If you don't get expected
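The idea of combining loose terms with a more heavily boosted quoted phrase can be sketched as below. This is only an illustration: the helper name boosted_query is hypothetical, the boost value 10 is arbitrary, and the terms are borrowed from the "account number" example in the original question. It does not escape special query characters, so it only suits plain words.

```python
def boosted_query(terms, phrase_boost):
    """Combine loose terms with the same terms as a boosted quoted phrase."""
    loose = " ".join(terms)
    phrase = '"' + loose + '"'
    # Documents matching the whole phrase pick up the extra boost on top of
    # whatever score the individual terms contribute.
    return f"({loose}) OR {phrase}^{phrase_boost}"


q = boosted_query(["account", "number"], 10)
print(q)  # (account number) OR "account number"^10
```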

Highlights not returning after upgrading from 3.3 to 4.0

2012-10-25 Thread Daniel Skiles
I'm running some tests on Solr 4.0 before putting it into production, and I've just encountered an issue with hit highlighting. I started by placing my index from 3.3 into a Solr 4.0 install. I then edited the field definition in the schema config to match my schema from 3.3, with the addition of

How to boost exact match?

2012-10-25 Thread bbarani
Hi, We have a name field which I am boosting using the dismax parser. I want the search to return documents that have an exact match ahead of those with a partial match. Ex: Name:account Name:account number Name:account number - closed account When I search for account number, I want solr to return the

ShardHandler - distribution to non-default request handler doesn't work

2012-10-25 Thread AlexeyK
I tried to define a suggest component as it appears in the Wiki. I also defined a specific /suggest request handler. This doesn't work in a SolrCloud setup, as the query is distributed to the default /select handler instead. Specifically, the shard handler gets default URLs and other cores forward to /select. s

Re: Solr question regarding handlers and escaping

2012-10-25 Thread Jack Krupansky
Could you provide a little more detail. I'm not sure if you are simply describing behavior you didn't expect or like, or whether there might be an actual problem. -- Jack Krupansky -Original Message- From: sameera somisetty Sent: Wednesday, October 24, 2012 6:47 PM To: solr-user@luce

Re: UnsupportedOperationException: ExternalFileField (SOLVED)

2012-10-25 Thread Carrie Coy
The problem seems to have been caused by my failure to completely remove the existing index files when I switched the inStock field from an indexed boolean field to externally maintained. After I removed everything and re-indexed from scratch, the error went away. On 10/24/2012 08:57 PM, Car

Re: one field type extending another

2012-10-25 Thread Jack Krupansky
Sorry, but not currently. Great suggestion though. I didn't check to see if there might already be a Jira for it, but I vaguely recall some discussion or mention of it. -- Jack Krupansky -Original Message- From: Radim Kolar Sent: Wednesday, October 24, 2012 5:49 PM To: solr-user@luce

Re: SolrCloud leader election on single node

2012-10-25 Thread Mark Miller
Thanks Alexey - Could you move this to a JIRA issue? - Mark On Oct 25, 2012, at 7:53 AM, AlexeyK wrote: > setup: > 1 node, 4 cores, 2 shards. > 15 documents indexed. > > problem: > init stage times out. > > probable cause: > According to the init flow, cores are initialized one by one synch

Re: org.apache.lucene.queryparser.classic.ParseException - a Bug?

2012-10-25 Thread Ahmet Arslan
Hi Deniz, Haven't used geofilt query parser but with solrJ you don't need URL encoding. Use solrQuery.set("fq","{!geofilt sfield=store}"); instead. --- On Thu, 10/25/12, deniz wrote: > From: deniz > Subject: org.apache.lucene.queryparser.classic.ParseException - a Bug? > To: solr-user@lucen
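The point about not URL-encoding by hand applies to any client that encodes parameters for you. The sketch below is a Python analogue using only the standard library, not SolrJ itself; it assumes the failure mode is double encoding, which the truncated original message does not confirm.

```python
from urllib.parse import urlencode, quote_plus, parse_qs

raw_fq = "{!geofilt sfield=store}"

# Correct: hand the raw value to the encoder exactly once.
encoded_once = urlencode({"fq": raw_fq})

# Mistake: encode by hand first, then let the library encode again.
encoded_twice = urlencode({"fq": quote_plus(raw_fq)})

# What the server would see after decoding each request once.
seen_once = parse_qs(encoded_once)["fq"][0]
seen_twice = parse_qs(encoded_twice)["fq"][0]

print(seen_once)   # the original local-params string
print(seen_twice)  # still percent-encoded, so the query parser cannot read it
```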

Re: Improving performance for use-case where large (200) number of phrase queries are used?

2012-10-25 Thread Peter Keegan
Yes #5 is the same thing (sorry, I didn't read them all thoroughly). Your description of the phrases being 'tags' suggests that you don't need term positions for matching, and as you noted, you would get unwanted partial matches. And, the TermQuerys would be much faster. Peter On Wed, Oct 24, 20

Re: MMapDirectory, demand paging, lazy evaluation, ramfs and the much maligned RAMDirectory (oh my!)

2012-10-25 Thread Erick Erickson
You may well have already seen this, but in case not: http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html FWIW, Erick On Wed, Oct 24, 2012 at 9:51 PM, Shawn Heisey wrote: > On 10/24/2012 6:29 PM, Aaron Daubman wrote: >> >> Let me be clear that I am not interested in RAMD

Facet date/range + facet.mincount + distributed search issue

2012-10-25 Thread Dovao Jimenez, Oscar
Dear Mrs/Mr, When using facet date/range on a date-typed field in a distributed search across schema-compatible cores, facet.mincount=1 returns a reduced set of facet values (over 500 facet values expected, 5 facet values retrieved). I wonder whether facet.mincount is supported on

SolrCloud leader election on single node

2012-10-25 Thread AlexeyK
setup: 1 node, 4 cores, 2 shards. 15 documents indexed. problem: init stage times out. probable cause: According to the init flow, cores are initialized one by one synchronously. Actually, the main thread waits in ShardLeaderElectionContext.waitForReplicasToComeUp until the retry threshold, while replic

Query elevation component fails

2012-10-25 Thread AlexeyK
Using SolrCloud release with following configuration: string elevate.xml explicit text elevator Running the query http://localhost:8080/solr/collection1/elevate?q=evelatedtext constantly getting the following exception: SEVERE: nu

one field type extending another

2012-10-25 Thread Radim Kolar
Can I do something like this? (It fails with fieldType: missing mandatory attribute 'class') termVectors="true" termPositions="true" termOffsets="true"/> One field type would extend another type to save copy and paste.