Re: Trying to understand error

2014-12-17 Thread Gora Mohanty
On 18 December 2014 at 13:10, Joel Stobart wrote: > When I try to upload a solr item by the index: > > /solr/#/collection1/documents > > When I upload a simple text file via the interface: > > Response: > > { > "error": { > "msg": "java.lang.NoSuchFieldError: LFH_SIG", > "trace": "java.l

Trying to understand error

2014-12-17 Thread Joel Stobart
When I try to upload a solr item by the index: /solr/#/collection1/documents When I upload a simple text file via the interface: Response: { "error": { "msg": "java.lang.NoSuchFieldError: LFH_SIG", "trace": "java.lang.RuntimeException: java.lang.NoSuchFieldError: LFH_SIG\n\tat org.ap

Fwd: Issue with SOLR Distributed Search

2014-12-17 Thread rashi gandhi
Hi, This is regarding the issue that we are facing with SOLR distributed search. In our application, we are managing multiple shards at SOLR server to manage the load. But there is a problem with the order of results that we going to return to client during the search. For Example: Currently ther

can't make sense of spellchecker results when using techproducts example

2014-12-17 Thread Chris Hostetter
Ok, so i've been working on updating hte ref guide to account for hte new way to run the "examples" in 5.0. The spell checking page... https://cwiki.apache.org/confluence/display/solr/Spell+Checking ...has some examples that loosely corroloate to the "techproducts" example, but even

Re: Exception writing document to the index; possible analysis error.

2014-12-17 Thread Shawn Heisey
On 12/17/2014 3:00 PM, bengates wrote: > I'm losing my hair trying to add a simple document on a freshly installed > Solr core. > I'm running Solr 4.10.2 on jetty with the start.jar. > > I have the following (managed) schema and my core has been reloaded : > > > Consider my core have 0 document. No

Re: A field-wide remove duplicate tokens filter

2014-12-17 Thread Walter Underwood
Why is that useful? It breaks phrase search. If you want to ignore term frequency in ranking, change the Similarity class. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Dec 17, 2014, at 2:40 PM, Varun Rajput wrote: > The org.apache.solr.analysis.RemoveDuplic

A field-wide remove duplicate tokens filter

2014-12-17 Thread Varun Rajput
The org.apache.solr.analysis.RemoveDuplicatesTokenFilter, as per its description, "Filters out any tokens which are at the same logical position in the tokenstream as a previous token with the same text." A very useful filter would be one which filters out duplicate tokens throughout the field,

Re: converting to parent/child block indexing

2014-12-17 Thread Michael Sokolov
Thanks, Mikhail! That explains the situation pretty well. -Mike On 12/17/14 4:49 PM, Mikhail Khludnev wrote: Hm.. really sorry about that. The current implementation is not really ideal, you know. When handles update it tries to recognize whether it block or not and in fact it uses _root_ fiel

Exception writing document to the index; possible analysis error.

2014-12-17 Thread bengates
Hello everyone, I'm losing my hair trying to add a simple document on a freshly installed Solr core. I'm running Solr 4.10.2 on jetty with the start.jar. I have the following (managed) schema and my core has been reloaded : Consider my core have 0 document. Now I'm trying to add a document via

Search Query with editdistance - throws Null Pointer

2014-12-17 Thread Trilok Prithvi
When I run the following query (Solr 4.10.2) with edit-distance, I'm getting a null pointer exception: */solr/select?q=fld:(Event ID)&fl=strdist("eventid",fld_alphaonly,edit)* java.lang.NullPointerException 500 It works if I remove the space between "Event ID", like: */solr/select?q=fld:(EventID

Re: converting to parent/child block indexing

2014-12-17 Thread Mikhail Khludnev
Hm.. really sorry about that. The current implementation is not really ideal, you know. When handles update it tries to recognize whether it block or not and in fact it uses _root_ field to enforce uniqueness. There are few consequences: - _root_ field spans whole block, not the parent one - cur

converting to parent/child block indexing

2014-12-17 Thread Michael Sokolov
Have other people tried migrating an index that was created without block (parent/child) indexing to one that *does* have it? Did you find that you got duplicate documents - ie multiple documents with the same uniqueField value? That's what I found, and I don't see how that's possible. What

Re: Partial match autosuggest (match a word occurring anywhere in a field)

2014-12-17 Thread bbarani
Thanks for your response. I fixed this issue by using the -- View this message in context: http://lucene.472066.n3.nabble.com/Predictive-search-match-a-word-occurring-anywhere-in-a-field-tp4

/export - Why need sort criteria (4.10.2)?

2014-12-17 Thread Damien Dykman
The /export request handler mandates a sort order. Is there a particular reason? It'd be nice to have the option to tell Solr: "just export in the order you want", to limit any kind of overhead added by sorting. Or am I missing something? If exports were distributed, I can see the need for some ki

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Erick Erickson
That seems fine. What happens if your prefix is just "b"? Just to verify that you're getting something back Although I usually just enable the terms component and specify the field and all that on the URL, but what you're doing should work fine This is seeming like a puzzler... Erick On

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Martin Dietze
On 17 December 2014 at 16:41, Erick Erickson wrote: > First, I'd look in your corpus for "bnak". The problem with index-based > suggestions is that if your index contains garbage, they're "correctly > spelled" since they're in the index. TermsComponent is very useful for this. > > You can also loo

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Dan Davis
What about the frequency comparison - I haven't used the spellchecker heavily, but it seems that if "bnak" is in the database, but "bank" is much more frequent, then "bank" should be a suggestion anyway... On Wed, Dec 17, 2014 at 10:41 AM, Erick Erickson wrote: > > First, I'd look in your corpus

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Erick Erickson
First, I'd look in your corpus for "bnak". The problem with index-based suggestions is that if your index contains garbage, they're "correctly spelled" since they're in the index. TermsComponent is very useful for this. You can also loosen up the match criteria, and as I remember the collations pa

Re: Block join subqueries

2014-12-17 Thread Mikhail Khludnev
I don't see any problem with it. Could you try? On Wed, Dec 17, 2014 at 5:18 PM, marotosg wrote: > > Yes, that's true. I mean join then. > is it possible to join three cores A B C. > I know it is possible to join A -> B and A -> C > > Is it possible to join them > A -> B -> C > > Thanks > > > > -

Spellchecker delivers far too few suggestions

2014-12-17 Thread Martin Dietze
I recently upgraded to SOLR 4.10.1 and after that set up the spell checker which I use for returning suggestions after searches with few or no results. When the spellchecker is active, this request handler is used (most of which is taken from examples I found in the net): explicit

LIRE Solr plugin updated to 4.10.2 and new demo ...

2014-12-17 Thread Mathias Lux
Hi all! After the initial release I finally came around to update the content based image retrieval plugin LIRE Solr to the current version and it has been extended to support more CBIR features. https://bitbucket.org/dermotte/liresolr I also took the freedom to update the web client and the dem

Re: Block join subqueries

2014-12-17 Thread marotosg
Yes, that's true. I mean join then. is it possible to join three cores A B C. I know it is possible to join A -> B and A -> C Is it possible to join them A -> B -> C Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/Block-join-subqueries-tp4174709p4174749.html Sent fr

Re: CLUSTERSTATUS timeout

2014-12-17 Thread adfel70
Hi Jonathan, We are having the exact same problem with Solr 4.8.0. Did you manage to resolve this one? Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/CLUSTERSTATUS-timeout-tp4173224p4174741.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread andreic9203
Hello again, So, what you are saying is to create the collections and documents with this in mind, and what I want couldn't be done from querying? Yes, but my index is already big, and this kind of operation is hard to be done... Thank you for the idea. If I didn't understand as it should be, pl

Re: Block join subqueries

2014-12-17 Thread Mikhail Khludnev
Hello, BlockJoin doesn't work cross core, it's only possible with {!join On Wed, Dec 17, 2014 at 3:12 PM, marotosg wrote: > > Hi Mikhail, > > Thanks for that. That's exactly what I was looking for but this is for the > same core. > This allows you to search in a document nested two levels. > >

Re: Keepwords & DataImportHandler

2014-12-17 Thread Doug Turnbull
Leo, everything you describe sounds correct. Are you having any problems? are keep words not working for DIH for you? Or are you just looking for general pointers? If so, your approach to this sounds a lot like a blog post I recently wrote, which you might find useful: http://opensourceconnection

SolrJ insert PDF for different language

2014-12-17 Thread Gabi
Following the sample in http://wiki.apache.org/solr/ContentStreamUpdateRequestExample I'm able to insert a PDF and search words, etc. ContentStreamUpdateRequest up = new ContentStreamUpdateRequest("/update/extract"); up.addFile(new

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread lboutros
You could try to denormalize even more : Create two collections: - one with user groups in mind - the second collection with user and groupSignature groups in mind. For instance, with user groups in mind : { "id": "svsKQSFfzhu-SznsU8FUII", "user": "admin", "furniture_cou

Re: Keepwords & DataImportHandler

2014-12-17 Thread Ahmet Arslan
Hi Leo, You are doing OK. DIH and analysis are separate issues. Please note that analysis changes indexed values. Like you see analysis section in solr panel. When you retrieve stored values using fl= parameter, original values are displayed. So natural question is, how are you doing to consu

Re: questions about BlockJoinParentQParser

2014-12-17 Thread Michael Sokolov
Thanks Andrey! I voted for your patch -Mike On 12/17/2014 4:01 AM, Kydryavtsev Andrey wrote: For support scoreMode parameter in BlockJoinParentQParser we have this jira with attached patch https://issues.apache.org/jira/browse/SOLR-5882 17.12.2014, 06:54, "Michael Sokolov" : I'm trying to us

Re: Block join subqueries

2014-12-17 Thread marotosg
Hi Mikhail, Thanks for that. That's exactly what I was looking for but this is for the same core. This allows you to search in a document nested two levels. I was expecting to do the same for cross core joins. That's basically doing a join from Core1 to Core2 to Core3. I couldn't find anything a

Re: Block join subqueries

2014-12-17 Thread Mikhail Khludnev
Sergio, did you check http://blog.griddynamics.com/2013/12/grandchildren-and-siblings-with-block.html ? On Wed, Dec 17, 2014 at 1:07 PM, marotosg wrote: > > Hi, > > Is is possible to do a query joining three levels. > For isntance with three cores Person, Person Job and Company. > > I know is pos

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread andreic9203
Ok, if you have a bit of time, you can explain your last comment on this example, please? { "name": "Table", "user": "admin", "updateDate": "2012-12-31T15:31:16.012Z", "id": "svsKXRqbVhqZa-SznsU8FUII", "type": 1, "groupSignature": "5FJHzWke16auaB2hdq

Block join subqueries

2014-12-17 Thread marotosg
Hi, Is is possible to do a query joining three levels. For isntance with three cores Person, Person Job and Company. I know is possible to join from Person to Person Job and from Person to Company. For instance {!type=join from=PersonIdsS to=PersonID fromIndex=personjob}type_level:parent AND {

Re: Join in SOLR

2014-12-17 Thread Mikhail Khludnev
On Wed, Dec 17, 2014 at 11:51 AM, Rajesh Panneerselvam < rajesh.panneersel...@aspiresys.com> wrote: > > Yes Mikhail. This is what I want exactly. My sub-entities should be > added as child document to the root entity. But will this feature be > available in upcoming release anytime soon? > Rajesh,

Keepwords & DataImportHandler

2014-12-17 Thread leostro
Hi all, This is my first question in this forum :D I'm trying to import documents using a DataImportHandler. The first test is to import some document having only a title, I want to import this field indexing it as a standard text type value. Moreover I'd like to

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread lboutros
I don't know if this is possible for you but: could you pre-process the group and create nested documents with pre-computed document counts ? https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers?focusedCommentId=48792150#UploadingDatawithIndexHandlers-NestedChildDoc

Re: questions about BlockJoinParentQParser

2014-12-17 Thread Kydryavtsev Andrey
For support scoreMode parameter in BlockJoinParentQParser we have this jira with attached patch https://issues.apache.org/jira/browse/SOLR-5882 17.12.2014, 06:54, "Michael Sokolov" : > I'm trying to use BJPQP and ran into a few little gotchas that I'd like > to share with y'all in case you have a

Large scale Update of solr indexed documents

2014-12-17 Thread atawfik
Hi all, I have a scenario where I need to generate summaries of indexed documents. So, I initially thought I should do that at Nutch because I am using Nutch to push documents to Solr. However, I will need some statistics about terms and documents. Hence, I will have to duplicate analysis at Nutch

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread andreic9203
Hello, First, thank you for the quick answer. Second, This is exactly my current problem. I'm doing this with 2 queries, but with the current performance issues, I want to change the implementation. Now I am using grouping and then I take the count and set that count to the all grouped documents

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread lboutros
Andrei, Pivot faceting is the Solr implementation for Hierarchical Facets. I don't think this is what you need. Could you please describe the original use case ? Just to eliminate XY problem. I don't know if this is acceptable for you in term of performance but you could try to solve your proble

Fwd: Join in SOLR

2014-12-17 Thread Mikhail Khludnev
Hello Rajesh, Please find the brief below > > Thanks Mikhail. As per what you have mentioned can I get a list of sub > entities with this new Zipper join. Because now in existing DIH I'm getting > a list for individual fields of the sub entities. > > 1) If I get a list of sub entities how can I g