Re: Is it possible to grouping solr results by their domain ?

2012-04-09 Thread Jan Høydahl
Sure! http://wiki.apache.org/solr/FieldCollapsing -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. apr. 2012, at 07:27, hadi wrote: > I have crawled many site with nutch and using solr 3.4 to browse the results > but i want to
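A minimal sketch of the kind of request the FieldCollapsing page describes, assuming the result-grouping parameters `group`, `group.field` and `group.limit`. The base URL and the field name `host` are hypothetical; the field holding the domain would need to be a single-valued indexed field in the actual schema.

```python
from urllib.parse import urlencode

def grouped_query_url(base_url, query, group_field, per_group=3):
    """Build a select URL that asks Solr to group results by one field."""
    params = [
        ("q", query),
        ("group", "true"),                # turn on result grouping
        ("group.field", group_field),     # field holding the domain (assumed name)
        ("group.limit", str(per_group)),  # documents returned per group
        ("wt", "json"),
    ]
    return base_url + "/select?" + urlencode(params)

# Hypothetical host, port and field name, for illustration only.
url = grouped_query_url("http://localhost:8983/solr", "solr", "host")
print(url)
```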

Cloud-aware request processing?

2012-04-09 Thread Benson Margulies
I'm working on a prototype of a scheme that uses SolrCloud to, in effect, distribute a computation by running it inside of a request processor. If there are N shards and M operations, I want each node to perform M/N operations. That, of course, implies that I know N. Is that fact available anypla
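The M/N split described above can be sketched as a round-robin assignment. This is only an illustration of the arithmetic: how a node learns N and its own shard index is exactly the open question in this message, so both are passed in as plain arguments here, and the function name is hypothetical.

```python
def operations_for_shard(num_operations, num_shards, shard_index):
    """Return the operation ids assigned to one shard (round-robin split)."""
    if num_shards <= 0:
        raise ValueError("num_shards must be positive")
    if not 0 <= shard_index < num_shards:
        raise ValueError("shard_index out of range")
    # Shard k takes operations k, k + N, k + 2N, ...
    return list(range(shard_index, num_operations, num_shards))

# With M = 10 operations and N = 3 shards, every operation lands on exactly one shard.
assignments = [operations_for_shard(10, 3, k) for k in range(3)]
print(assignments)
```

Each shard ends up with either floor(M/N) or ceil(M/N) operations, so the load differs by at most one operation between shards.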

Re: Cloud-aware request processing?

2012-04-09 Thread Jan Høydahl
Hi, Instead of using Solr, you may want to have a look at Hadoop or another framework for distributed computation, see e.g. http://java.dzone.com/articles/comparison-gridcloud-computing -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.c

'No JSP support' error in embedded Jetty for solrCloud as of apache-solr-4.0-2012-04-02_11-54-55

2012-04-09 Thread Benson Margulies
Starting the leader with: java -Dbootstrap_confdir=./solr/conf -Dcollection.configName=rnicloud -DzkRun -DnumShards=3 -Djetty.port=9167 -jar start.jar and browsing to http://localhost:9167/solr/rnicloud/admin/zookeeper.jsp I get: HTTP ERROR 500 Problem accessing /solr/rnicloud/admin/zookeep

Re: Cloud-aware request processing?

2012-04-09 Thread Benson Margulies
Jan Høydahl, My problem is intimately connected to Solr. It is not a batch job for Hadoop; it is a distributed real-time query scheme. I hate to add yet another complex framework if a Solr RP can do the job simply. For this problem, I can transform a Solr query into a subset query on each shard,

RE: Re: Cloud-aware request processing?

2012-04-09 Thread Darren Govoni
"...it is a distributed real-time query scheme..." SolrCloud does this already. It treats all the shards like one-big-index, and you can query it normally to get "subset" results from each shard. Why do you have to re-write the query for each shard? Seems unnecessary. --- Original Message

Is http://wiki.apache.org/solr/SolrCloud#Example_A:_Simple_two_shard_cluster up to date?

2012-04-09 Thread Benson Margulies
I specify -Dcollection.configName=rnicloud, but the admin gui tells me that I have a collection named 'collection1'. And, as reported in a prior email, the admin UI URL in there seems wrong.

Re: JNDI in db-data-config.xml websphere

2012-04-09 Thread tech20nn
Have to use exact JNDI name in db-data-config.xml, as unmanaged threads in Websphere do not have access to java:comp/env namespace. Resource name can not be mapped to websphere jdbc datasource name via reference definition in web.xml. Now using jndiName="jdbc/testdb" instead of jndiName="java:co

Re: Re: Cloud-aware request processing?

2012-04-09 Thread Benson Margulies
On Mon, Apr 9, 2012 at 9:50 AM, Darren Govoni wrote: > "...it is a distributed real-time query scheme..." > > SolrCloud does this already. It treats all the shards like one-big-index, > and you can query it normally to get "subset" results from each shard. Why > do you have to re-write the query f

Re: Solr is indexing but not showing results

2012-04-09 Thread Ahmet Arslan
> stored="true" > required="true"/> > stored="true" > required="true"/>      String type is not tokenized. Indexed verbatim. Use a different type for full text search. e.g. type="text"

Stumped on using a custom update request processor with SolrCloud

2012-04-09 Thread Benson Margulies
If you would be so kind as to look at https://issues.apache.org/jira/browse/SOLR-3342, you will see that I tried to use a working configuration for a URP of mine with SolrCloud, and received in return an NPE. Somehow or another, by default, the XmlUpdateRequestHandler ends up using (I think) the P

RE: To truncate or not to truncate (group.truncate vs. facet)

2012-04-09 Thread Young, Cody
I believe you're looking for what's called, "Matrix Counts" Please see this JIRA issue. To my knowledge it has been committed in trunk but not 3.x. https://issues.apache.org/jira/browse/SOLR-2898 This feature is accessed by using group.facet=true Cody -Original Message- From: danjfole

RE: how to correctly facet clothing multiple sizes and colors?

2012-04-09 Thread Robert Petersen
You *could* do it by making one and only one solr document for each clothing item, then just have the front end render all the sizes and colors available for that item as size/color pickers on the product page. You can add all the colors and sizes to the one document in the index so they are searc

Re: 'No JSP support' error in embedded Jetty for solrCloud as of apache-solr-4.0-2012-04-02_11-54-55

2012-04-09 Thread Ryan McKinley
zookeeper.jsp was removed (along with all JSP stuff) in trunk Take a look at the cloud tab in the UI, or check the /zookeeper servlet for the JSON raw output ryan On Mon, Apr 9, 2012 at 6:42 AM, Benson Margulies wrote: > Starting the leader with: > >  java -Dbootstrap_confdir=./solr/conf -Dcol

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-04-09 Thread danjfoley
I did get this working with version 4. However my facet queries still don't group. Sent from my phone - Reply message - From: "Young, Cody [via Lucene]" Date: Mon, Apr 9, 2012 12:45 pm Subject: To truncate or not to truncate (group.truncate vs. facet) To: "danjfoley" I believe you'r

RE: To truncate or not to truncate (group.truncate vs. facet)

2012-04-09 Thread Young, Cody
You tried adding the parameter &group.facet=true ? Cody -Original Message- From: danjfoley [mailto:d...@micamedia.com] Sent: Monday, April 09, 2012 10:09 AM To: solr-user@lucene.apache.org Subject: Re: To truncate or not to truncate (group.truncate vs. facet) I did get this working wit

RE: To truncate or not to truncate (group.truncate vs. facet)

2012-04-09 Thread Young, Cody
One other thing, I believe that you need to be using facet.field on single valued string fields for group.facet to function properly. Are the fields you're faceting on multiValued=false? Cody -Original Message- From: Young, Cody [mailto:cody.yo...@move.com] Sent: Monday, April 09, 2012

Re: Solr is indexing but not showing results

2012-04-09 Thread srini
Hi Thanks for your reply. As per your suggestion I changed XML field type to text. but when I start solr it is throwing following exception. SEVERE: org.apache.solr.common.SolrException: Unknown fieldtype 'text' specified on field XML Any suggestions!!(Thanks for your reply) -- View this m

Re: Solr is indexing but not showing results

2012-04-09 Thread Walter Underwood
You will need to define or customize a field type for text. The example schema.xml file that is installed with Solr 3.5 has several kinds of text fields; "text_general" and "text_en" are good places to start. You can use one of those, then customize it. wunder On Apr 9, 2012, at 11:27 AM, s

Re: Solr is indexing but not showing results

2012-04-09 Thread Jeevanandam Madanagopal
Srini - This "text" datatype comes as sample configuration in SOLR distribution. Check this, it may suit your need! -Jeevanandam On Apr 10, 2

Re: Solr is indexing but not showing results

2012-04-09 Thread Walter Underwood
That is not a good configuration. Synonyms should be expanded at index time, not query time. --wunder On Apr 9, 2012, at 11:43 AM, Jeevanandam Madanagopal wrote: > Srini - > > This "text" datatype comes as sample configuration in SOLR distribution. > Check this, it may suit your need! > > au

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-04-09 Thread danjfoley
I am using group.facet and it works fine for regular facet.field but not for facet.query Sent from my phone - Reply message - From: "Young, Cody [via Lucene]" Date: Mon, Apr 9, 2012 1:38 pm Subject: To truncate or not to truncate (group.truncate vs. facet) To: "danjfoley" One other

Re: Solr is indexing but not showing results

2012-04-09 Thread Jeevanandam Madanagopal
I agree partially; it actually depends. For instance, at index time some of the synonym mappings may or may not expand (e.g. frequent index population from different sources). So it is good to apply synonyms at index time as well as query time to achieve complete coverage. Most of the time I did similar

Re: Solr is indexing but not showing results

2012-04-09 Thread Walter Underwood
There are some well-understood problems with query-time synonyms. Read about them here: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory Expanding synonyms at both index and query time causes a different problem, over-counting the score for any term in the s

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-04-09 Thread Martijn v Groningen
The group.facet option only works for field facets (facet.field). Other facet types (query, range and pivot) aren't supported yet. The group.facet option works for both single and multivalued fields specified in the facet.field parameter. Martijn On 9 April 2012 20:58, danjfoley wrote: > I am using
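A rough sketch of a request that combines grouping with `group.facet=true`, restricted to `facet.field` as described above. The field names `product_id` and `color` are hypothetical, and the helper deliberately offers no way to add `facet.query`, since that facet type is reported as unsupported with `group.facet`.

```python
from urllib.parse import urlencode

def group_facet_params(query, group_field, facet_fields):
    """Build parameters for grouped field faceting (facet.field only)."""
    params = [
        ("q", query),
        ("group", "true"),
        ("group.field", group_field),
        ("group.facet", "true"),  # count groups rather than documents in facets
        ("facet", "true"),
    ]
    # One facet.field entry per field; the parameter may repeat.
    params.extend(("facet.field", name) for name in facet_fields)
    return params

# Hypothetical field names, for illustration only.
query_string = urlencode(group_facet_params("*:*", "product_id", ["color"]))
print(query_string)
```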

How to facet data from a multivalued field?

2012-04-09 Thread Thiago
Hello everybody, I've already searched for this topic in the forum, but I didn't find any case like this. I apologize if this topic has already been discussed. I'm having a problem faceting a multivalued field. My field is called series, and it has names of TV series like the big b

Re: How to facet data from a multivalued field?

2012-04-09 Thread Darren Govoni
Your handler for that field should be looked at. Try not using a handler that tokenizes or stems the field. You want to leave the text as is. I forget the handler setting for that, but it's documented in there somewhere. On Mon, 2012-04-09 at 13:02 -0700, Thiago wrote: > Hello everybody, > > I've

Re: Strange behavior with search on empty string and NOT

2012-04-09 Thread Chris Hostetter
: Would it be a good idea to have Solr throw syntax error if an empty string : query occurs? erick's explanation wasn't very precise ... solr doesn't have any special handling of "empty strings", but what you are searching for *might* be a totally valid query based on how the field type is c

Re: Dynamically changing facet hierarchies and facet values

2012-04-09 Thread Chris Hostetter
: I have a use case where the facet hierarchies as well as facet names change : very frequently. : : For example: : (Smartphones >> Android ) may become : Smartphones >> GSM >> Android. : : OR :"Smartphone" could be renamed to "Smart Phone" : : If I use traditional hierarchical fa

Boosting when matching specific field values

2012-04-09 Thread gseoeltru solr
I am using edismax when executing search against set of news articles. I would like to also boost the scores of matched documents based on another field in the documents which I will call "source" which can be set to 3 possible strings. So if the "source" field has a value "a", then I want to mul

Re: solr analysis-extras configuration

2012-04-09 Thread Chris Hostetter
: Further info: I can make this work if I stay out of tomcat -- I : download a fresh solr binary distro, copy those five JARs from 'dist' : and 'contrib' into example/solr/lib/, copy my solrconfig.xml and : schema.xml, and run 'java -jar start.jar', and it works fine. But : trying to add those sa

Re: how to correctly facet clothing multiple sizes and colors?

2012-04-09 Thread Andrew Harvey
What we do in our application is exactly what Robert described. We index Products, not variants. The variant data (colour, size etc.) is denormalised into the product document at index time. We then facet on the variant attributes and get product count instead of variant count. What you're see

Re: Solr with UIMA

2012-04-09 Thread chris3001
Tommaso, I apologize for my delayed response. Thank you very much for your time looking into this!! I will try to replicate your efforts on my end this week. Respectfully, Chris -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-with-UIMA-tp3863324p3898094.html Sent from t

Re: Suggester not working for digit starting terms

2012-04-09 Thread Erick Erickson
Is it possible that your fieldType definition for a_suggest is stripping out the digits? Consider using TermsComponent http://wiki.apache.org/solr/TermsComponent or the admin page or Luke to examine the terms actually _in_ your index. Or look at the admin/analysis page and give it some sample input

Re: Question on using dynamic fields

2012-04-09 Thread Erick Erickson
Hmmm, not sure about the dataconfig.xml file. What are you trying to index? Is this DIH? Because if you're simply posting Solr-formatted XML docs, dataconfig.xml is irrelevant. You say you're not seeing the output. One of two things is going on: 1> The data is not in the index. See the admin/sc

Re: Problem about range search

2012-04-09 Thread Erick Erickson
Hmmm, works fine for me using the "popularity" field in the default schema. What version of Solr are you using? What is your complete handler definition? Best Erick On Mon, Apr 9, 2012 at 12:10 AM, ZHANG Liang F wrote: > Hi, > I ran into a problem when trying range facet search. I had a schema

Re: Re: Cloud-aware request processing?

2012-04-09 Thread Erick Erickson
I _think_ you need to look at the Zookeeper information, perhaps something like ZkController.getCloudState or some such? Warning: I haven't been in that code, so this is just a guess. But since the SolrCloud stuff has to know this kind of info in order to do distributed indexing, it's got to be av

Re: Boosting when matching specific field values

2012-04-09 Thread Chris Hostetter
: possible strings. So if the "source" field has a value "a", then I want : to multiply the score by 1. If the "source" field has a value "b", then I : want to multiple the score by 2 ... and so on. What is the way to go about : doing this ? how long is your "and so on" list? You could use the

Re: how to correctly facet clothing multiple sizes and colors?

2012-04-09 Thread danjfoley
The problem with that approach is that if you selected say large and red you'd get back all the products with large and red as variants. Not the products with red in the large size as would be expected. Sent from my phone - Reply message - From: "Andrew Harvey [via Lucene]" Date: Mon,
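The cross-matching problem described above can be illustrated without Solr at all. In this sketch the product data is invented, and the "paired" check merely stands in for one possible workaround (keeping each size/color combination together as a single value), which is an assumption and not something proposed in the thread.

```python
# One hypothetical product with two variants: large/blue and small/red.
variants = [("large", "blue"), ("small", "red")]

# Denormalised view: sizes and colors are flattened into independent lists.
sizes = {size for size, _ in variants}
colors = {color for _, color in variants}

def matches_flattened(size, color):
    """Filter as two independent multi-valued fields would."""
    return size in sizes and color in colors

def matches_paired(size, color):
    """Filter on the combination, so size and color must belong together."""
    return (size, color) in variants

print(matches_flattened("large", "red"))  # matches although no large red item exists
print(matches_paired("large", "red"))
```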

SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Benson Margulies
Those of you insomniacs who have read my messages here over the last few weeks might recall that I've been working on a request handler that wraps the SearchHandler to rewrite queries and then reorder results. (I haven't quite worked out how to apply Grant's alternative suggestions without losing

Re: Question on using dynamic fields

2012-04-09 Thread Rakesh Varna
Hi Erick, Thanks for the response. I am trying to index xml files in a directory. I provide the xpath details, file location etc in data-config.xml. I will try the 2 approaches that you have mentioned. Regards, Rakesh Varna On Mon, Apr 9, 2012 at 3:38 PM, Erick Erickson wrote: > Hmmm, not sur

Re: SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Mark Miller
On Apr 9, 2012, at 7:34 PM, Benson Margulies wrote: > Those of you insomniacs who have read my messages here over the last > few weeks might recall that I've been working on a request handler > that wraps the SearchHandler to rewrite queries and then reorder > results. > > (I haven't quite worke

Re: Why this document does not match?

2012-04-09 Thread Alexander Ramos Jardim
Sorry for the answer. 2012/3/29 Erick Erickson > Alexander: > > Your images were stripped by one of our mail servers, so there's not > much we can see ... > > But guessing, you aren't searching the fields you think you are: > itemNameSearch:fifa12 > becomes > itemNameSearch:fifa defaultSearchFie

Re: SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Benson Margulies
That page seems to be saying that the 'distributed' APIs take place on the leader, and the ordinary prepare/process APIs out at the leaves. I'll set out to prove or disprove that tomorrow. On Mon, Apr 9, 2012 at 8:17 PM, Mark Miller wrote: > > On Apr 9, 2012, at 7:34 PM, Benson Margulies wrote:

Re: Is http://wiki.apache.org/solr/SolrCloud#Example_A:_Simple_two_shard_cluster up to date?

2012-04-09 Thread Mark Miller
On Apr 9, 2012, at 9:52 AM, Benson Margulies wrote: > I specify -Dcollection.configName=rnicloud, but the admin gui tells me > that I have a collection named 'collection1'. > > And, as reported in a prior email, the admin UI URL in there seems wrong. Sorry - that param name is not entirely cle

Re: Why this document does not match?

2012-04-09 Thread Chris Hostetter
: > itemNameSearch:fifa defaultSearchField:12 : That's exactly what's happening! Why does this happen? Whitespace is meaningful to the query parser: it tells the query parser there are multiple clauses for a boolean query. If you want to search for any words the user typed in the field "itemNam

[CFP] Open Source Search Conference Oct 2, 2012

2012-04-09 Thread Erik Hatcher
Sending this on behalf of my friends at BasisTech - Subject: Call for Presentations: Open Source Search Conference Oct. 2, 2012 (Chantilly, VA) == Call for Presentations & Save the Date Open Source Search Conference Oct 2, 2012 (tutorials Oct. 1) in

[Lucene Revolution] Agenda Updated!

2012-04-09 Thread Erik Hatcher
We've updated the agenda and keynotes for the upcoming Lucene Revolution conference, May 7-10 in Boston, MA. We've got a lot of the committers coming, and Hoss' infamous "Stump the Chump" session, and many great talks. All we're missing is you. It's not too late to sign up ;) http://w

Re: SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Benson Margulies
Um, maybe I've hit a quirk? In my solrconfig.xml, my special SearchComponents are installed only for a specific QT. So, it looks to me as if that QT is not propagated into the request out to the shards, and so they run the ordinary request handler without my components in it. Is this intended beh

Re: Question on using dynamic fields

2012-04-09 Thread Rakesh Varna
Hi Erick, The schema browser says that no dynamic fields were indexed. Any idea how do I specify dynamic fields through XPath when I only know the prefix and nothing else? Regards, Rakesh Varna On Mon, Apr 9, 2012 at 4:49 PM, Rakesh Varna wrote: > Hi Erick, >Thanks for the response. I am

Re: SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Mark Miller
Yeah, that's how it works - it ends up hitting the select request handler (this might be overridable with shards.qt). All the params are passed along, so in general, it will act the same as the top level req handler - but it can then remove the shards param so you don't have an infinite recursion
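If `shards.qt` does override the handler used for the per-shard requests, as suggested above, the top-level request might carry both `qt` and `shards.qt`. This is a sketch under that assumption; the handler name `rescore` is hypothetical, and whether this works with the custom components in question is not confirmed in the thread.

```python
from urllib.parse import urlencode

def with_shard_handler(params, handler):
    """Return a copy of params that names the same handler for shard requests."""
    updated = dict(params)
    updated["qt"] = handler         # handler for the top-level request
    updated["shards.qt"] = handler  # handler for the per-shard sub-requests
    return updated

# Hypothetical query and handler name, for illustration only.
request_params = with_shard_handler({"q": "name:smith"}, "rescore")
print(urlencode(request_params))
```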

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-04-09 Thread danjfoley
Is this planned as a future feature? Is it in the bug tracker as a feature yet..just wondering how long until it is a feature. I could live without price counts for a bit. Sent from my phone - Reply message - From: "Martijn v Groningen-2 [via Lucene]" Date: Mon, Apr 9, 2012 3:31 pm S

RE: Problem about range search

2012-04-09 Thread ZHANG Liang F
Hi, I just found the root cause. The definition for the 'long' type is not right. the previous definition was: which doesn't support range query! now I changed to : , and it's done! Thanks a lot! Liang -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: 20

which approach is correct?

2012-04-09 Thread neosky
Here are my fields 101NGHGJGKGKLHJFKGJGKGK the sequence field is from 300 bytes to 56K bytes, no spaces I want to ngram from 3 to 8 NGH GHG HGJ ... NGHG GHGJ HGJG ... ...
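To make the intended tokens concrete, here is a small sketch that generates character n-grams of length 3 to 8 from the sample sequence. It assumes that `101` in the sample is the id and that the rest is the sequence value, since the field markup was lost. In Solr itself this would normally be handled by an n-gram tokenizer or filter configured with minimum and maximum gram sizes rather than by client code.

```python
def char_ngrams(text, min_n=3, max_n=8):
    """Return all character n-grams of text, shortest grams first."""
    grams = []
    for n in range(min_n, max_n + 1):
        # Slide a window of width n across the text.
        for start in range(len(text) - n + 1):
            grams.append(text[start:start + n])
    return grams

sequence = "NGHGJGKGKLHJFKGJGKGK"  # assumed to be the sequence field of the sample
grams = char_ngrams(sequence)
print(grams[:3])
```

For a 56K-byte sequence this produces roughly six grams per character position, which is worth keeping in mind when estimating index size.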