Sure!
http://wiki.apache.org/solr/FieldCollapsing
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com
On 9. apr. 2012, at 07:27, hadi wrote:
> I have crawled many site with nutch and using solr 3.4 to browse the results
> but i want to
I'm working on a prototype of a scheme that uses SolrCloud to, in
effect, distribute a computation by running it inside of a request
processor.
If there are N shards and M operations, I want each node to perform
M/N operations. That, of course, implies that I know N.
Is that fact available anypla
Hi,
Instead of using Solr, you may want to have a look at Hadoop or another
framework for distributed computation, see e.g.
http://java.dzone.com/articles/comparison-gridcloud-computing
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.c
Starting the leader with:
java -Dbootstrap_confdir=./solr/conf -Dcollection.configName=rnicloud
-DzkRun -DnumShards=3 -Djetty.port=9167 -jar start.jar
and browsing to
http://localhost:9167/solr/rnicloud/admin/zookeeper.jsp
I get:
HTTP ERROR 500
Problem accessing /solr/rnicloud/admin/zookeep
Jan Høydahl,
My problem is intimately connected to Solr. it is not a batch job for
hadoop, it is a distributed real-time query scheme. I hate to add yet
another complex framework if a Solr RP can do the job simply.
For this problem, I can transform a Solr query into a subset query on
each shard,
"...it is a distributed real-time query scheme..."
SolrCloud does this already. It treats all the shards like one-big-index, and you can
query it normally to get "subset" results from each shard. Why do you have to
re-write the query for each shard? Seems unnecessary.
--- Original Message
I specify -Dcollection.configName=rnicloud, but the admin gui tells me
that I have a collection named 'collection1'.
And, as reported in a prior email, the admin UI URL in there seems wrong.
Have to use exact JNDI name in db-data-config.xml, as unmanaged threads in
Websphere do not have access to java:comp/env namespace.
Resource name can not be mapped to websphere jdbc datasource name via
reference definition in web.xml.
Now using jndiName="jdbc/testdb" instead of
jndiName="java:co
On Mon, Apr 9, 2012 at 9:50 AM, Darren Govoni wrote:
> "...it is a distributed real-time query scheme..."
>
> SolrCloud does this already. It treats all the shards like one-big-index,
> and you can query it normally to get "subset" results from each shard. Why
> do you have to re-write the query f
> stored="true"
> required="true"/>
> stored="true"
> required="true"/>
String type is not tokenized. Indexed verbatim. Use a different type for full
text search. e.g. type="text"
If you would be so kind as to look at
https://issues.apache.org/jira/browse/SOLR-3342, you will see that I
tried to use a working configuration for a URP of mine with SolrCloud,
and received in return an NPE.
Somehow or another, by default, the XmlUpdateRequestHandler ends up
using (I think) the P
I believe you're looking for what's called, "Matrix Counts"
Please see this JIRA issue. To my knowledge it has been committed in trunk but
not 3.x.
https://issues.apache.org/jira/browse/SOLR-2898
This feature is accessed by using group.facet=true
Cody
-Original Message-
From: danjfole
You *could* do it by making one and only one solr document for each
clothing item, then just have the front end render all the sizes and
colors available for that item as size/color pickers on the product
page. You can add all the colors and sized to the one document in the
index so they are searc
zookeeper.jsp was removed (along with all JSP stuff) in trunk
Take a look at the cloud tab in the UI, or check the /zookeeper
servlet for the JSON raw output
ryan
On Mon, Apr 9, 2012 at 6:42 AM, Benson Margulies wrote:
> Starting the leader with:
>
> java -Dbootstrap_confdir=./solr/conf -Dcol
I did get this working with version 4. However my facet queries still don't
group.
Sent from my phone
- Reply message -
From: "Young, Cody [via Lucene]"
Date: Mon, Apr 9, 2012 12:45 pm
Subject: To truncate or not to truncate (group.truncate vs. facet)
To: "danjfoley"
I believe you'r
You tried adding the parameter
&group.facet=true ?
Cody
-Original Message-
From: danjfoley [mailto:d...@micamedia.com]
Sent: Monday, April 09, 2012 10:09 AM
To: solr-user@lucene.apache.org
Subject: Re: To truncate or not to truncate (group.truncate vs. facet)
I did get this working wit
One other thing, I believe that you need to be using facet.field on single
valued string fields for group.facet to function properly. Are the fields
you're faceting on multiValued=false?
Cody
-Original Message-
From: Young, Cody [mailto:cody.yo...@move.com]
Sent: Monday, April 09, 2012
Hi Thanks for your reply. As per your suggestion I changed XML field type to
text.
but when I start solr it is throwing following exception.
SEVERE: org.apache.solr.common.SolrException: Unknown fieldtype 'text'
specified on field XML
Any suggestions!!(Thanks for your reply)
--
View this m
You will need to define or customize a field type for text.
The example schema.xml file that is installed with Solr 3.5 has a several kinds
of text fields, "text_general" and "text_en" are good places to start. You can
use one of those, then customize it.
wunder
On Apr 9, 2012, at 11:27 AM, s
Srini -
This "text" datatype comes as sample configuration in SOLR distribution. Check
this, it may suit your need!
-Jeevanandam
On Apr 10, 2
That is not a good configuration. Synonyms should be expanded at index time,
not query time. --wunder
On Apr 9, 2012, at 11:43 AM, Jeevanandam Madanagopal wrote:
> Srini -
>
> This "text" datatype comes as sample configuration in SOLR distribution.
> Check this, it may suit your need!
>
> au
I am using group.facet and it works fine for regular facet.field but not for
facet.query
Sent from my phone
- Reply message -
From: "Young, Cody [via Lucene]"
Date: Mon, Apr 9, 2012 1:38 pm
Subject: To truncate or not to truncate (group.truncate vs. facet)
To: "danjfoley"
One other
I agree partially, it actually depends. For instance during index time few of
the synonyms mapping may or may not expand (for e.g.. frequent data index
population from different source). So good apply at index time as well as query
time to achieve complete ratio. Mostly of the time I did similar
There are some well-understood problems with query-time synonyms. Read about
them here:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory
Expanding synonyms at both index and query time causes a different problem,
over-counting the score for any term in the s
The group.facet option only works for field facets (facet.field). Others
facets types (query, range and pivot) aren't supported yet.
The group.facet works for both single and multivalued fields specified in
the facet.field parameter.
Martijn
On 9 April 2012 20:58, danjfoley wrote:
> I am using
Hello everybody,
I've already searched about this topic in the forum, but I didn't find any
case like this. I ask for apologizes if this topic have been already
discussed.
I'm having a problem in faceting a multivalued field. My field is called
series, and it has names of TV series like the big b
Your handler for that field should be looked at.
Try not using a handler that tokenizes or stems the field.
You want to leave the text as is. I forget the handler setting for that,
but its documented in there somewhere.
On Mon, 2012-04-09 at 13:02 -0700, Thiago wrote:
> Hello everybody,
>
> I've
: Would it be a good idea to have Solr throw syntax error if an empty string
: query occurs?
erick's explanation wasn't very precise ...
solr doesn't have any special handling of "empty strings", but what you
are searching for *might* be a totally valid query based on how the field
type is c
: I have a use case where the facet hierarchies as well as facet names change
: very frequently.
:
: For example:
: (Smartphones >> Android ) may become
: Smartphones >> GSM >> And roid.
:
: OR
:"Smartphone" could be renamed to "Smart Phone"
:
: If I use traditional hierarchical fa
I am using edismax when executing search against set of news articles. I
would like to also boost the scores of matched documents based on another
field in the documents which I will call "source" which can be set to 3
possible strings. So if the "source" field has a value "a", then I want
to mul
: Further info: I can make this work if I stay out of tomcat -- I
: download a fresh solr binary distro, copy those five JARs from 'dist'
: and 'contrib' into example/solr/lib/, copy my solrconfig.xml and
: schema.xml, and run 'java -jar start.jar', and it works fine. But
: trying to add those sa
What we do in our application is exactly what Robert described. We index
Products, not variants. The variant data (colour, size etc.) is denormalised
into the product document at index time. We then facet on the variant
attributes and get product count instead of variant count.
What you're see
Tommaso,
I apologize for my delayed response. Thank you very much for your time
looking into this!!
I will try to replicate your efforts on my end this week.
Respectfully,
Chris
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-with-UIMA-tp3863324p3898094.html
Sent from t
Is it possible that your fieldType definition for a_suggest is
stripping out the digits? Consider using TermsComponent
http://wiki.apache.org/solr/TermsComponent or the admin
page or Luke to examine the terms actually _in_ your
index. Or look at the admin/analysis page and give it some
sample input
Hmmm, not sure about the dataconfig.xml file. What
are you trying to index? Is this DIH? Because
if you're simply posting Solr-formatted XML docs,
dataconfig.xml is irrelevant
You say you're not seeing the output. One of two
things is going on:
1> The data is not in the index. See the admin/sc
Hmmm, works fine for me using the "popularity" field in
the default schema.
What version of Solr are you using? What is your complete
handler definition?
Best
Erick
On Mon, Apr 9, 2012 at 12:10 AM, ZHANG Liang F
wrote:
> Hi,
> I ran into a problem when trying range facet search. I had a schema
I _think_ you need to look at the Zookeeper information, perhaps
something like ZkController.getCloudState or some such?
Warning: I haven't been in that code, so this is just a guess. But
since the SolrCloud stuff has to know this kind of info in order
to do distributed indexing, it's got to be av
: possible strings. So if the "source" field has a value "a", then I want
: to multiply the score by 1. If the "source" field has a value "b", then I
: want to multiple the score by 2 ... and so on. What is the way to go about
: doing this ?
how long is your "and so on" list?
You could use the
The problem with that approach is that if you selected say large and red you'd
get back all the products with large and red as variants. Not the products with
red in the large size add would be expected.
Sent from my phone
- Reply message -
From: "Andrew Harvey [via Lucene]"
Date: Mon,
Those of you insomniacs who have read my messages here over the last
few weeks might recall that I've been working on a request handler
that wraps the SearchHandler to rewrite queries and then reorder
results.
(I haven't quite worked out how to apply Grant's alternative
suggestions without losing
Hi Erick,
Thanks for the response. I am trying to index xml files in a directory.
I provide the xpath details, file location etc in data-config.xml. I will
try the 2 approaches that you have mentioned.
Regards,
Rakesh Varna
On Mon, Apr 9, 2012 at 3:38 PM, Erick Erickson wrote:
> Hmmm, not sur
On Apr 9, 2012, at 7:34 PM, Benson Margulies wrote:
> Those of you insomniacs who have read my messages here over the last
> few weeks might recall that I've been working on a request handler
> that wraps the SearchHandler to rewrite queries and then reorder
> results.
>
> (I haven't quite worke
Sorry for the answer.
2012/3/29 Erick Erickson
> Alexander:
>
> Your images were stripped by one of our mail servers, so there's not
> much we can see ...
>
> But guessing, you aren't searching the fields you think you are:
> itemNameSearch:fifa12
> becomes
> itemNameSearch:fifa defaultSearchFie
That page seems to be saying that the 'distributed' APIs take place on
the leader, and the ordinary prepare/process APIs out at the leaves.
I'll set out to prove or disprove that tomorrow.
On Mon, Apr 9, 2012 at 8:17 PM, Mark Miller wrote:
>
> On Apr 9, 2012, at 7:34 PM, Benson Margulies wrote:
On Apr 9, 2012, at 9:52 AM, Benson Margulies wrote:
> I specify -Dcollection.configName=rnicloud, but the admin gui tells me
> that I have a collection named 'collection1'.
>
> And, as reported in a prior email, the admin UI URL in there seems wrong.
Sorry - that param name is not entirely cle
: > itemNameSearch:fifa defaultSearchField:12
: That's exactly what's happening! Why does this happen?
whyspace is meaningful to the query parser: it tells the query parser
there are multiple clauses for a boolean query.
if you want to search for any works the user typed in the field
"itemNam
Sending this on behalf of my friends at BasisTech -
Subject: Call for Presentations: Open Source Search Conference Oct. 2, 2012
(Chantilly, VA)
==
Call for Presentations & Save the Date
Open Source Search Conference Oct 2, 2012
(tutorials Oct. 1) in
We've updated the agenda and keynotes for the upcoming Lucene Revolution
conference, May 7-10 in Boston, MA. We've got a lot of the committers coming,
and Hoss' infamous "Stump the Chump" session, and many great talks. All we're
missing is you it's not too late to sign up ;)
http://w
Um, maybe I've hit a quirk?
In my solrconfig.xml, my special SearchComponents are installed only
for a specific QT. So, it looks to me as if that QT is not propagated
into the request out to the shards, and so they run the ordinary
request handler without my components in it.
Is this intended beh
Hi Erick,
The schema browser says that no dynamic fields were indexed. Any idea
how do I specify dynamic fields through XPath when I only know the prefix
and nothing else?
Regards,
Rakesh Varna
On Mon, Apr 9, 2012 at 4:49 PM, Rakesh Varna wrote:
> Hi Erick,
>Thanks for the response. I am
Yeah, that's how it works - it ends up hitting the select request handler (this
might be overridable with shards.qt) All the params are passed along, so in
general, it will act the same as the top level req handler - but it can the
remove the shards param so you don't have an infinite recursion
Is this planned as a future feature? Is it in the bug tracker as a feature
yet..just wondering how long until it is a feature. I could live without price
counts for a bit.
Sent from my phone
- Reply message -
From: "Martijn v Groningen-2 [via Lucene]"
Date: Mon, Apr 9, 2012 3:31 pm
S
Hi,
I just found the root cause. The definition for the 'long' type is not right.
the previous definition was:
which doesn't support range query! now I changed to : , and it's done!
Thanks a lot!
Liang
-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: 20
Here are my fields
101NGHGJGKGKLHJFKGJGKGK
the sequence field is from 300 bytes to 56K bytes, no spaces
I want to ngram from 3 to 8
NGH GHG HGJ ...
NGHG GHGJ HGJG ...
...
54 matches
Mail list logo