Nope!
if u r doing grouping then "matches" is always the total no. of results and
"ngroups" is the number of groups. Every groups can have some docs
belonging to it which can be anything according to provided parameter
"group.limit". If u get the sum of all the docs of each group, then it's
equival
Ummm.. I think there is some confusion here.
As per my understanding, matches is the total no of docs which the original
query/filter query returned. On these docs grouping is done. So matches may
not be actually equal to total no. of returned in your result, post
grouping. Its just a subset of th
Thanks Dipti!
One thing that seems off is that you are passing the bootstrap_confdir param on
each instance?
Other than that though, the problem you are seeing is indeed a bug - though
hidden if using localhost. I'll fix it here:
https://issues.apache.org/jira/browse/SOLR-3091
Again, thanks f
So it sounds like what you want is partial results. We don't support that yet,
but there is a JIRA issue for it.
Currently we require that the full index is available - which means if you want
to survive any given instance instance going down, you need to have a replica
for each shard A and sh
Thanks for the reply Mark.
I did example A. One of the instances had zookeeper. If I shut down the
other instance, all searches on the other (running) instance produced an
error in the browser.
I don't have the error handy but it was one line. Something like missing
shard in collection IIRC.
Hello.
I'm having issues getting MLT to work with content streams. If I make a
query for a term like 'tomcat' using the 'stream.body' parameter, I get back
a valid response from Solr. I use a curl command like this:
$ curl "http://host/solr/mlt?stream.body=tomcat&mlt.mintf=1";
(I've set a defa
> I am a beginner with Solr and would like to index
> dynamic php files ( page.php?ID=233) and static php files
> and .shtml files. This is for a small website, which
> hits a small MySql database on the backend, however some php
> files are static and are not part of the database.
>
> Can you t
Thanks Emmanuel,
I should have been more clear about my use of the word 'dynamic', I actually
meant PHP files that do not pull content from a database. I will take a look at
the links you provided, ...thanks again for putting me on the right path.
Steve
-Original Message-
From: Emmanue
What do you mean by static php files? As far as I know PHP is to make
pages look dynamic. If you want to index dynamic pages as they where
just HTML you will have to download them, and add them to Solr.
Programming a small program in SolrJ and using some HTTP library
(http://hc.apache.org/httpclien
Hi Mark,
I am trying to set up on 4 ami's, where 3 of the instances will have the
embedded ZK running. Here are the startup commands for all 4.
- Note that on the 4th instance I do not have the ZK host and bootstrap
conf dir specified. The 4th instance throws exception (earlier in this
email chain
Hi ,
I am a beginner with Solr and would like to index dynamic php files (
page.php?ID=233) and static php files and .shtml files. This is for a small
website, which hits a small MySql database on the backend, however some php
files are static and are not part of the database.
Can you tell m
On Feb 2, 2012, at 9:51 AM, dar...@ontrenet.com wrote:
> Hi,
> I want to use SolrCloud in a more federated mode rather than
> replication. The failover is nice, but I am more interested in
> increasing capacity of an index through horizontal scaling (shards).
>
> How can I configure shards such
Thanks Tomás,
We are trying many combinations on master/slave and commit configurations.
If we dont find the solution, we'll try NRT.
Once we've found an acceptable solution, we'll report back ...
2012/2/2 Tomás Fernández Löbbe
> The problem is that in order to make the changes visible to th
On Feb 2, 2012, at 14:54 , Dipti Srivastava wrote:
> Hrik,
> Thanks for you response. I have a question though...
> Will I be able to access the solr/admin console after disabling the
> velocity response writer? If so I can disbale it.
The admin console in Solr does not use the velocity response
> Ok let's assume we have the basic solr engine that's able to
> search and
> give result of urls...now from those pages, I would like to
> know which
> terms are the most mentioned, e.g. iPad, Samsung,
> Candy...the list can be
> long but we could decide to only output the top#20 or so.
I use mlt
Sincere apologies for the unclarity! I'm probably misusing technical terms
such 'category' ...
Ok let's assume we have the basic solr engine that's able to search and
give result of urls...now from those pages, I would like to know which
terms are the most mentioned, e.g. iPad, Samsung, Candy...th
Erick (sorry for missing the "c" previous :D),
After playing around with the edismax query parser, I'm starting to like it.
Originally I just wanted the simplest search feature to get started, but I can
see that I might take advantage of edismax's field booster feature later.
Turns out the tric
Hrik,
Thanks for you response. I have a question though...
Will I be able to access the solr/admin console after disabling the
velocity response writer? If so I can disbale it.
Otherwise, could you please advise on where and which libraries need to be
copied and if any changes in solrconfig.xml a
You're getting confused between default search fields and the dismax query
parser. Look in your solrconfig.xml file and you'll see a request
handler I think.
Take a look at:
http://wiki.apache.org/solr/DisMaxQParserPlugin
I think this will do what you want.
The catch-all field you mentioned is al
See below:
On Thu, Feb 2, 2012 at 12:28 PM, tiuser123 wrote:
> Hello Erick,
>
>
> Erick Erickson wrote
>>
>> Lucene (where the sorting happens) assembles a list of all the unique
>> *values* for a sort field and sorts the result set by comparing to that
>> list. It doesn't sort all the documents
Erik,
Thanks for your suggestions. After I made all field names [a-zA-Z0-9_] and
turned on debugQuery=true, I saw that the query was using something like
"text^0.5", which is beyond my current comprehension. I commented out those
"^0.5" type settings in solrconfig.xml. Now the search works a
: I just want to ask, for example for all products from the category "X" name
"Samsung"
before you do anything else, *PLEASE* start with the Solr tutorial, it
should really help you understand a lot of basic concepts in Solr...
http://lucene.apache.org/solr/tutorial.html
Another resource you
No, I just used a SolrJ program I had lying around that
dummied up some fields for the default schema. I was
just looking at how you were using the update request
Best
Erick
On Thu, Feb 2, 2012 at 10:13 AM, rzoao wrote:
> Hey guys, I appreciate everybody's help...
>
> I solved what was going
: 700ms. The problem is, we are using geofiltering and that is happening *
: before* the queries, so we have to geofilter on the 1B docs to restrict our
: set of docs first, and then do the query on a name field. But it seems that
...
: I've looked at the "cache=false" param, and the "co
Dipti - the fix is purely configuration. There is no later version of Solr
officially released at this point, but it has been corrected on the branch that
will eventually become 3.6 (as well as the 4.0/trunk branch).
Follow the instructions mentioned below to fix this in your environment.
Hi Erik,
The latest release from the apache site for Solr 3.5 still has this issue.
Where can I get a drop that has the fix for this issue?
Thanks,
Dipti
On 12/9/11 1:22 PM, "Erik Hatcher" wrote:
>My bad.
>
>To clarify the issue here... the problem manifests itself only on Solr
>3.5 specifically
: > Another alternative solution would be to add a category field to the
: > already crawled content.
: > >> Let's say Solr is setup and can return relevant urls. What if I wanted
: > to get the most cited terms from a predefined list, instead? It could be
: > from a list of products, names, citi
Hello Erick,
Erick Erickson wrote
>
> Lucene (where the sorting happens) assembles a list of all the unique
> *values* for a sort field and sorts the result set by comparing to that
> list. It doesn't sort all the documents per-se.
>
Hmm, still confused. What do you mean by "sorts by comparing
On Feb 2, 2012, at 11:17 , Yuhao wrote:
> Erik,
>
> You were right! The space in "Entrez ID" was the problem. It works fine
> after I got rid of all spaces and capital letters.
> Now I just have to come up with a way to display the original field names in
> the UI, which the users would pref
Erik,
You were right! The space in "Entrez ID" was the problem. It works fine after
I got rid of all spaces and capital letters.
Now I just have to come up with a way to display the original field names in
the UI, which the users would prefer.
Is there a way I can stick the display value (with
> I have the following problem. I am doing a search by street
> name, but I
> would like to use the fuzzy capabilities of Solr.
>
> My example:
> I search for street name:ALCALA (query: NVIAC:ALCALA) and I
> get this result:
>
> 079
> 2807900132
> 28
> MADRID
> ALCALA
> CALLE
>
>
> but... if I
There should only be one document matching that query (provided "Entrez ID" is
your unique key field name). Using a space in a field name is perhaps the
problem. It's way best practice that fields have only [a-zA-z0-9_] in them.
Maybe that space isn't the issue though, but try &debugQuery=tru
Erik,
Thanks for the slides. I followed the example on pages 24-25 (maybe too
rigidly). The first line is giving me trouble:
#set($doc= $response.results.get(0))
This will always get the first document in the search results, which happens to
be the first document I indexed. So, no matter wh
Hello,
I have the following problem. I am doing a search by street name, but I
would like to use the fuzzy capabilities of Solr.
My example:
I search for street name:ALCALA (query: NVIAC:ALCALA) and I get this result:
079
2807900132
28
MADRID
ALCALA
CALLE
but... if I do NVIAC:ALCALO~ I would
Hey guys, I appreciate everybody's help...
I solved what was going on:
my schema.xml had a UniqueKey tag that was specifying the id field as
required. Since I got the schema from the example folders, it came with
those extra configs.
Thanks for helping me!!!
Erick, did you create a new scheme.x
Hi,
I want to use SolrCloud in a more federated mode rather than
replication. The failover is nice, but I am more interested in
increasing capacity of an index through horizontal scaling (shards).
How can I configure shards such that they retain their own documents and
don't replicate (or replic
The problem is that in order to make the changes visible to the user you
have to issue a commit. If you commit with every user change (I assume you
may have concurrent users) you may have many commits per second. That's too
much for Solr, as each commit will flush a new segment, reopen an index
sea
First, sorting completely overrides scoring. So if you specify a sort,
scoring is essentially ignored. If you specify more than one sort, they
are applied in order. That is, any ties in the first sort parameter are
broken by the second sort parameter. If all sort parameters specified
tie, the inter
Hi,
I am talking about post grouping only.
5017
4062
gives you the total no. of groups. If you iterate through all groups and do
the sum of counts in each group, you will get the same number as in
"matches".
in above example total no. of groups is 4062 and total no. of docs is 5017.
if u get the su
I just tried your code in 3.x (with no DB connection) and it works fine,
so it looks like something in your environment
Longer term, it's a bad idea to commit (or even update) single
documents, you can use the
add(Collection wrote:
> Hi rzao!
>
> I think this is the problem:
>
>
> On 02.02.201
Good morning everyone,
I'm working on a project using solr 3.5, one master and two slaves.
We run a grails app, and it has an update function.
When the user click the button, we search for the message to be updated,
clone it using SolrJ, delete the old message and save the new one.
We do this up
Hi rzao!
I think this is the problem:
On 02.02.2012 13:59, rzoao wrote:
UpdateRequest req = new UpdateRequest();
req.setAction(AbstractUpdateRequest.ACTION.COMMIT, false,
false);
Hey Preetesh Dubey,
thanks for you help, I tried it that way and I'm getting the same error over
here: Bad Request
I'm wondering if it's not my schema.xml ... or something else that I don't
know.
Joao
--
View this message in context:
http://lucene.472066.n3.nabble.com/Bad-Request-Solr-Weblogic
No my question is post grouping.
If group.limit=5 then a group may have 5 docs or less. So if i have to find
out total number of docs returned post grouping i will have to see the count
of docs in each group which may be any one of these - 5 or 4 or 3 or 2 or 1.
So does that mean i have to iterate
Instead of CommonHttpSolrServer, I am using StreamingUpdateSolrServer and
just doing "server.add(solrInputDocument)".
/**
StreamingUpdateSolrServer server = new StreamingUpdateSolrServer(solrurl,
2, indexingThreads);
server.add(solrInputDocument);
**/
On Thu, Feb 2, 2012 at 6:29 PM, rzoao wrote:
Hello all,
I'm new to solr and I'm having a hard time trying to configure the
schema.xml for a simple table.
I'm using WebLogic with standard configs and Oracle Database. I have a
simple table with the following fields
Fase varchar
Campo varchar
Antes varchar
Depois varchar
Regras varchar
My sc
"ngroups" XML response shows the the Total no. of groups and "matches"
shows the actual number of docs.
On Thu, Feb 2, 2012 at 6:23 PM, tamanjit.bin...@yahoo.co.in <
tamanjit.bin...@yahoo.co.in> wrote:
> You are right it does. Kind of missed this all together. Thanks
>
> And anyway to know the
First of all, what evidence do you have that you even need to shard?
12 M documents is quite a small index by Solr standards, just test it
and see.
As far as replication, 10 minutes is probably a good place to start, but
you can experiment with reducing it. I've often found that "real time" is
usu
You are right it does. Kind of missed this all together. Thanks
And anyway to know the actual number of docs returned post grouping?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3709743.html
Sent from the Solr - Use
Why do you care? That is, what is it you're doing that
is sensitive to the order of the return relative to the
order defined?
Best
Erick
On Wed, Feb 1, 2012 at 3:27 PM, Yuhao wrote:
> $doc.getFieldNames() will give you a list of field names as defined in your
> schema.xml file. However, the or
group.ngroups=true doesnt give you the desired result?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3709711.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi all
I have a NullPointerException in SolrJetty when using
CommonsHttpSolrServer.add(SolrInputDocument) that I can not solveplease
help me.
>From the core stats, the document does not result to be indexed.
The stackTrace is:
2-feb-2012 13.05.04 org.apache.solr.common.SolrException log
GRAVE
Try sorting on category. That should do.
http://wiki.apache.org/solr/CommonQueryParameters#sort
http://wiki.apache.org/solr/CommonQueryParameters#sort
--
View this message in context:
http://lucene.472066.n3.nabble.com/Controll-of-results-output-tp3709288p3709691.html
Sent from the Solr - User
I tried using grouping. Actually what works for me is group.main=true.
However there is an issue. Collapsing, as in the earlier version when it
returned docs, it gave a count of number of docs returned post collapsing.
But I could not find a similar feature in grouping..
For eg. if before groupin
Hello, Chris.
Thank you and Mikhail for the explanation. I'll think about changing the model
of indexing to be able to handle this case.
> : DIR:true
> : PATH:/root/folder1/folder2/
> : NAME:folder3
> : SIZE:0
> ...
> : DIR:false
> : PATH:/root/folder1/folder2/folder3/
> : NAME:image.jpg
Hello new user here,
Would just like to clarify the behavior of the solr/lucene sort param.
In this post:
http://lucene.472066.n3.nabble.com/Lucene-sort-performance-roots-tp3102493p3104294.html
I somehow got the impression that solr would do the sort only on the top
ranking documents taken from
56 matches
Mail list logo