>
> On Wed, Jun 15, 2016 at 8:36 AM, Gary Yao wrote:
>> Hi all,
>>
>> My team at work maintains a SolrCloud 5.3.2 cluster with multiple
>> collections configured with sharding and replication.
>>
>> We recently backed up our Solr indexes using the b
Hi all,
My team at work maintains a SolrCloud 5.3.2 cluster with multiple
collections configured with sharding and replication.
We recently backed up our Solr indexes using the built-in backup
functionality. After the cluster was restored from the backup, we
noticed that atomic updates of documen
say, an optimize.
>
> So called "bottom line" is that yes, the scoring may change, but
> IMO not any more radically than was possible with single cores,
> and I wouldn't worry about unless I had evidence that it was
> biting me.
>
> Best
> Erick
>
> On
Hi Guys,
As the main scoring mechanism is based tf/idf, so will same query running
against SolrCloud return different result against running it against single
core with same data sets as idf will only count df inside one core?
eg: Assume I have 100GB data:
A) Index those data using single core
B)
and have no updates in between. But even then it may be a
> waste of time.
>
> You need lots of free disk space for merging, whether a forced merge or
> automatic. Free space equal to the size of the index is usually enough, but
> worst case can need double the size of the index.
out.
>
> Otis
> Solr & ElasticSearch Support
> http://sematext.com/
> On Jan 9, 2013 6:07 PM, "Yandong Yao" wrote:
>
> > Any comments on this? Thanks very much in advance!
> >
> > 2013/1/9 Yandong Yao
> >
> > > Hi Solr Guru,
> > >
Any comments on this? Thanks very much in advance!
2013/1/9 Yandong Yao
> Hi Solr Guru,
>
> I have two set of documents in one SolrCore, each set has about 1M
> documents with different document type, say 'type1' and 'type2'.
>
> Many documents in
Hi Solr Guru,
I have two set of documents in one SolrCore, each set has about 1M
documents with different document type, say 'type1' and 'type2'.
Many documents in first set are very similar with 1 or 2 documents in the
second set, What I want to get is: for each document in set 2, return the
mo
Hi Shalin,
Thanks very much for your detailed explanation!
Regards,
Yandong
2012/8/21 Shalin Shekhar Mangar
> On Tue, Aug 21, 2012 at 8:47 AM, Yandong Yao wrote:
>
> > Hi guys,
> >
> > From http://wiki.apache.org/solr/MergingSolrIndexes, it said 'Using
> >
Hi guys,
>From http://wiki.apache.org/solr/MergingSolrIndexes, it said 'Using
"srcCore", care is taken to ensure that the merged index is not corrupted
even if writes are happening in parallel on the source index'.
What does it means? If there are deletion request during merging, will this
delet
Hi Guys,
Steps to reproduce:
1) Download apache-solr-4.0.0-ALPHA
2) cd example; java -jar start.jar
3) cd exampledocs; ./post.sh *.xml
4) Use statsComponent to get the stats info for field 'popularity' based on
facet 'cat'. And the 'count' for 'electronics' is 3
http://localhost:8983/solr/coll
rks against the automation in
> solrcore, but maybe there's a good reason you want to do it this way.
> >
> > --- Original Message ---
> > On 5/22/2012 07:35 AM Yandong Yao wrote:Hi Darren,
> >
> > Thanks very much for your reply.
> >
> > The reason I wa
or you, therefore when you try to search a node/core
> with no documents, all the results from the "cloud" are retrieved
> regardless. This is considered "A Good Thing".
>
> It requires a change in thinking about indexing and searching
>
> On Tue, 2012-05
Hi Guys,
I use following command to start solr cloud according to solr cloud wiki.
yydzero:example bjcoe$ java -Dbootstrap_confdir=./solr/conf
-Dcollection.configName=myconf -DzkRun -DnumShards=2 -jar start.jar
yydzero:example2 bjcoe$ java -Djetty.port=7574 -DzkHost=localhost:9983 -jar
start.jar
I have similar issues by using DIH,
and org.apache.solr.update.DirectUpdateHandler2.addDoc(AddUpdateCommand)
consumes most of the time when indexing 10K rows (each row is about 70K)
- DIH nextRow takes about 10 seconds totally
- If index uses whitespace tokenizer and lower case filter, th
.
Modified the web.xml to map different modules to different roles.
-Yao
-Original Message-
From: Em [mailto:mailformailingli...@yahoo.de]
Sent: Monday, February 13, 2012 11:05 AM
To: solr-user@lucene.apache.org
Subject: Re: Permissions and user to acess administrative interface
Hi Anderson
Hi Guys,
I am using Solr 3.5, and would like to use a fq like
'getField(getDoc(uuid:workspace_${workspaceId})), "isPublic"):true?
- workspace_${workspaceId}: workspaceId is indexed field.
- getDoc(uuid:concat("workspace_", workspaceId): return the document whose
uuid is "workspace_${workspaceI
Sounds like WordDelimiterFilter config issue, please refer to
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory
.
Also it will help if you could provide:
1) Tokenizers/Filters config in schema file
2) analysis.jsp output in admin page.
2010/10/26 wu liu
After upgrading to 1.4.1, it is fixed.
Thanks very much for your help!
Regards,
Yandong Yao
2010/9/14 yandong yao
> Hi Robert,
>
> I am using solr 1.4, will try with 1.4.1 tomorrow.
>
> Thanks very much!
>
> Regards,
> Yandong Yao
>
> 2010/9/14 Robert Muir
>
Hi Robert,
I am using solr 1.4, will try with 1.4.1 tomorrow.
Thanks very much!
Regards,
Yandong Yao
2010/9/14 Robert Muir
> did you index with solr 1.4 (or are you using solr 1.4) ?
>
> at a quick glance, it looks like it might be this:
> https://issues.apache.org/jira/brow
Hi Guys,
I encountered a problem when enabling WordDelimiterFilterFactory for both
index and query (pasted relative part of schema.xml at the bottom of email).
*1. Steps to reproduce:*
1.1 The indexed sample document contains only one sentence: "This is a
TechNote."
1.2 Query is: q=TechNo
> you could satisfy this by making 2 fields:
> > 1. exactmatch
> > 2. wildcardmatch
> >
> > use copyfield in your schema to copy 1 --> 2 .
> >
> > q=exactmatch:mount+wildcardmatch:mount*&q.op=OR
> > this would score exact matches above (solely) wildcar
rationale is that if search 'mounted', I also want documents with 'mount'
match.
So seems built-in wildcard search could not satisfy my requirements if i
understand correctly.
Thanks very much!
2010/8/9 Bastian Spitzer
> Wildcard-Search is already built in, just use:
Hi everyone,
How to support 'implicit trailing wildcard *' using Solr, eg: using Google
to search 'umoun', 'umount' will be matched , search 'mounta', 'mountain'
will be matched.
>From my point of view, there are several ways, both with disadvantages:
1) Using EdgeNGramFilterFactory, thus 'umou
Modify all settings in solrconfig.xml and try again, by
default solr will only index the first 1 fields.
Best Regards,
Yandong
-Original Message-
From: Mint o_O! [mailto:mint@gmail.com]
Sent: 2010年6月3日 13:58
To: solr-user@lucene.apache.org
Subject: Re: Solr Search problem; cann
My motivation is more from the performance prospective than functional
prospective. I was hoping by opening the Solr index/core read-only,
underlying Lucene IndexReader can be opened in read-only mode for optimum
query performance (removing the overhead of multi-thread management).
--
View this m
Is there a way to open a Solr index/core in read-only mode?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-read-only-core-tp843049p843049.html
Sent from the Solr - User mailing list archive at Nabble.com.
and got
1,500/minute throughput by only using a little more than 0.5 GB of Heap
Memory.
The hit ratio reported in Solr admin statistics page shows filterCache
has a hitratio of 0.99. with 103800 lookups and 103773 hits, I assume it
is 99%.
Have a nice day.
-Yao
on you can help with your
insight with just the little fraction of information provided. Thanks again!
-Yao
--
View this message in context:
http://lucene.472066.n3.nabble.com/Date-faceting-and-memory-leaks-tp824372p825059.html
Sent from the Solr - User mailing list archive at Nabble.com.
Chris,
Thanks for the detailed response. No I am not using Date Facet but Facet
Query as for facet display. Here is the full configuration of my "dismax"
query handler:
dismax
explicit
0.01
title text^0.5 domain^0.1 nature^0.1 author
title
No I still have the OOM issue with repeated facet query request on the date
field. I forgot to mention that I am running 64-bit IBM 1.5 JVM. I also
tried the Sun 1.6 JVM with and without your GC arguments. The GC pattern is
different but the heap size does not drop as the test going on. I tested
w
I do not have any GC specific setting in command line. I had tried to
force GC collection via Jconsole at the end of the run but it didn't
seems to do anything the heap size.
-Yao
-Original Message-
From: Antonio Lobato [mailto:alob...@symplicity.com]
Sent: Monday, May 17, 2010 2:
I have been running load testing using JMeter on a Solr 1.4 index with ~4
million docs. I notice a steady JVM heap size increase as I iterator 100
query terms a number of times against the index. The GC does not seems to
claim the heap after the test run is completed. It will run into OutOfMemory
It appears the hl.maxAlternateFieldLength parameter default setting in
solrconfig.xml does not take effect. I can only get it to work by explicitly
sending the parameter via the client request. It is not big deal but it
appears to be a bug.
--
View this message in context:
http://old.nabble.com/
I am struggling with the concept of delta import in DIH. According the to
documentation, the delta import will automatically record the last index
time stamp and make it available to use for the delta query. However in many
case when the last_modified date time stamp in the database lag behind the
For Data Import Handler, there is a way to dump data to a SOLR feed format
XML file?
--
View this message in context:
http://old.nabble.com/DIH---Export-to-XML-tp26138213p26138213.html
Sent from the Solr - User mailing list archive at Nabble.com.
Yes. I think would be very helpful tool for tunning search relevancy - you
can do a controlled experiment with your target audiences to understand
their responses to the parameter changes. We plan to use this feature to
benchmark Lucene/SOLR against our in-house commercial search engine - it
will
Are your product_name* fields numeric fields (integer or float)?
Dals wrote:
>
> Hi...
>
> Is there any way to group values like shopping.yahoo.com or
> shopper.cnet.com do?
>
> For instance, I have documents like:
>
> doc1 - product_name1 - value1
> doc2 - product_name1 - value2
> doc3 - p
If you can reindex, simply rebuild the index with fields replaced by
combining existing fields.
-Yao
-Original Message-
From: David Lojudice Sobrinho [mailto:dalss...@gmail.com]
Sent: Thursday, August 06, 2009 4:17 PM
To: solr-user@lucene.apache.org
Subject: Item Facet
Hi...
Is there
Having a large number of fields is not the same as having a large number of
facets. To facets are something you would display to users as aid for query
refinement or navigation. There is no way for a user to use 3700 facets at
the same time. So it more of question on how to determine what facets t
A couple of things, your mlt.fl value, must be part of fl. In this case,
content_mlt is not included in fl.
I think the fl parameter value need to be comma separated. try
fl=title,author,content_mlt,score
-Yao
SergeyG wrote:
>
> Hi,
>
> Recently, while implementing the MoreLik
Try with fl=* or fl=*,score added to your request string.
-Yao
Yang Lin-2 wrote:
>
> Hi,
> I have some problems.
> For my solr progame, I want to type only the Query String and get all
> field
> result that includ the Query String. But now I can't get any result
> with
use Solr's Filter Query parameter "fq":
fq=x:[10 TO 100]&fq=y:[20 TO 300]&fl=title
-Yao
huenzhao wrote:
>
> Hi all:
>
> Suppose that my index have 3 fields: title, x and y.
>
> I know one range(10 < x < 100) can query liks this:
>
>
faceting remains the same as standard
request handler.
-Yao
Yao Ge wrote:
>
> Does Solr support faceting on MoreLikeThis search results?
>
--
View this message in context:
http://www.nabble.com/Faceting-with-MoreLikeThis-tp24356166p24380459.html
Sent from the Solr - User mailing list a
The answer to my owner question:
...
...
would work.
-Yao
Yao Ge wrote:
>
> I am not sure about the parameters for MLT the requestHandler plugin. Can
> one of you share the solrconfig.xml entry for MLT? Thanks in advance.
> -Yao
>
>
> Bill Au wrote:
>
I am not sure about the parameters for MLT the requestHandler plugin. Can one
of you share the solrconfig.xml entry for MLT? Thanks in advance.
-Yao
Bill Au wrote:
>
> I have been using the StandardRequestHandler (ie /solr/select). fq does
> work with the MoreLikeThisHandler. I wi
I could not find any support from http://wiki.apache.org/solr/MoreLikeThis on
how to restrict MLT results to certain subsets. I passed along a fq
parameter and it is ignored. Since we can not incorporate the filters in the
query itself which is used to retrieve the target for similarity comparison
Does Solr support faceting on MoreLikeThis search results?
--
View this message in context:
http://www.nabble.com/Faceting-with-MoreLikeThis-tp24356166p24356166.html
Sent from the Solr - User mailing list archive at Nabble.com.
I will like to submit a JIRA issue for this. Can anyone help me on where to
go?
-Yao
Otis Gospodnetic wrote:
>
>
> Brian,
>
> Opening a JIRA issue if it doesn't already exist is the best way. If you
> can provide a patch, even better!
>
> Otis
> --
&
/download.html).
Yao Ge wrote:
>
> FYI. I did a direct integration with Carrot2 with Solrj with a separate
> Ajax call from UI for top 100 hits to clusters terms in the two text
> fields. It gots comparable performance to other facets in terms of
> response time.
>
> In terms of alg
" which I don't
reconize. But I think at least one of them might have used SVD
(http://en.wikipedia.org/wiki/Singular_value_decomposition).
-Yao
Otis Gospodnetic wrote:
>
>
> I'd call it related (their application in search encourages exploration),
> but also dist
Thanks for insight Otis. I have no awareness of ClusteringComponent until
now. It is time to move to Solr 1.4
-Yao
Otis Gospodnetic wrote:
>
>
> Yao,
>
> Solr can already cluster top N hits using Carrot2:
> http://wiki.apache.org/solr/ClusteringComponent
>
> I
over top N docs might be more attractive. Another features I can
really appreciate is to provide search time n-gram term clustering. Maybe
this might be better suited for "spell checker" as it just a different way
to display the alternative search terms.
-Yao
Michael Ludwig-4 wrote:
&
If I want use OR operator with mutile query filters, I can do:
fq=popularity:[10 TO *] OR section:0
Is there a more effecient alternative to this?
--
View this message in context:
http://www.nabble.com/Query-Filter-fq-with-OR-operator-tp23895837p23895837.html
Sent from the Solr - User mailing li
Yes. I am using 1.3. When is 1.4 due for release?
Yonik Seeley-2 wrote:
>
> Are you using Solr 1.3?
> You might want to try the latest 1.4 test build - faceting has changed a
> lot.
>
> -Yonik
> http://www.lucidimagination.com
>
> On Thu, Jun 4, 2009 at 12:01 PM
I am index a database with over 1 millions rows. Two of fields contain
unstructured text but size of each fields is limited (256 characters).
I come up with an idea to use visualize the text fields using text cloud by
turning the two text fields in facets. The weight of font and size is of
each
t; into so called n-grams. You can see that if you open up the SC index with
> something like Luke. Please see
> http://wiki.apache.org/jakarta-lucene/SpellChecker .
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> - Original
pos that I need to run spell check on. But how can these original
data be used as a base for spell checking? How does Solr know what are
correctly spelled words?
...
...
Yao Ge wrote:
>
> Can someone help providing a tutorial like introduction on how to get
>
dictionary or
word list?
Please help.
Grant Ingersoll-6 wrote:
>
> Have you gone through: http://wiki.apache.org/solr/SpellCheckComponent
>
>
> On Jun 2, 2009, at 8:50 AM, Yao Ge wrote:
>
>>
>> Can someone help providing a tutorial like introduction on how to get
>
Can someone help providing a tutorial like introduction on how to get
spell-checking work in Solr. It appears many steps are requires before the
spell-checkering functions can be used. It also appears that a dictionary (a
list of correctly spelled words) is required to setup the spell checker. Can
It took me a while to understand that to use the Template Transfomer
(http://lucene.apache.org/solr/api/org/apache/solr/handler/dataimport/TemplateTransformer.html),
all building variable names (e.g. ${e.firstName} ${e.lastName} etc). can not
contain null values. I hope the parser can do a better
Maybe you want to try with docNumber field type as "string" and see it would
make a difference.
CB-PO wrote:
>
> I'm not quite sure what logs you are talking about, but in the
> tomcat/logs/catalina.out logs, i found the following [note, i can't
> copy/paste, so i am typing up a summary]:
>
>
ied to add
single and double quote around the field name but didn't help.
Can field name contain hyphen in boot functions? How to do it? If not, where
do I find the field name special character restrictions?
-Yao
--
View this message in context:
http://www.nabble.com/Query-
Hi,
I configured solr to listen on postOptimize event and call the
snapshooter program after an optimize command. It works well when the
Java heap size is set to less than 4G. But if I increased the java heap
size to 5G, the snapshooter program can't be successfully called after
the optimize
uggestion. Let me
know if someone has already done this and there are better ways or
suggestions to accomplish this. I figured solr's caching will make this
type of application more efficient than a straight Lucene integration.
Thanks.
-Yao
There is an bug related to "facet.mincount" in incubating version.
http://www.mail-archive.com/solr-user@lucene.apache.org/msg03269.html
-Yao
-Original Message-
From: escher2k [mailto:[EMAIL PROTECTED]
Sent: Tuesday, May 01, 2007 2:00 AM
To: solr-user@lucene.apache.org
Subjec
XML schema?
-Yao
Never mind. I should have read the example
(http://wiki.apache.org/solr/SimpleFacetParameters#head-1da3ab3995bc4abc
dce8e0f04be7355ba19e9b2c) first.
From: Ge, Yao (Y.)
Sent: Thursday, April 19, 2007 10:41 PM
To: 'solr-user@lucene.apache.org'
Subject: F
When mutiple facet queries are specified, are they booleaned as OR or
AND?
-Yao
69 matches
Mail list logo