OK, so I did yet another test. I stopped Solr, removed the whole "data/" dir
and started Solr again. Directories were recreated fine, but missing files
were not downloaded from leader. Log is attached (I took the lines related
to my test with 2 lines of context. I hope it helps.). I could find the
foll
On 23 January 2013 01:26, hassancrowdc wrote:
> sorry if it is a stupid question but where can i find result.xml and where
> do i write this program? any hints?
[...]
The result XML referred to is the XML returned by
Solr as a response to a query (if you prefer, you
can instead get JSON by adding
Hello Jack,
Thanks a lot for the answer. I want something like what you have
specified: I want the tokenized words to be added to the response. Could
you give me some details about a custom search component?
Thanks and regards,
Romita
From: "Jack Krupansky"
To: ,
Date: 01/21/2013 11:
For the Zk quorum issue, we'll put nodes in 3 different AZ's so we can lose
1 AZ and still establish quorum with the other 2.
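
The quorum arithmetic behind this can be checked with a short sketch. It assumes only the usual ZooKeeper rule that a strict majority of the configured ensemble must be reachable; the helper names and the AZ labels are made up for illustration, and the DC1/DC2 layout is the one raised elsewhere in this thread.

```python
def has_quorum(total_nodes, reachable_nodes):
    """ZooKeeper needs a strict majority of the configured ensemble."""
    return reachable_nodes > total_nodes // 2


def survives_loss_of(layout, lost_site):
    """layout maps a site name (hypothetical) to its number of ZK nodes."""
    total = sum(layout.values())
    remaining = total - layout[lost_site]
    return has_quorum(total, remaining)


two_dcs = {"DC1": 2, "DC2": 1}                # ZK1 + ZK2 in DC1, ZK3 in DC2
three_azs = {"AZ1": 1, "AZ2": 1, "AZ3": 1}    # one ZK node per AZ

for layout in (two_dcs, three_azs):
    for site in layout:
        state = "quorum" if survives_loss_of(layout, site) else "no quorum"
        print(site, "lost ->", state)
```

With two sites, one of them always holds the majority, so losing that site loses the quorum; with one node in each of three sites, any single site can go away.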
On Tue, Jan 22, 2013 at 10:44 PM, Timothy Potter wrote:
> Hi Markus,
>
> Thanks for the insight. There's a pretty high cost to using the approach
> you suggest in that I'd
Hi Markus,
Thanks for the insight. There's a pretty high cost to using the approach
you suggest in that I'd have to double my node count which won't make my
acct'ing dept. very happy.
As for cross AZ latency, I'm already running my cluster with nodes in 3
different AZ's and our distributed query
You should look at the log of solr-shard-4; it seems that some error occurred in
this shard.
--
from Jun Wang
Thank you Gopal,
I've updated the wiki accordingly.
Best
Lewis
On Tue, Jan 22, 2013 at 5:58 PM, Gopal Patwa wrote:
> These classes were deprecated from 4.0 and it is replaced with
> ConcurrentUpdateSolrServer and HttpSolrServer
>
> On Tue, Jan 22, 2013 at 5:28 PM, Lewis John Mcgibbney <
> lewis.
These classes were deprecated as of 4.0 and have been replaced with
ConcurrentUpdateSolrServer and HttpSolrServer.
On Tue, Jan 22, 2013 at 5:28 PM, Lewis John Mcgibbney <
lewis.mcgibb...@gmail.com> wrote:
> Hi All,
>
> As above, I am upgrading our application and I wish to give it a facelift
> to use t
Hi Otis,
Yes, my data is in HBase and I just need a fast spatial index where I can do
lookups and then take the IDs back to HBase to retrieve the results. HBase
doesn't support polygon searches that I'm aware of. You can do bounding box
queries but that doesn't meet my requirements. I thought a
Hi All,
As above, I am upgrading our application and I wish to give it a facelift
to use the new shiny 4.1.0 deps.
I notice that both of the packages above are no longer included in the
4.1.0 solr-solrj artifact.
Can someone please explain what I should replace this with?
I also notice that the w
Hi Robi,
Boosts don't do anything for warmup queries if that is what you're after...
Otis
Solr & ElasticSearch Support
http://sematext.com/
On Jan 22, 2013 8:08 PM, "Petersen, Robert" wrote:
> Hi guys,
>
> I was wondering if there was a way to pass commonly used boost values in
> with commonly
Solr/Lucene does not implement strict boolean logic. Here's an
excellent blog discussing this:
http://searchhub.org/dev/2011/12/28/why-not-and-or-and-not/
Best
Erick
On Tue, Jan 22, 2013 at 7:25 PM, Otis Gospodnetic
wrote:
> Well, depends on what you indexed.
>
> Otis
> Solr & ElasticSearch Sup
Aside from the latency, how would you deal with the Zookeeper quorum?
Say DC1 had ZK1 and ZK2, and DC2 had ZK3.
Now anytime any server in DC2 can't talk to DC1, there is no Zookeeper
quorum. So if DC1 goes down, having nodes in DC2 doesn't do you any
good since there's no ZK quorum. I guess things
Hi guys,
I was wondering if there was a way to pass commonly used boost values in with
commonly used filter queries in these solrConfig event handler sections. Could
I just append the ^1.5 at the end of the fq value? I.e., can I do this:
taxonomyCategoryTypeId:1^1.5
Or perhaps thi
Don't quite know. First of all, this'll be expensive, so it's probably a
good idea to ask whether you really need this kind of paging.
But extrapolating from the single node case where ties are broken by
internal document ID, I'd expect that ties are broken by some
combination of internal Lucene doc
Well, I can't find the class anywhere in the source code. Nearest I
can find is ElevatedMarkerFactory, so perhaps it's a typo.
Best
Erick
On Tue, Jan 22, 2013 at 9:55 AM, eShard wrote:
> Good morning,
> I can't seem to figure out how to load this class
> Can someone please point me in the right
Well, depends on what you indexed.
Otis
Solr & ElasticSearch Support
http://sematext.com/
On Jan 22, 2013 5:48 PM, "Anders Melchiorsen"
wrote:
> Thanks, though I am still confused.
>
> How about this one:
>
> manu:apple => 1 hit
> +name:video => 2 hits
>
> manu:apple +name:video => 2 hits
>
> So
Hi David,
You made my day by fixing https://issues.apache.org/jira/browse/LUCENE-4550,
THANKS! I already applied a custom "binary patch" to my setup.
As it happens, the project I am working on is getting ready for another
production release, so implementing BBoxStrategy is not an option at thi
Hi,
This returns 1 result:
select?q=gmid:1001&debugQuery=true
This returns 0 results:
select?qt=tvrh&tv.all=true&tv.fl=body&q=gmid:1001&debugQuery=true
Output of second query:
0
0
body
body
Any ideas on how to start troubleshooting this? my TermVector config looks
like:
Thanks, though I am still confused.
How about this one:
manu:apple => 1 hit
+name:video => 2 hits
manu:apple +name:video => 2 hits
Solr ignores the manu:apple part completely?
Cheers,
Anders.
On 22/01/13 23.16, Jack Krupansky wrote:
The first query:
name:ipod OR -name:ipod => 0 hits
The first query:
name:ipod OR -name:ipod => 0 hits
The "OR" and "-" are actually at the same level of the BooleanQuery, so the
"-" overrides the OR so it's equivalent to:
name:ipod -name:ipod => 0 hits
For the second query:
(name:ipod) OR (-name:ipod) => 3 hits
Pure negative queri
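
The behaviour described above can be imitated with plain sets. This is a simplified model, not Lucene's actual implementation: it assumes one flat level of required, optional and prohibited clauses, and the document IDs are invented so that the hit counts match the ones quoted in this thread.

```python
def boolean_query(all_docs, must=(), should=(), must_not=()):
    """Evaluate one flat BooleanQuery level over sets of document IDs."""
    if must:
        hits = set(all_docs)
        for clause in must:
            hits &= clause
        # With a required clause present, optional clauses only affect scoring.
    elif should:
        hits = set().union(*should)
    else:
        # A purely negative level has nothing to subtract from.
        hits = set()
    for clause in must_not:
        hits -= clause
    return hits


ALL = set(range(32))   # 32 example documents (made-up IDs)
ipod = {0, 1, 2}       # name:ipod  => 3 hits
apple = {0}            # manu:apple => 1 hit
video = {3, 4}         # name:video => 2 hits

# name:ipod OR -name:ipod: both clauses sit on the same level
flat = boolean_query(ALL, should=[ipod], must_not=[ipod])

# (name:ipod) OR (-name:ipod): the nested pure negative matches nothing
nested_negative = boolean_query(ALL, must_not=[ipod])
grouped = boolean_query(ALL, should=[ipod, nested_negative])

# manu:apple +name:video: the optional clause does not restrict the hits
required = boolean_query(ALL, must=[video], should=[apple])

print(len(flat), len(grouped), len(required))
```

Solr special-cases a pure negative query at the top level by matching it against all documents, which is why -name:ipod on its own returned 29 hits; the sketch models only the nested case.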
On Tue, Jan 22, 2013 at 4:37 PM, Marcin Rzewucki wrote:
> Sorry, my mistake. I did 2 tests: in the 1st I removed just index directory
> and in 2nd test I removed both index and tlog directory. Log lines I've
> sent are related to the first case. So Solr could read tlog directory in
> that moment.
Hi,
Regarding availability; since SolrCloud is not DC-aware at this moment we
'solve' the problem by simply operating multiple identical clusters in
different DCs and send updates to them all. This works quite well but it
requires some manual intervention if a DC is down due to a prolonged DOS
Hello!
With the example server of Solr 4.0.0 (with *.xml indexed), I get these
results:
*:* => 32 hits
name:ipod => 3 hits
-name:ipod => 29 hits
That is all fine, but for these next queries, I would expect to get 32
hits (i.e. everything), or at least the same number of hits for both
querie
No idea - logs might help.
- Mark
On Jan 22, 2013, at 4:37 PM, Marcin Rzewucki wrote:
> Sorry, my mistake. I did 2 tests: in the 1st I removed just index directory
> and in 2nd test I removed both index and tlog directory. Log lines I've
> sent are related to the first case. So Solr could read
Sorry, my mistake. I did 2 tests: in the 1st I removed just index directory
and in 2nd test I removed both index and tlog directory. Log lines I've
sent are related to the first case. So Solr could read tlog directory in
that moment.
Anyway, do you have an idea why it did not download files from le
The logging shows that it's finding transaction log entries.
Are you doing anything else while bringing the nodes up and down? Indexing? Are
you positive you removed the tlog files? It can't really have any versions if it
doesn't read them from a tlog on startup...
- Mark
On Jan 22, 2013, at 3:
Thank you Erick for that great tip on getting a listing of the Cores.
O. O.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Delete-all-Documents-in-the-Example-Solr-4-0-tp4035156p4035454.html
Sent from the Solr - User mailing list archive at Nabble.com.
Moving shards is done via replication. Replication accepts an index from
another SolrCore, and it cares nothing about what data directory it lived in.
It only cares about its local data directory - it will merge/replace into its
local data directory. It will retrieve the index data through htt
This is fixed now. Thanks again, Alex. - Steve
On Jan 22, 2013, at 2:53 PM, Alexandre Rafalovitch wrote:
> Not even sure whom to tell: http://lucene.apache.org/solr/tutorial.html (v4
> of the tutorial is there, v3 of the tutorial is a dead link).
>
> Regards,
> Alex.
> Personal blog: http://
Thanks for reporting Alex, I'll take care of it. - Steve
On Jan 22, 2013, at 2:53 PM, Alexandre Rafalovitch wrote:
> Not even sure whom to tell: http://lucene.apache.org/solr/tutorial.html (v4
> of the tutorial is there, v3 of the tutorial is a dead link).
>
> Regards,
> Alex.
> Personal blog
Dmitry,
Solr faceting is really fast due to its in-memory approach (with a few
notable exceptions), hence spans should be slower. Reading term
positions/payloads always has a noticeable impact. You can estimate it if you
compare time for a phrase query "foo bar" with a plain conjunction +
Just remove the run-on "but" part from the URL:
http://blog.griddynamics.com/2011/10/solr-experience-search-parent-child.html
Common issue when URLs are auto-highlighted by browsers.
Regards,
Alex.
Personal blog: http://blog.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandreraf
Dmitry,
I have no idea why the link is broken. I suggest you look for the "solr
experience: search parent-child relations. part iii" post at
blog.griddynamics.com.
On Mon, Jan 21, 2013 at 5:25 PM, Dmitry Kan wrote:
> Mikhail,
>
> the griddynamics blog's link returns "Sorry, the page you were
Hi, it's possible older XML dependencies did not check the contents of the
data properly, or maybe something else. But your XML is certainly broken.
Stripping bad chars will most likely fix your problem.
-Original message-
> From:Sujatha Arun
> Sent: Tue 22-Jan-2013 19:25
> To
Yes. Write a program to consume the result xml and then spit it back out
the way you'd like to see it.
cheers,
Travis
On Tue, Jan 22, 2013 at 1:23 PM, hassancrowdc wrote:
> ?
>
>
> On Tue, Jan 22, 2013 at 12:24 PM, hassancrowdc [via Lucene] <
> ml-node+s472066n4035390...@n3.nabble.com> wro
I'm glad to be of help.
This is all possible using Solr 4 without programmatic customization. As
always, remember the docs:
http://wiki.apache.org/solr/SolrAdaptersForLuceneSpatial4#Indexing
That has an example of indexing a circle. Not much to it.
The only aspect of the SpatialRecursivePrefixTree
Thanks for the pointer, but given the same index code, why does this not
work in Solr 3.6.1 but works fine in Solr 1.3?
Any idea?
Regards
Sujatha
On Tue, Jan 22, 2013 at 9:33 PM, Markus Jelsma
wrote:
> Hi,
>
> You've likely got some non-character code points in your data and they
> need to be st
Hello,
I have a single instance of Solr 3.6 with autoCommit enabled
(maxDocs=1000, maxTime=30). There are ~50k documents indexed. I'm
having an issue with document addition. The issue is as follows:
every time, adding the first document (with commit=False) after a commit
takes a lot more time
Can anyone help me out of the confusion? Many thanks.
On 2013-01-22 23:20:12, SuoNayi wrote:
>Hi,Erick,thanks for your detailed explanation.
>
>The collecting shard combines the 24060 ID/score pairs into a master list and
>then
>how it to choose the right 20 docs? It depends on what conditions?
>I a
Thanks for pointing this out to me Robert! I have been using the default
highlighting component but I thought I'd try out PostingsHighlighter having
read about it. I guess I will have to change my schema to work around this
then.
On 22 January 2013 17:42, Robert Muir wrote:
> On Tue, Jan 22, 201
On Tue, Jan 22, 2013 at 12:23 PM, Meng Muk
wrote:
> If I set the field type to "text_en" however it works, I'm guessing
> something in the way the text is being analyzed is causing this exception
> to appear? Is there a limitation in how storeOffsetsWithPositions should be
> used?
>
IndexWriter
Sorry I don't have a definitive answer right now, especially with regard to
the spaces - other than that they are not permitted. But a couple of
things...
1. The "!" is only after the "{".
2. The qf and pf parameters need to be in quotes, I think:
qf='title,description'
3. defType is done via
Same here - I've seen some document caches that were huge and highly
utilized. Check out the screenshot of the SPM for Solr dashboard that
shows pretty high hit rates on all caches. I've circled the parts to look
at. ML manager may strip the attachment, of course. :)
In addition to multiple in-
Hi,
Btw. we tried this, confirmed it works, and may write it up on
http://blog.sematext.com .
Thanks Mark.
Otis
--
Solr & ElasticSearch Support
http://sematext.com/
On Thu, Jan 17, 2013 at 12:17 AM, Mark Miller wrote:
>
> On Jan 15, 2013, at 10:59 AM, Otis Gospodnetic
> wrote:
>
> > Hi,
Thanks. One quick question: can I control the way the result set of the query is
shown? I mean, if I want displayName to be shown first and then the id and
then manufacturer and model, is there any way I can do that?
--
View this message in context:
http://lucene.472066.n3.nabble.com/solr-query-tp4035
Hi Phil,
Have a look at http://wiki.apache.org/solr/HowToContribute and thank you in
advance! :)
Otis
--
Solr & ElasticSearch Support
http://sematext.com/
On Fri, Jan 18, 2013 at 5:41 AM, Phil Hoy wrote:
> Hi,
>
> I would like to experiment with some custom load balancers to help with
> qu
Hi,
I'm using Solr 4.1RC at the moment and have been trying out the
PostingsHighlighter feature. I have a schema that is based on the provided
sample schema.xml so the fieldTypes that I am using are exactly the same.
This is the field that I have declared.
If I set the value to: My name is Thoma
Thank you very much David,
your answer is really full of interesting inspiration. To be honest, many of
the spatial concepts you were writing about are unknown or obscure to me, I
was only aware of the !geofilt filter with a "distance" parameter, but I
wasn't aware, for example, that I could index
On 1/22/2013 9:54 AM, hassancrowdc wrote:
so this will be fixed by changing type of the fields and adding analyzers?
On Tue, Jan 22, 2013 at 11:54 AM, hassan altaf wrote:
ok, i think it is phrase query problem.
Probably. I can't say for sure, but everything I've learned so far says
that wo
Thanks!
On Tue, Jan 22, 2013 at 8:59 AM, Otis Gospodnetic
wrote:
> Hi,
>
> When you run your indexing app make sure you treat what you send to Solr as
> UTF-8.
> Use -Dfile.encoding=UTF8 -Dclient.encoding.override=UTF-8 to the Java
> command line.
>
> Otis
> --
> Solr & ElasticSearch Support
> ht
Hi,
When you run your indexing app make sure you treat what you send to Solr as
UTF-8.
Use -Dfile.encoding=UTF8 -Dclient.encoding.override=UTF-8 to the Java
command line.
Otis
--
Solr & ElasticSearch Support
http://sematext.com/
On Mon, Jan 21, 2013 at 3:06 PM, Jack Park wrote:
> Here is a
ok, i think it is phrase query problem.
On Tue, Jan 22, 2013 at 11:40 AM, Shawn Heisey-4 [via Lucene] <
ml-node+s472066n4035371...@n3.nabble.com> wrote:
> On 1/22/2013 8:53 AM, hassancrowdc wrote:
> > ok. Coming to my original question: problem is only manufacturer field
> is
> > working. with t
Hi david,
I think (s)he is simply using HBase for storing the actual data. So after
getting IDs from Solr (s)he will use them as keys to look up data in HBase.
I think that is all.
Otis
--
Solr & ElasticSearch Support
http://sematext.com/
On Mon, Jan 21, 2013 at 1:46 PM, David Smiley (@MIT
Javier Molina wrote
> This very wide rectangle will cause an OutOfMemoryError
>
> -180 3 180 3.016668
>
> While this one, slightly taller will work fine.
>
> -180 3 180 3.5
Due to the bug, the accuracy computing algorithm believes the width for both
of these is 0. That algorithm also l
On 1/22/2013 8:53 AM, hassancrowdc wrote:
ok. Coming to my original question: problem is only manufacturer field is
working. with the same code and everything, if i use manufacturer field in
query it gives me results but not with model or displayName etc. what
should i do?
When you search by ma
Hi Otis,
Yes, it's reliable. You can safely move an entire data directory to another
machine or another cluster (sharing conf) using scp, you'll then also copy over
the index.properties file. You could also rename the directory from
index./ to just index/ and remove the index.properties file an
Thanks Markus. Yes, I'm after the actual, physical directory on the local
FS (local to wherever the shard lives).
Is this information in index.properties reliable? Can't shards be moved
around in the cluster, say when a node that had some shard fails? When
this happens, is the new index directo
January 2013, Apache Solr™ 4.1 available
The Lucene PMC is pleased to announce the release of Apache Solr 4.1.
Solr is the popular, blazing fast, open source NoSQL search platform
from the Apache Lucene project. Its major features include powerful
full-text search, hit highlighting, faceted searc
hi,
it is probably correct to revisit your design/requirements, but if you
still find you need it, then there may be a different way.
DIH is using a writer to commit documents, you can detect errors inside
these and try to recover - ie. in some situations, you want to commit,
instead of calling rol
Thanks, I'll check that out.
Turns out our problem was we had omitTermFreqAndPositions true but were
running queries like "puppy dog" which, I would imagine, require position.
On Mon, Jan 21, 2013 at 9:22 PM, Gopal Patwa wrote:
> one thing I noticed in solrconfig xml that it set to use Lucene
Hi,
You've likely got some non-character code points in your data and they need to
be stripped.
http://unicode.org/cldr/utility/list-unicodeset.jsp?a=[:Noncharacter_Code_Point=True:]
See the patch for NUTCH-1016 for an example on how to strip them. It's easily
ported to other languages.
https:/
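
A port of that idea might look like the sketch below. It is not the NUTCH-1016 patch itself; it only assumes the Unicode definition of noncharacters (U+FDD0..U+FDEF plus the last two code points of every plane), and the function names are made up.

```python
def is_noncharacter(code_point):
    """True for U+FDD0..U+FDEF and for U+xxFFFE / U+xxFFFF in any plane."""
    return 0xFDD0 <= code_point <= 0xFDEF or (code_point & 0xFFFE) == 0xFFFE


def strip_noncharacters(text):
    """Drop noncharacter code points; all other characters pass through untouched."""
    return "".join(ch for ch in text if not is_noncharacter(ord(ch)))


dirty = "za\uffff\u017c\u00f3\u0142\ufdd0\u0107"
print(strip_noncharacters(dirty))
```

Running the indexed text through a filter like this before building the update XML keeps legitimate non-ASCII characters and removes only the code points that XML parsers commonly reject.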
On 1/22/2013 8:33 AM, hassancrowdc wrote:
I thought that was the problem in the beginning, but I have tried Kobo and
it does not give me a result either (numFound=0). If you want, I can send you the XML of
the result set. Can you also explain how I can make it flexible so that it takes
care of upper/lowercase or any oth
Hi Otis,
I assume you're talking about the actual index directory? This is recorded in
the index.properties file in the dataDir.
Cheers,
-Original message-
> From:Otis Gospodnetic
> Sent: Tue 22-Jan-2013 16:39
> To: solr-user@lucene.apache.org
> Subject: Shard index dir paths in Clu
I thought that was the problem in the beginning, but I have tried Kobo and
it does not give me a result either (numFound=0). If you want, I can send you the XML of
the result set. Can you also explain how I can make it flexible so that it takes
care of upper/lowercase or any other character? (I can't change the type of
fi
Hello,
We are facing some problems when indexing with Solr 4.0.0 with more than one
server node and we can't find a way to solve them.
We have 2 nodes of Solr Cloud instances.
They are running in a Zookeeper ensemble (3.4.4 version) with 3 servers
(another application is deployed on the third se
On 1/22/2013 8:16 AM, hassancrowdc wrote:
0
1
all
true
true
kobo
model
dismax
Based on what I can see, I am thinking it didn't match because what's in
your index is probably Kobo, and you are searching for kobo. The type
on all your fields except id is string, which is defined as StrField
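
The difference can be illustrated with a toy comparison. This is only a stand-in for the idea, not how Solr matches terms internally: it assumes a string field stores the value as one untouched token, and that a text field lowercases at both index and query time.

```python
def string_field_match(indexed_value, query_term):
    """Untokenized string field: the whole value must match exactly."""
    return indexed_value == query_term


def lowercased_match(indexed_value, query_term):
    """Rough stand-in for a field whose index and query analyzers both lowercase."""
    return indexed_value.lower() == query_term.lower()


print(string_field_match("Kobo", "kobo"))   # exact comparison, case matters
print(lowercased_match("Kobo", "kobo"))     # case is normalized on both sides
```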
Hi Erick, thanks for your detailed explanation.
The collecting shard combines the 24060 ID/score pairs into a master list, and
then how does it choose the right 20 docs? On what conditions does it depend?
I assume the collecting shard sorts these docs by score and the top 20 docs with
higher
scores are
Schema:
Ah, your fieldNorm is different. Are you sure firstname_search has exactly the
same value and, more importantly, length? I cannot seem to remember an issue that
encodes norms differently between 3.x and 4.x but I'm likely wrong ;)
-Original message-
> From:roySolr
> Sent: Tue 22-Jan-2013
I'm not sure why you have this problem. I use DIH 1.4.1 in production with
Jboss 5 (based on Tomcat) and seldom restart the JVMs and haven't experienced
anything like this. As for the warnings with ThreadLocals, I doubt these are
causing a severe memory leak: in 1.4.1, the DataImporter class
Hello Shawn,
Thanks for the help:
Indented format:
*SOLR4*
3.3243241 = (MATCH) sum of:
0.20717455 = (MATCH) max plus 1.0 times others of:
0.19920631 = (MATCH) weight(firstname_search:g^50.0 in 783453)
[DefaultSimilarity], result of:
0.19920631 = score(doc=783453,freq=1.0 = termFreq
Good morning,
I can't seem to figure out how to load this class
Can someone please point me in the right direction?
Thank you,
--
View this message in context:
http://lucene.472066.n3.nabble.com/SolrException-Error-loading-class-org-apache-solr-response-transform-EditorialMarkerFactory-tp40352
On 1/22/2013 7:32 AM, hassancrowdc wrote:
Hi,
I have indexed data from a MySQL database into Solr. Indexing is successful and
everything is working OK, but when I query through the fields of my table I
can only query one field; none of the rest give me any
result set (numFound comes out to be 0) an
Hi,
I have indexed data from a MySQL database into Solr. Indexing is successful and
everything is working OK, but when I query through the fields of my table I
can only query one field; none of the rest give me any
result set (numFound comes out to be 0). Any idea how I can fix it or where
should I loo
On 1/22/2013 2:07 AM, roySolr wrote:
You want that format indented? How/where can I get that?
If you are using the query UI in Solr 4, just put a check in the box
that says "indent" ... otherwise just add &indent=true to the query URL
you send.
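
As a small illustration, the parameter can be added when the query URL is built. The host, port and core name below are assumptions (a local example install), and select_url is a made-up helper; only the indent and debugQuery parameters come from this thread.

```python
from urllib.parse import urlencode


def select_url(base, **params):
    """Build a /select URL with properly escaped query parameters."""
    return f"{base}/select?{urlencode(params)}"


url = select_url(
    "http://localhost:8983/solr/collection1",   # assumed local example core
    q="firstname_search:g",
    debugQuery="true",
    indent="true",
)
print(url)
```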
If you use the Solr 4 UI, the results will be
On 21 January 2013 17:06, ashimbose wrote:
[...]
> Here I used two data config
> 1. data_conf1.xml
> 2. data_conf2.xml
[...]
Your configuration looks fine.
> Any one of them running fine at a single instant. Means,
> If I run first dataimport, it will successfully index, if after that I run
> da
Sorry, my misunderstanding, Upayavira.
I'm actually talking about stopping some Solr cores from loading up.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Block-creation-of-new-instances-in-Solr-s-startup-tp4035311p4035321.html
Sent from the Solr - User mailing list archive a
I'm not clear what you are referring to. Are you wanting to block the
startup of multiple webapps in your container, or to stop some solr
cores (indexes) loading up?
Upayavira
On Tue, Jan 22, 2013, at 12:40 PM, Gustav wrote:
> Hey guys, i need help with some really tricky request...
> Is it possi
I have tried and failed. So to the best of my knowledge, no it isn't
possible to do such faceting.
The only way to do it would be to prepare 'range' fields in your index
at index time, such as:
range-hundreds: 100-200
range-hundreds: 201-300
range-hundreds: 301-400
Then pivot facet on your new ra
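
A minimal sketch of preparing such a field at index time follows. The field names price and product_category are taken from the question in this thread; the helper names are made up, and the label uses half-open buckets such as 100-200 (an assumption, since the example labels above do not pin down the boundaries).

```python
import math


def range_bucket(value, gap=100):
    """Return a label such as '100-200' for the half-open range [low, low + gap)."""
    low = math.floor(value / gap) * gap
    return f"{low}-{low + gap}"


def add_range_field(doc, source="price", target="range-hundreds", gap=100):
    """Copy the document and add the precomputed range label before indexing."""
    enriched = dict(doc)
    enriched[target] = range_bucket(doc[source], gap)
    return enriched


doc = {"id": "1", "product_category": "A", "price": 149.99}
print(add_range_field(doc))
```

Once every document carries the label, a query can pivot on product_category and range-hundreds like on any other pair of fields.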
Hi folks,
Is it possible to pivot by ranges in a single query?
Something like this:
select?q=*:*&facet=true&facet.pivot=product_category,price[0-1000, gap 100]
I would expect output like this:
category A: range 0-100: 3, range 100-200: 2, range 200-300: 5
category B: range 0-100: 11, range 100
Hi All,
Pardon my poor English skill.
I'm new to Solr and testing DIH delta-import with solr-4.0.
When testing delta-import, I've found a scenario that some updates are
missing.
This scenario follows the steps below.
No. db-update delta-import
-
bq: does Solr need load all the docs into RAM to calculate score and order
You're very close. The query (and this is just like 3.x) is sent to
each shard. Let's say your page size is 20 (the &rows=20)
Each node will need to keep a list of 8020 documents (400 * 20) + 20,
really the ID and score, c
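
The arithmetic can be traced with a toy merge. This sketch assumes 3 shards (inferred from 24060 / 8020 elsewhere in this thread), invented document IDs with distinct scores so no tie-breaking is needed, and hypothetical helper names; it is not Solr's actual merge code.

```python
import heapq
from itertools import chain


def shard_candidates(shard_docs, start, rows):
    """Each shard returns its own top (start + rows) (doc_id, score) pairs."""
    return heapq.nlargest(start + rows, shard_docs, key=lambda pair: pair[1])


def merged_page(per_shard_lists, start, rows):
    """The collecting node sorts all candidates by score and slices out the page."""
    candidates = list(chain.from_iterable(per_shard_lists))
    ranked = sorted(candidates, key=lambda pair: pair[1], reverse=True)
    return candidates, ranked[start:start + rows]


NUM_SHARDS = 3             # assumption: 24060 / 8020
DOCS_PER_SHARD = 9000      # hypothetical index size per shard
START, ROWS = 400 * 20, 20

total = NUM_SHARDS * DOCS_PER_SHARD
# Document k lives on shard k % NUM_SHARDS and scores higher the smaller k is.
shards = [
    [(k, float(total - k)) for k in range(s, total, NUM_SHARDS)]
    for s in range(NUM_SHARDS)
]
lists = [shard_candidates(docs, START, ROWS) for docs in shards]
candidates, page = merged_page(lists, START, ROWS)
print(len(lists[0]), len(candidates), len(page))
```

Every shard has to return start + rows entries because, in the worst case, all of the requested page could come from a single shard.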
Hi,
we are experiencing problems with the DataImportHandler in Solr 1.4.0. We are
reading datasets from MySQL and this worked fine before, but it seems the
DataImportHandler ran into an infinite loop and is requesting millions of data
rows from MySQL. This resulted in a High CPU Load on our DB
Hi,
We are on Solr 3.6.1 on Tomcat 5.5.25. Indexing Polish content
throws the following error:
*Caused by: com.ctc.wstx.exc.WstxIOException: Invalid UTF-8 middle byte
0x77 (at char #166, byte #127)*
at com.ctc.wstx.sr.StreamScanner.throwFromIOE(StreamScanner.java:708)
at com.ctc.wstx.s
q=table&bf=product(scale(product(query({!v='color'}),1),0,1),100)
This worked! Now, looking at the debug query, I see that the nested query is using
the default field and default operator from schema.xml. How could I pass params such as
defType or qf to the nested query? I would like to do something like
this
Yes it worked like you explained.
This is my new query class:
public class QObject extends org.apache.lucene.search.Query {
Query q;
public QObject(Query q) {
this.q = q;
}
@Override
public void extractTerms(Set terms) {
q.ex