Hi
Due to what we have seen in recent tests I got in doubt how Solr search
is actually supposed to behave
* Searching with
"distrib=true&q=*:*&rows=10&collection=x,y,z&sort=timestamp asc"
** Is Solr supposed to return the 10 documents with the lowest timestamp
across all documents in all sli
It actually is Beta that we're working with.
/Martin
On Mon, Aug 27, 2012 at 10:38 PM, Martin Koch wrote:
> (I'm working with Raghav on this): We've got several parallel workers that
> add documents in batches of 16 through pysolr, and using commitWithin at 60
> seconds when the commit causes s
One other thing i forgot to mention is - multicore setup we have requires us
to be able to add cores dynamically and i am not sure if thats supported by
http solr out-of-the-box.
-
Thanks
-K'Rider
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-Index-Concurrency-Is-
Thanks for the Reply Lance.
>From your post my understanding is that Solr commiters are more focussed on
http solr than EmbeddedSolrServer and EmbeddedSolrServer may not be tested
for all features supported by http solr.
Said that, can you please tell if there is any justification/usecase for
usin
Hi, I'm working on a similar problem and was wondering if you were able to
figure out a way to get the desired results?
Thanks,
Chetna
: We have a pretty standard out-of-the-box solr/jetty setup. Using the web
: UI at /solr/admin/logging, for WARNING or SEVERE we get less logging, but
: none of CONFIG, FINE or FINEST result in any *more* logging than just at
: INFO.
The UI (and the underlying Solr API) lets you change the log l
We have Solr 3.6.1 running on Jetty (7.x) and using DIH to get data from
the MySQL database. On one of the environment the import always fails
with an exception: http://pastebin.com/tG28cHPe
It is a null pointer exception on connection being null. I've tested
that I can connect from the Solr s
: However, when I convert InputStream stream (inside parse function) to File,
: it seems that Solr is adding header and footer that contains Metadata so the
: file won't be converted properly.
...
It's not totally clear from your problem description, but i *think* you
are saying that y
: And I get problems when doing this (it is working fine when compiling the
: alpha): "ant generate-maven-artifacts". The output is:
...
: resolve:
: [ivy:retrieve] You probably access the destination server through a proxy
: server that is not well configured.
: [ivy:retrieve] You probab
Smart Folks:
I use JDBC to produce simple XML entities such as this one:
AWARDTYPE
0
31
1
awardtypes::31:1
The XML entities are stored in file and loaded by the
FileListEntityProcessor.
In this case, the "movie_id" element has a value of zero because the JDBC
getString("movie_id") m
: Ok, I'll take your suggestion, but I would still be really happy if the
: wildcard searches behaved a little more intelligent (body:* not looking for
: everything in the body). More like when you do "q=*:*" it doesn't really
: search for everything in every field.
If you can suggest an algorith
Hi,
I wrote a custom fieldtype that need to read a configuration file in the
conf directory of the core and also get the absolute path of the conf
directory
In solr 4 alpha, my code was something like :
import org.apache.solr.core.SolrResourceLoader;
...
public class MultilingualField exte
Hello all,
I'm working on implementing a solution in a very high index rate, lower query
rate project. We have a very small record size, 2 small strings, 6 longs, 7
ints, and 2 dates, indexing on 8 of those fields. We need to be able to
maintain an injest rate of 50k records/sec average. Thr
(I'm working with Raghav on this): We've got several parallel workers that
add documents in batches of 16 through pysolr, and using commitWithin at 60
seconds when the commit causes solr to freeze; if the commit is only 5
seconds, then everything seems to work fine. In both cases, throughput is
aro
If the stemming is too bad, just remove that token filter from the field
type. But, you will have to re-index whenever you make such a drastic change
(the terms in the index will be different.)
-- Jack Krupansky
-Original Message-
From: mattia.martine...@gmail.com
Sent: Monday, Augus
For the record, this was caused by a rookie mistake: FD exhaustion.
--Casey
On 8/24/12 11:24 AM, Casey Callendrello wrote:
> Hi there,
> I have been doing some load testing with Solr 4 beta (now, trunk). My
> configuration is fairly simple - two servers, replicating via
> SolrCloud. SolrCloud is
You can use CustomScoreQuery to combine a scalar field value (e.g. like the
amount of the paid placement) together with the textual relevancy. You can
combine things anyway you want, e.g.
finalScore = textualScore + 1000.0 * scalarValue.
Or whatever makes sense. It sounds like you want some ki
Thanks for your help. Using the regex trick actually worked and this is the
direction we are taking now, but I think I'll open an enhancement request as
well. I'll try to see if we can improve that code locally for our product first.
Thanks,
Yoni
-Original Message-
From: Jack Krupansky [
You need to tell solr where your .jar file is located. Something like this
will help:
On Mon, Aug 27, 2012 at 6:19 AM, awb3667 wrote:
> Yes it does.
>
> I downloaded the file 'sqljdbc4.jar' from Microsoft. I have this same jar
> working with 3.6.1.
>
> Thanks.
> -Adam
>
>
>
> --
> View this
I am using Solr 3.5 and Jetty 8.12
I need to pull out huge query results at a time(for example, 1 million
documents, probably a couple gigabytes size) and my machine is about 64 G
memory.
I use the java bin and SolrJ as my client. And I use a Servelt to provide a
query down service for the end user
2012/8/27 Mike Schultz :
> Can you include the entire text for only the titolo field?
The entire text for the titolo field is "Moschea in viale Trento,
partono i lavori".
I tried to change the type of the titolo field from text to textgen,
and now it does not match.
I think it is a stemming prob
Hello,
I downloaded the beta src. I did:
cd apache-solr-4.0.0-BETA/
ant get-maven-poms
ant ivy-bootstrap
And I get problems when doing this (it is working fine when compiling the
alpha): "ant generate-maven-artifacts". The output is:
jar-test-framework:
ivy-availability-check:
ivy-fail:
ivy-co
Can you include the entire text for only the titolo field?
1.0 = tf(termFreq(titolo:trent)=1) means the index contains one hit for
'trent' for that field, that doc.
Mike
--
View this message in context:
http://lucene.472066.n3.nabble.com/Understanding-SOLR-search-results-tp4003480p4003540.h
Try removing fields one at a time from "qf" and see how far you have to go
to get the numbers to add up. The offending field will offer a clue.
Meanwhile, what are the details of the field types for these fields?
Also, look at the id field for all 36 docs from the first query and add that
id c
I surmise that you are using the "text_it" field type, or something similar.
It has:
When I enter "trento" into the Solr admin analysis page that last filter
transforms "trento" into "trent", just as we see in the query explain.
So, indeed, this looks like a stemming anomaly.
I see this co
How are you adding the docs? In batch, streaming, a doc at a time?
Any chance you can try with the Beta?
On Mon, Aug 27, 2012 at 9:35 AM, Raghav Karol wrote:
> Hello *,
>
> We are using SolrClould 4.0 - Alpha and have a 4 machine setup.
>
> Machine 1 - 16 Solr cores - Shard 1 - 16
> Machine 2 -
> Maybe you have a synonym in the title field? Or maybe some stemming anomaly?
The complete title is "Moschea in viale Trento, partono i lavori", so
"Trent" should be a substring of the word "Trento".
But if I search for "Mos" or "lavo", I don't have this result, so I
don't understand why "Trent"
It can't hurt. I mean somebody needs to straighten things out. The doc/wiki
references a page/class that doesn't exist.
-- Jack Krupansky
-Original Message-
From: Jones, Dan
Sent: Monday, August 27, 2012 12:19 PM
To: solr-user@lucene.apache.org
Subject: RE: RecursivePrefixTreeStrategy
I put a logging.properties into solr/WEB-INF/classes, but I still not see
any logs.
On Mon, Aug 27, 2012 at 11:56 AM, Chantal Ackermann <
c.ackerm...@it-agenten.com> wrote:
>
> Drop the logging.properties file into the solr.war at WEB-INF/classes .
>
> See here:
> http://lucidworks.lucidimaginati
Should I post this message for the Solr Dev list?
This communication (including all attachments) is intended solely for
the use of the person(s) to whom it is addressed and should be treated
as a confidential AAA communication. If you are not the intended
recipient, any use, distribution, prin
Drop the logging.properties file into the solr.war at WEB-INF/classes .
See here:
http://lucidworks.lucidimagination.com/display/solr/Configuring+Logging
Section "Tomcat Logging Settings"
Cheers,
Chantal
Am 27.08.2012 um 16:43 schrieb Nicholas Ding:
> Hello,
>
> I've deployed Solr 4 on Tomca
You can also have a secondary text field which uses a specialized "stop"
filter to remove the common terms.
-- Jack Krupansky
-Original Message-
From: Ramzi Alqrainy
Sent: Monday, August 27, 2012 11:45 AM
To: solr-user@lucene.apache.org
Subject: Re: Solr Partial and Exact Matching
Bu
But I don't want to add these words as static because I can't restrict or
inclose all words, I need to play on weight only.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-Partial-and-Exact-Matching-tp4003478p4003508.html
Sent from the Solr - User mailing list archive a
You could have a second keyword field with a regex filter to drop out common
terms from restaurant names, such as "restaurant", "pizza", (even "hut"),
etc.
-- Jack Krupansky
-Original Message-
From: Ramzi Alqrainy
Sent: Monday, August 27, 2012 11:29 AM
To: solr-user@lucene.apache.org
Thanks for your response Erick.
Your explanation seems to make sense for the commit count. But I guess the
UI needs to be fixed.
Regarding the performance, I went through your blog (nicely written btw
(and good links to other interesting blogs too)). I didn't realize that
everything that is index
Many Thanks for your kind email. I expect that I have reached the information
in some other way. I will explain my point again with different scenario.
I have a big data of places. let's take sample of restaurants places under
category restaurants. Some of places contains "Restaurant" in their nam
I solved the problem.
I added the parameter sharedLib="lib" in $SOLR_HOME/solr.xml () and moved all jars into
$TOMCAT_HOME/webapps/solr/WEB-INF/lib to $SOLR_HOME/lib
This information could be included in the wiki Solr / Tomcat.
Claudio Ranieri | Especialista Sistemas de Busca | S.A O Estado de S.
Ok, I have spent way too long looking for this.
I guess I'll never figure out how to index a MULTIPOLYGON field from a
schema.xml :-(
-Original Message-
From: Jones, Dan [mailto:dajo...@national.aaa.com]
Sent: Saturday, August 25, 2012 6:34 PM
To: solr-user@lucene.apache.org
Subject: RE
Maybe you have a synonym in the title field? Or maybe some stemming anomaly?
Try using the Solr admin analyzer and enter the query text for the title
field and see how it analyzes.
In any case, the explain is clearing saying that titolo:trent was a match.
Regardless of what source text you ga
You could index place name as both a text field and a separate keyword
string field. The former would match on any word. The latter would match
only on exact match (or wildcard) and could be boosted as you have indicated
since it would be an exact match.
You can just do a copyField from one to
Right, examine the parsed query carefully and
you'll see that the semantics are much different.
At a guess, "auto" appears in more than one
field. So your first query is saying
"return me any documents for which "auto" appears
in any of the fields (text, media_transcript.).
Your NOT clause is
Please allow me to add another document with another example.
D3 :
Place Title : Pizza Jons
Review : I love it.
When the user search for pizza [General Query] we need to decrease the rank
for place title and gets
Little Caesar
Pizza hut
Pizza Jons
When we search for Jons [Search For Name]
Pizz
Hello *,
We are using SolrClould 4.0 - Alpha and have a 4 machine setup.
Machine 1 - 16 Solr cores - Shard 1 - 16
Machine 2 - 16 Solr cores - Shard 17 - 32
Machine 3 - 16 Solr cores - Replica 1 - 16
Machine 4 - 16 Solr cores - Replice 17 - 32
Index at 500 docs/sec and committing every 60 seco
Can anyone help me?
-Mensagem original-
De: Claudio Ranieri [mailto:claudio.rani...@estadao.com]
Enviada em: sexta-feira, 24 de agosto de 2012 11:40
Para: solr-user@lucene.apache.org
Assunto: RES: Problem to start solr-4.0.0-BETA with tomcat-6.0.20
Hi Vadim,
No, I used the entire apache
The autocommits are about what I'd expect. 17 hours
== 102 ten minute blocks, which is roughly your
115 autocommits. I'm _guessing_ that the total
commits are a combination of soft and hard. You'll
have 20,400 soft commits in that time frame, so this
works as a rough estimate
And SolrJ doesn't
Yes it does.
I downloaded the file 'sqljdbc4.jar' from Microsoft. I have this same jar
working with 3.6.1.
Thanks.
-Adam
--
View this message in context:
http://lucene.472066.n3.nabble.com/Data-Import-Handler-Could-not-load-driver-com-microsoft-sqlserver-jdbc-SQLServerDriver-SOLR-4-Beta-tp40
Hi.
I get some strange results for one query from SOLR.
This is an example query:
(titolo:trenti OR sommario:trenti OR occhiello:trenti OR testo:trenti)
In the results I have this document:
6.5818048
503af94e0c342
IL PROGETTO.. (no word "tren" in
"occhiello" field)
C'è la conces
I want to ask you about partial and exact matching. I have documents for
Places and we need to get exact match if the user search for place name and
get the results based on other factors (not place name) if the query is
general or partial.
This is an example to explain more details.
If we supp
OK, let's back up. _What_ is your memory error? Where
is it coming from? Solr sorting or your custom code?
How many documents are you returning to your custom code
to sort, all of them? Some subset?
Consider re-thinking how you're doing this. In Solr 4 Beta,
there's the possibility of storing all
Why do you need the django_id as a string in the
first place? Strings are much less efficient than
numeric types in s many ways that it seems
you'd be better off making it a numeric type..
But the canonical way to do what you want is to
just left-pad the string version with zeros, then
the num
Unfortunately, no it is not.
Michael Della Bitta
Appinions | 18 East 41st St., Suite 1806 | New York, NY 10017
www.appinions.com
Where Influence Isn’t a Game
On Mon, Aug 27, 2012 at 8:53 AM, bsargurunathan
wrote:
> Hi Rafal,
>
> Thanks for your
No, it's 4.0
-Original message-
> From:bsargurunathan
> Sent: Mon 27-Aug-2012 14:57
> To: solr-user@lucene.apache.org
> Subject: Re: Solr cloud Implementation
>
> Hi Rafal,
>
> Thanks for your advice, I checked in that url,
> But they asked to download Solr 4.0 Beta or Greater,
> Bu
Hi Rafal,
Thanks for your advice, I checked in that url,
But they asked to download Solr 4.0 Beta or Greater,
But I am using Solr-3.5. Is it possible to implement the Solr Cloud using
Solr-3.5?
Thanks,
Guru
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-cloud-Implem
On Mon, 2012-08-27 at 14:29 +0200, dhaivat dave wrote:
> I am getting an error while indexing data to solr. i am using solrj apis to
> index the document and using the xml request handler to index document. i
> am getting an error *org.apache.solr.common.SolrException: Unexpected
> character 'F' (c
Hello!
Please look at http://wiki.apache.org/solr/SolrCloud/
You'll find answers to your questions there.
--
Regards,
Rafał Kuć
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch
> Hi Guys,
> I am very new for Solr Cloud,
> Can you guys advice me the following questio
Hi Guys,
I am very new for Solr Cloud,
Can you guys advice me the following questions?
Basic requirement for Implementing the Solr Cloud?(For ex: Solr Version,
Supporting OS..)
Right now I am using Solr 3.5, is it possible to implement the Solr Cloud in
that?
If I can? Please share any docs or ref
Hello Everyone ,
I am getting an error while indexing data to solr. i am using solrj apis to
index the document and using the xml request handler to index document. i
am getting an error *org.apache.solr.common.SolrException: Unexpected
character 'F' (code 70) in prolog; expected '<' at [row,col
Highlighting does work on dynamic fields. I verified that there is no bug
there.
It is glob that does not work the way you used it that is the issue. Whether
that is a sloppy doc description that needs to get cleaned up or a bug in
the code is an open question. I would suggest that the doc nee
View https://issues.apache.org/jira/browse/SOLR-3761 SOLR-3761
Regards
--
View this message in context:
http://lucene.472066.n3.nabble.com/flush-delete-all-document-solr-4-Beta-tp4003434p4003452.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hey Erick, Thanks to Jack I have a working solution to issue no 2. For 1 I'm
looking for a solution.
Please help me for that.
debug Query are as below:
Search for *auto*: 36 results: parsedquery:
"+DisjunctionMaxQuery((text:auto^0.2 | media_transcript:auto^0.4 |
title:auto^0.9 | keywords:auto^0.7
Hi,
Thanks for your reply but It does not work.
It's weird, because with the solr 3.6 and 3.6.1 It's work .
Best regards
--
View this message in context:
http://lucene.472066.n3.nabble.com/flush-delete-all-document-solr-4-Beta-tp4003434p4003444.html
Sent from the Solr - User mailing list ar
Hello,
I'm trying to write custom parser and add it to Tika, but I'm not very
successful right now.
As I have a binary file that converts custom file type into XML file,
I'm converting custom file to XML file inside my custom parser, then call
XMLParser inside the parser.
However, when I conver
your docs are marked as deleted.
you should optimize after commit, then they will be really deleted.
it's easier and faster to stop your jetty/tomcat, drop your index
directory and start your servlet container...
when it's not possible, then optimize.
regards
Vadim
2012/8/27 Jamel ESSOUSSI :
> Hi
As explained below, the servers are started with a minimum heap size of 3G
and max heap size of 4G, though I've never seen the heap grow more than 3G.
On Mon, Aug 27, 2012 at 3:02 AM, Lance Norskog wrote:
> How much memory is allocated? There is a feature in modern Unix
> systems called 'Large P
Hi Kiran,
Thank you so much. Its working now
--
View this message in context:
http://lucene.472066.n3.nabble.com/Is-SpellCheck-Case-Sensitive-in-Solr3-6-1-tp4003074p4003435.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi,
I should flush solr (delete all existing documents)
--> for doing this, I have the following code:
HttpSolrServer server = HttpSolrServer(url);
server.setSoTimeout(1000);
server.setConnectionTimeout(100);
server.setDefaultMaxConnectionsPerHost(100);
server.setMaxTotalConnections(100);
serve
Yair,
you can create it easily, it will be used.
Paul
Le 27 août 2012 à 09:16, yair even-zohar a écrit :
> I'm newbie with Tomcat configurations and am looking to reduce the logging
> level for Solr
> Where should I put the logging.properties file and how to point Tomcat to use
> it?
>
>
@Erik: the cross product is about 300k records. Since we use Solr
1.4.0 I try other options first.
@Lance: sounds interesting, I have to check this out
In the meanwhile I tried to use sub-entities to solve my problem. This
means I started with the "employer" as entity and defined "services"
and "
Hi, did you find where is problem?
Dne 23.8.2012 16:40, lenz...@gfi.ihk.de napsal(a):
Hello,
I have a problem using grouped queries and the 'group.ngroups' parameter.
When I run the following request
/select?q=&group=true&group.field=personId&group.ngroups=true&wt=xml
the response looks like
Thanks for this information. Can you please send me to the right place in the
code, I'll check it out.
Regardless, it sounds like a bug to me, highlighting should work on dynamic
fields too. Should I open a bug for this?
Thanks,
Yoni
-Original Message-
From: Jack Krupansky [mailto:j...
I'm newbie with Tomcat configurations and am looking to reduce the logging
level for Solr
Where should I put the logging.properties file and how to point Tomcat to use
it?
Tomcat suggest to put it under the WEB-INF/classes but there is no
WEB-INF/classes for solr-siren
Thanks
-Yair
71 matches
Mail list logo