Am 04.03.2014 07:21, schrieb Thomas Scheffler:
Am 27.02.2014 09:15, schrieb Shawn Heisey:
On 2/27/2014 12:49 AM, Thomas Scheffler wrote:
What problems have you seen with mixing 4.6.0 and 4.6.1? It's possible
that I'm completely ignorant here, but I have not heard of any.
Actually bug reports
Am 27.02.2014 09:15, schrieb Shawn Heisey:
On 2/27/2014 12:49 AM, Thomas Scheffler wrote:
What problems have you seen with mixing 4.6.0 and 4.6.1? It's possible
that I'm completely ignorant here, but I have not heard of any.
Actually bug reports arrive me that sound like
"Unknown type 19"
Am 03.03.2014 19:12, schrieb Smiley, David W.:
The main reference for this approach is here:
http://wiki.apache.org/solr/SpatialForTimeDurations
Hoss’s illustrations he developed for the meetup presentation are great.
However, there are bugs in the instruction — specifically it’s important
to s
My user name is SusheelKumar for solr wiki.
-Original Message-
From: Susheel Kumar [mailto:susheel.ku...@thedigitalgroup.net]
Sent: Monday, March 03, 2014 9:36 PM
To: solr-user@lucene.apache.org
Subject: Please add me to wiki contributors
Hi,
Can you please add me to wiki contributors.
Thanks Chirs, I found in our application code it was related to optimistic
concurrency failure.
On Mon, Mar 3, 2014 at 6:13 PM, Chris Hostetter wrote:
>
> : Subject: java.lang.Exception: Conflict with StreamingUpdateSolrServer
>
> the fact that you are using StreamingUpdateSolrServer isn't real
Hi,
We are looking to automate searches (name searches) & filter out the results
based on some scoring confidence. Any suggestions on what different approaches
we can use to pick only top closer matches and filter out rest of the results.
Thanks,
Susheel
Hi,
Can you please add me to wiki contributors. I wanted to add some stats on Linux
vs Windows we came across recently, CSV update handler examples, and also
wanted to add company name to public server page.
Thanks,
Susheel
So as there was no quick work around to this issue, we simply change the http
method from get to post, to avoid further problems which could be triggered
by user input too. though this violates the restful standards... at least we
have something running properly
-
Zeki ama calismiyor... Cali
: Subject: java.lang.Exception: Conflict with StreamingUpdateSolrServer
the fact that you are using StreamingUpdateSolrServer isn't really a
factor here -- what matters is the data you are sending to solr in the
updates...
: location=StreamingUpdateSolrServer line=162 Status for: null is 409
: I have a core "geo" (with an empty file core.properties inside) and
solrconfig.xml at
: /srv/solr/solr4.6.1/cores/geo/conf/solrconfig.xml
...
: But I get the error message:
: "Could not load config file /srv/solr/solr4.6.1/cores/geo/solrconfig.xml"
1) what does your solr.xml file look
would not breaking the FQs out by state be faster for warming up the fq
caches?
--
View this message in context:
http://lucene.472066.n3.nabble.com/is-it-possible-to-consolidate-filterquery-cache-strings-tp4121005p4121030.html
Sent from the Solr - User mailing list archive at Nabble.com.
File permissions? Malformed XML? Are there any other exceptions
earlier in the log? If you substitute that file with one from example
distribution, does it work?
Regards,
Alex.
Personal website: http://www.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is th
All of them work like a charm! Thanks, Chris.
On Mon, Mar 3, 2014 at 1:28 PM, Chris Hostetter wrote:
>
> : But this query does not work:
> :
> : q={!boost
> : b=if(exists(query({!v='user_type:ADMIN'})),10,1)}id:1&rows=1&fl=*,score
> : It gives an error like this:
>
> The problem is the way you a
: Would that work, and if so how would I know that the cache is being hit?
It should work -- filters are evaluated independently, so the fact that
you are using all of them in query query (vs all of them in individual
queries) won't change anything as far as the filterCache goes.
You can prov
Hello,
I'm sorry to repeat myself but I didn't manage to get out of the thread I
inadvertently slipped into.
My problem now is this:
I have a core "geo" (with an empty file core.properties inside) and
solrconfig.xml at
/srv/solr/solr4.6.1/cores/geo/conf/solrconfig.xml
following the hint from th
Thanks, Shawn. Right now my solr.solr.home is not being passed from the
java runtime
Lets say /mnt/solr/ is my solr root. I can add all jars to /mnt/solr/lib/
and use -Dsolr.solr.home=/mnt/solr/ , that should do it right?
Thanks
Nitin
On Mon, Mar 3, 2014 at 2:44 PM, Shawn Heisey wrote:
> On
note: by partitioning I mean that I have sharded the 120M docs into 9 Solr
partitions (each on a separate server)
--
View this message in context:
http://lucene.472066.n3.nabble.com/is-it-possible-to-consolidate-filterquery-cache-strings-tp4121005p4121012.html
Sent from the Solr - User mailing
Am 03.03.2014 um 22:43 schrieb Shawn Heisey:
> On 3/3/2014 9:02 AM, Thomas Fischer wrote:
>> The setting is
>> solr directories (I use different solr versions at the same time):
>> /srv/solr/solr4.6.1 is the solr home, in solr home is a file solr.xml of the
>> new "discovery type" (no cores), and
On 3/3/2014 3:30 PM, KNitin wrote:
A quick ping on this. To give more stats, I have 100's of collections on
every node. The time it takes for one collection to boot up /loadonStartup
is around 10-20 seconds ("and sometimes even 1 minute). I do not have any
query auto warming etc. On a per collect
A quick ping on this. To give more stats, I have 100's of collections on
every node. The time it takes for one collection to boot up /loadonStartup
is around 10-20 seconds ("and sometimes even 1 minute). I do not have any
query auto warming etc. On a per collection basis I load a bunch of
libraries
lets say I have a largish set of data (120M docs) and that I am partitioning
my data by groups of states (using the state codes)
Someone suggested that I could use the following format in my solrconfig.xml
when defining the filterqueries work:
*:*
State:AL
On 3/3/2014 9:02 AM, Thomas Fischer wrote:
The setting is
solr directories (I use different solr versions at the same time):
/srv/solr/solr4.6.1 is the solr home, in solr home is a file solr.xml of the new
"discovery type" (no cores), and inside the core directories are empty files
core.propert
ok i like the logic, you can do much more. i think this should do it for me:
(-organisations:["" TO *] -roles:["" TO *]) (+organisations:(150 42)
+roles:(174 72))
i want to use this in fq and i need to set the operator to OR. My q.op is AND
but I need OR in fq. I have read about ofq b
: But this query does not work:
:
: q={!boost
: b=if(exists(query({!v='user_type:ADMIN'})),10,1)}id:1&rows=1&fl=*,score
: It gives an error like this:
The problem is the way you are trying to nest queries inside of each other
w/o any sort of quoting -- the parser has no indication that the "b"
How frequently are you committing? Frequent commits can slow everything down.
--
View this message in context:
http://lucene.472066.n3.nabble.com/network-slows-when-solr-is-running-help-tp4120523p4120992.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi Michael,
I forgot to include what I did for one customer :
1) Using StatsComponent I get min and max values of the field (year)
2) Calculate "smart gap/range values" according to minimum and maximum.
3) Re-issue the same query (for thee second time) that includes a set of
facet.query.
Ahmet
If you just want to see which classes are occupying the most memory in a live JVM,you can do:jmap -permstat I don't think you can dump the contents of PERM space.Hope this helps,TriOn Mar 03, 2014, at 11:41 AM, KNitin wrote:Is there a way to dump the contents of permgen and look at which classes a
Hi,
Regarding "just a simple subtraction" you do it in indexer code or in a update
prcessor too. You can either modify original field or you can create an
additional one. Java-script could be used :
http://wiki.apache.org/solr/ScriptUpdateProcessor
Ahmet
On Monday, March 3, 2014 9:11 PM, Mic
What are supported types of the POJO objects that are sent to
SolrServer.addBean(obj)?
A quick glance of DocumentObjectBinder seems to suggest that
an arbitrary combination of an Collection, List, ArrayList, array ([]), Map,
Hashmap,
of primitive types, String and Date is supported, but I'm not
I'm working on a user name autocomplete feature, and am having some issues
with the way we are tokenizing user names.
We're using the StandardTokenizerFactory to tokenize user names, so
"foo-bar" gets split into two tokens. We take input from the user and use
it as a prefix to search on the user n
Hello,
I am trying to connect SOLR (tried 4.4 and 4.7) to kerberos enabled HDFS -
I am using Cloudera CDH 4.2.1
http://maven-repository.com/artifact/com.cloudera.cdh/cdh-root/4.2.1/pom_effective
the keytab and principal is valid (I tested it with flume as well as simple
hdfs cli)
did anobody su
Is there a way to dump the contents of permgen and look at which classes
are occupying the most memory in that?
- Nitin
On Mon, Mar 3, 2014 at 11:19 AM, KNitin wrote:
> Regarding PermGen: Yes we have a bunch of custom jars loaded in solrcloud
> (containing custom parsing, analyzers). But I hav
On 03.03.2014 19:58 Shawn Heisey wrote:
> There's already an issue in Jira.
>
> https://issues.apache.org/jira/browse/SOLR-1672
Thanks, this is of course the best solution. Only problem is that I use
a custom verson from a vendor (based on version 4.3) I want to enhance.
But perhaps they apply t
Regarding PermGen: Yes we have a bunch of custom jars loaded in solrcloud
(containing custom parsing, analyzers). But I haven't specifically enabled
any string interning. Does solr intern all strings in a collection by
default?
I agree with doc and Filter Query Cache. Query Result cache hits are
p
Hi Ahmet,
> There is no built in solution for this.
Yes, I know, that's why I would like the TokenFilterFactory
> Two workaround :
>
> 1) use facet.limit=-1 and invert the list (faceting response) at client side
>
> 2) use multiples facet.query
>a)facet.query=year:[2012 TO 2014]&facet.quer
On 3/3/2014 7:35 AM, Michael Lackhoff wrote:
If I understand the docs right, it is only possible to sort facets by
count or value in ascending order. Both variants are not very helpful
for year facets if I want the most recent years at the top (or appear at
all if I restrict the number of facet e
Thanks Tri,
I really appreciate the response. When I get some free time shortly I'll
start giving some of these a try and report back.
On Mon, Mar 3, 2014 at 12:42 PM, Tri Cao wrote:
> If it's really the interned strings, you could try upgrade JDK, as the
> newer HotSpot
> JVM puts interned st
Hi Michael,
Yes you are correct, oldest comes fist.
There is no built in solution for this.
Two workaround :
1) use facet.limit=-1 and invert the list (faceting response) at client side
2) use multiples facet.query
a)facet.query=year:[2012 TO 2014]&facet.query=year:[2010 TO 2012]
b)fac
The main reference for this approach is here:
http://wiki.apache.org/solr/SpatialForTimeDurations
Hoss’s illustrations he developed for the meetup presentation are great.
However, there are bugs in the instruction — specifically it’s important
to slightly buffer the query and choose an appropriat
Hi guys,
I'm on my way to solve it properly.
This is how my field looks like now:
I still have one case where I'm facing issues because in fact I want to
pres
HI Erick,
Thanks for the response.
On the wiki it states
config-file
Path to the file that defines query elevation. This file must exist in
$/conf/ or$/.
If the file exists in the /conf/ directory it will be loaded once at startup.
If it exists in the data directory, it will be reloaded for
Good afternoon,
I have this DIH:
https://redacted/";
processor="XPathEntityProcessor"
forEach="/rss/channel/item"
transformer="DateFormatTransformer,TemplateTransformer,RegexTransformer">
Thanks, Arun for sharing the idea on EdgeNGramFilter. In our case we are doing
search using automated process so may EdgeNGramFilter may not work. Wwe have
used NGramFilterFactory in the past but will look into it again.
For cases like Dave & David and other English names does anyone has idea
If it's really the interned strings, you could try upgrade JDK, as the newer HotSpotJVM puts interned strings in regular heap:http://www.oracle.com/technetwork/java/javase/jdk7-relnotes-418459.html(search for String.intern() in that release)I haven't got a chance to look into the new core auto disc
Thanks Greg. We optimize the master once a week (early in the day Sunday) and
we do not do a commit Sunday evening (the only evening of the week when we do
not commit). So now after optimization/replication the master/slave pair that
were out on sync on Friday now have the same numDocs (and ev
In the user core there are two fields, the database core in question was
40, but in production environments the database core is dynamic. My time
has been pretty crazy trying to get this out the door and we haven't tried
a standard solr install yet but it's on my plate for the test app and I
don't
AFAICS tf(name, 'co') returns 0 on the {id:1, name:'Coca-Cola Company'}
because it does not support partial match.
tf(name, 'company') will return 1
--
View this message in context:
http://lucene.472066.n3.nabble.com/Multiple-partial-match-tp4120886p4120919.html
Sent from the Solr - User maili
On Mon, Mar 3, 2014 at 11:14 AM, Gregg Donovan wrote:
> Yonik,
>
> That's a very clever idea. Unfortunately, I think that will skip the
> distributed query optimization we were hoping to take advantage of in
> SOLR-1880 [1], but it should work with the proposed distrib.singlePass
> optimization in
Add a function query boost that uses the term frequency, "tf":
bf=tf(name,'co') -- additive boost
boost=tf(name,'co') -- multiplicative boost
That does of course require that term frequency is not disabled for that
field in the schema.
You can multiply the term frequency as well in the fun
Hey Josh,I am not an expert in Java performance, but I would start with dumping a the heapand investigate with visualvm (the free tool that comes with JDK).In my experience, the most common cause for PermGen exception is the app createstoo many interned strings. Solr (actually Lucene) interns the
How can we calculate how much heap memory the filter cache will consume? We
understand that in order to determine a good size we also need to evaluate
how many filterqueries would be used over a certain time period.
Here's our setting:
According to the post below, 53 GB of RAM would b
Yonik,
That's a very clever idea. Unfortunately, I think that will skip the
distributed query optimization we were hoping to take advantage of in
SOLR-1880 [1], but it should work with the proposed distrib.singlePass
optimization in SOLR-5768 [2]. Does that sound right?
--Gregg
[1] https://issue
For the record, I am +1 for somebody to add Solr to the NoSQL wikipedia
page, in much the same way that Elasticsearch is already there.
From a LucidWorks webinar blurb: "The long awaited Solr 4 release brings a
large amount of new functionality that blurs the line between search engines
and No
Hello,
for some reason I have problems to get my local solr system to run (MacBook,
tomcat 6.0.35).
The setting is
solr directories (I use different solr versions at the same time):
/srv/solr/solr4.6.1 is the solr home, in solr home is a file solr.xml of the
new "discovery type" (no cores), and
Thanks, Mark!
The supervised process sounds very promising but complicated to get right.
E.g. where does the supervisor run, where do nodes report their status to,
are the checks active or passive, etc.
Having each node perform a regular background self-check and remove itself
from the cluster if
Thanks Erick. Indexing is not happening to the slave since it has never been
set up there - there aren't even any commits happening on the slave (which we
normally do via cron job). But Indexing is definitely happening to the master
at the time replication happens.
" Sounds like the master
Hi;
I said that:
"What are the main differences between ElasticSearch
and Solr that makes ElasticSearc a NoSQL store but not Solr."
because it is just a marketing term as Jack indicated after me. Also I said:
"The first link you provided includes ElasticSearch:
http://en.wikipedia.org/wiki/NoSQ
On 03.03.2014 16:33 Ahmet Arslan wrote:
> Currently there are two storing criteria available. However sort by index -
> to return the constraints sorted in their index order (lexicographic by
> indexed term) - should return most recent year at top, no?
No, it returns them -- as you say -- in le
Hi Guys,
Faced with a problem: make query to SOLR *name:co*^5*
It returns me two docs with equal score: {id: 1, name: 'Coca-Cola Company'},
{id: 2, name: Microsoft Corporation}.
How can I boost Coca-Cola Company because it contains more partial matches ?
P.S. All normalization used by TF-IDF
Hi,
Currently there are two storing criteria available. However sort by index - to
return the constraints sorted in their index order (lexicographic by indexed
term) - should return most recent year at top, no?
Ahmet
On Monday, March 3, 2014 4:36 PM, Michael Lackhoff wrote:
If I understand
It's a windows installation using a bitnami solr installer. I incorrectly
put 64M into the configuration for this, as I had copied the test
configuration I was using to recreate the permgen issue we were seeing on
our production system (that is configured to 512M) as it takes awhile with
to recreat
I just ran into an issue similar to this that effected document scores on
distributed searches. You might try doing an optimize and purging your deleted
documents while no indexing is being done then checking your counts. Once I
optimized all my indexes the document counts on all of my cores mat
Josh,
You've mentioned a couple of times that you've got PermGen set to 512M but then
you say you're running with -XX:MaxPermSize=64M. These two statements are
contradictory so are you *sure* that you're running with 512M of PermGen?
Assuming your on a *nix box can you provide `ps` output provi
If I understand the docs right, it is only possible to sort facets by
count or value in ascending order. Both variants are not very helpful
for year facets if I want the most recent years at the top (or appear at
all if I restrict the number of facet entries).
It looks like a requirement that was
On 3/3/2014 1:54 AM, KNitin wrote:
3. 2.8 Gb - Perm Gen (I am guessing this is because of interned strings)
As others have pointed out, this is really unusual for Solr. We often
see high permgen in our app servers due to dynamic class loading that
the framework performs; maybe you are somehow
What was the query you are making? What is the sort order for the
query? Are you sure you are not indexing data in between making these
requests? Are you able to reproduce this outside of your search
component?
It is hard to answer questions about custom code without actually
looking at the code.
Sorry for the delay,
I did not have access to the server and could not query anything.
This is my Query:
http://server:port
/solr/core/select?q=keyword1+keyword2&wt=xml&indent=true&hl.fragsize=120&f.file_URI_tokenized.hl.fragsize=1000&spellcheck=true&f.file_content.hl.alternateField=spell&hl.simp
On Fri, Feb 28, 2014 at 7:50 PM, Per Steffensen wrote:
> I might be able to find something for you. Which version are you using - I
> have some scripts that work on 4.0 and some other scripts that work for 4.4
> (and maybe later).
This sounds useful. I am using 4.6.1.
Kind regards
Jan
Hi,
I am using Solr 4.6 and doing Solr query on shard from inside
Solr search component and try to use the obtained results for my custom
logic. I have used a Solrj for doing distributed search, but the result
coming from this distributed search vary some time. So the my questions
a
On 01/03/2014 23:53, Jack Krupansky wrote:
NoSQL? To me it's just a marketing term, like Big Data.
+1
Depends very much who you talk to. Marketing folks like to ride the
current wave, so if NoSQL is current, they'll jump on that one, likewise
Big Data. Technical types like to be correct in t
70 matches
Mail list logo