Is it my imagination or has this exact email been on the list already?
Dennis Gearon
Signature Warning
It is always a good idea to learn from your own mistakes. It is usually a better
idea to learn from others’ mistakes, so you do not have to make them yourself.
from 'http:/
I think facet search is a good fit for your requirement. Also, what about the
Result Grouping feature of Solr?
-
Thanx:
Grijesh
http://lucidimagination.com
--
View this message in context:
http://lucene.472066.n3.nabble.com/Is-facet-could-be-used-for-Analytics-tp2515938p2515959.html
Sent from the Sol
Hello all,
We need to build an analytics kind of application. Initially we plan to aggregate
the results and add them to a database, or use an ETL tool. I have an idea to use
facet search. I just want to know others' opinions on this.
We require results in the below fashion. Top 3 results in each column
Use a FieldReaderDataSource to read the field from the database and then use
XPathEntityProcessor. The FieldReaderDataSource will give you the stream that is
needed by XPathEntityProcessor. Below is the example DIH configuration
code.
Hi,
I wonder if it is possible to let the user build up a Solr query and have it
validated by some Java API before sending it to Solr.
Is there a parser that could help with that? I would like to help the user
build a valid query as she types by showing messages like "The query is
not valid"
Thanks for the response, Hoss. Sorry for replying late; I was on a business
trip. The server was indexing as well as searching at the same time and it
was configured for a native file lock; could that be the issue? I got
another server, so I moved it to a master & slave configuration with file lock
being
Thanks for updating your solution
On Tue, Feb 8, 2011 at 8:20 AM, shan2812 wrote:
>
> Hi,
>
> At last the migration to Solr-1.4.1 does solve this issue :-)..
>
> Cheers
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Http-Connection-is-hanging-while-deleteByQuery-tp2367
A common problem in metasearch engines. It's not intractable. You just have to
surface the right statistics into a 'fusion' scorer.
-
NOT always nice. When are we getting better releases?
--
View this message in context:
http://lucene.472066.n3.nabble.com/score-from-two-cores-tp2012444p25156
Does anyone have an example of using this with SQL Server varchar or XML
field?
??
On 2/16/11 2:17 AM, "Stefan Matheis" wrote:
>What about using
>http://wiki.apache.org/solr/DataImportHandler#XPathEntityProcessor ?
: This was my first thought but -1 is relatively common but we have other
: numbers just as common.
i assume that when you say that you mean "...we have other numbers
(that are not negative) just as common, (but searching for them is much
faster)" ?
I don't have any insight into why your neg
: > if you don't have any custom components, you can probably just use
: > your entire solr home dir as is -- just change the solr.war. (you can't
: > just copy the data dir though, you need to use the same configs)
: >
: > test it out, and note the "Upgrading" notes in the CHANGES.txt for the
:
I frequently use multiple cores for these reasons:
* Completely different applications, such as web search and directory search
or if their update latency / query /caching requirements are very different
I can then also nuke one without affecting the other
Also, you get nice separation for m
I updated my data importer.
I used to have:
which wasn't working. But I changed that to
and it is working fine.
On Tue, Feb 15, 2011 at 5:50 PM, Koji Sekiguchi wrote:
> (11/02/16 8:03), Tanner Postert wrote:
>
>> I am using the data import handler and using the HTMLStripTransformer
>> d
You can also easily abuse shards to query multiple cores that share parts of
the schema. This way you have isolation with the ability to query them all.
The same can, of course, also be achieved using a single index with a simple
field identifying the application and using fq on that one.
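As a rough illustration of the single-index approach, the sketch below builds a select URL that restricts results to one application with a filter query. The field name `app`, the application id `websearch`, and the base URL are assumptions made up for the example; they do not come from the thread.

```python
from urllib.parse import urlencode


def build_app_query(base_url, user_query, app_id, app_field="app"):
    """Build a select URL that filters on a hypothetical application field."""
    params = [
        ("q", user_query),
        # fq restricts the result set without affecting scoring
        ("fq", f"{app_field}:{app_id}"),
    ]
    return f"{base_url}/select?{urlencode(params)}"


# Hypothetical values, for illustration only.
url = build_app_query("http://localhost:8983/solr", "title:lucene", "websearch")
print(url)
```

Because fq results are cached separately from the main query, a per-application filter like this is usually cheap to repeat.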
> Yes,
Hi,
That depends (as usual) on your scenario. Let me ask some questions:
1. what is the sum of documents for your applications?
2. what is the expected load in queries/minute
3. what is the update frequency in documents/minute and how many documents per
commit?
4. how many different applications
Yes, you're right, from now on when I say that, I'll say "except
shards". It is true.
My understanding is that shards functionality's intended use case is for
when your index is so large that you want to split it up for
performance. I think it works pretty well for that, with some
limitations
On Wed, Feb 16, 2011 at 5:08 PM, Paul wrote:
> Is this a known solr bug or is there something subtle going on?
Yes, I think it's the following bug, fixed in 1.4.1:
* SOLR-1777: fieldTypes with sortMissingLast=true or sortMissingFirst=true can
result in incorrectly sorted results.
-Yonik
http:
Had a look at this and opened an issue:
https://issues.apache.org/jira/browse/SOLR-2369
Looks like the quick fix is to switch to log4j instead of jdk-logging (which is
not a bad idea in itself:-)
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
On 16. feb. 2011, at 17
(I'm using solr 1.4)
I'm doing a test of my index, so I'm reading out every document in
batches of 500. The query is (I added newlines here to make it
readable):
http://localhost:8983/solr/archive_ECCO/select/
?q=archive%3AECCO
&fl=uri
&version=2.2
&start=0
&rows=500
&indent=on
&sort=uri%20asc
I
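A minimal sketch of the batch read-out described above: it only generates the successive request URLs, using the parameters shown in the message and advancing `start` by `rows` each time. The total document count passed in is an assumption for the example, and nothing is actually sent to Solr.

```python
from urllib.parse import urlencode, quote

BASE = "http://localhost:8983/solr/archive_ECCO/select/"


def page_urls(total_docs, rows=500):
    """Yield one select URL per batch of `rows` documents."""
    for start in range(0, total_docs, rows):
        params = [
            ("q", "archive:ECCO"),
            ("fl", "uri"),
            ("version", "2.2"),
            ("start", start),
            ("rows", rows),
            ("indent", "on"),
            ("sort", "uri asc"),
        ]
        # quote_via=quote encodes the space in the sort clause as %20
        yield BASE + "?" + urlencode(params, quote_via=quote)


# total_docs=1200 is a made-up figure for illustration.
urls = list(page_urls(1200))
for u in urls:
    print(u)
```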
Hmmm. Maybe I'm not understanding what you're getting at, Jonathan, when you
say 'There is no good way in Solr to run a query across multiple Solr indexes'.
What about the 'shards' parameter? That allows searching across multiple cores
in the same instance, or shards across multiple instances.
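For readers unfamiliar with the `shards` parameter, here is a small sketch of how such a request could be put together. The core names `core0` and `core1` and the host are assumptions; the shard entries are written as host:port/path without a scheme, and the cores are expected to share the fields being queried.

```python
from urllib.parse import urlencode


def build_sharded_query(coordinator_url, user_query, shards):
    """Build a select URL that fans the query out to several shards."""
    params = [
        ("q", user_query),
        # comma-separated list of shard addresses
        ("shards", ",".join(shards)),
    ]
    # keep ':', '/' and ',' readable in the resulting URL
    return f"{coordinator_url}/select?{urlencode(params, safe=':/,')}"


# Hypothetical cores in one local instance.
shard_list = ["localhost:8983/solr/core0", "localhost:8983/solr/core1"]
url = build_sharded_query("http://localhost:8983/solr/core0", "title:lucene", shard_list)
print(url)
```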
> Thanks for the answers, more questions below.
>
> On 2/16/2011 3:37 PM, Markus Jelsma wrote:
> > 200.000 stored fields? I assume that number includes your number of
> > documents? Sounds crazy =)
>
> Nope, I wasn't clear. I have fewer than a dozen stored fields, but the
> value of a stored field
Solr 1.4.1. So, from the documentation at
http://wiki.apache.org/solr/SolrReplication
I was wondering if I could get away without having any actual
configuration in my slave at all. The replication handler is turned on,
but if I'm going to manually trigger replication pulls while supplying
th
Solr multi-core essentially just lets you run multiple separate, distinct
Solr indexes in the same running Solr instance.
It does NOT let you run queries across multiple cores at once. The
cores are just like completely separate Solr indexes, they are just
conveniently running in the same Solr
Thanks for the answers, more questions below.
On 2/16/2011 3:37 PM, Markus Jelsma wrote:
200.000 stored fields? I assume that number includes your number of documents?
Sounds crazy =)
Nope, I wasn't clear. I have fewer than a dozen stored fields, but the
value of a stored field can sometimes b
Hi,
I'm trying to use a CustomSimilarityFactory and pass in per-field
options from the schema.xml, like so:
500
1
0.5
500
2
0.5
My problem is I am utterly failing to figure out how to parse this
nested option structu
Closing a core will shutdown almost everything related to the workings of a
core. Update and search handlers, possible warming searchers etc.
Check the implementation of the close method:
http://svn.apache.org/viewvc/lucene/dev/branches/branch_3x/solr/src/java/org/apache/solr/core/SolrCore.java?v
Hi,
I need to index multiple applications using Solr. I also need to
share indexes or run a search query across these application
indexes. Is Solr multi-core the way to go? My server config is
2 virtual CPUs @ 1.8 GHz and about 32GB of memory. What is the
recommendation?
Th
> In my own Solr 1.4, I am pretty sure that running an index optimize does
> give me significantly better performance. Perhaps because I use some
> largeish (not huge, maybe as large as 200k) stored fields.
200.000 stored fields? I assume that number includes your number of documents?
Sounds crazy =
In my own Solr 1.4, I am pretty sure that running an index optimize does
give me significantly better performance. Perhaps because I use some
largeish (not huge, maybe as large as 200k) stored fields.
So I'm interested in always keeping my index optimized.
Am I right that if I set mergeFactor to
2011-02-16 11:32:45.489::INFO: Shutdown hook executing
2011-02-16 11:35:36.002::INFO: Shutdown hook complete
The shutdown time seems to be proportional to the amount of time that Solr has
been running. If I immediately restart and shut down again, it takes a fraction
of a second. What causes i
Hi,
I think the tool was called jmxterm or termjmx. As for REST over JMX - I think
I've seen that on Google code. If you are interested in this sort of stuff,
check my bookmarks on pinboard.in, it's all there and nicely tagged.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nut
Thanks Koji for the quick response. After making the changes you recommended,
it works great now.
--
View this message in context:
http://lucene.472066.n3.nabble.com/CJKAnalyzer-and-Synonyms-tp2510104p2512097.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi Tri,
You could look at the stats page for each slave and compare the number of docs
in them. The one(s) that are off from the rest/majority are out of sync.
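The comparison suggested above can be sketched as follows, assuming the document counts have already been read off each slave's stats page by some other means. The slave names and counts are invented for the example. Note that equal counts do not prove the indexes are identical; this only flags slaves that clearly differ from the majority.

```python
from collections import Counter


def out_of_sync(doc_counts):
    """Return the slaves whose document count differs from the most common one."""
    majority, _ = Counter(doc_counts.values()).most_common(1)[0]
    return sorted(name for name, n in doc_counts.items() if n != majority)


# Made-up counts, as they might be collected from each slave's stats page.
counts = {"slave1": 1000, "slave2": 1000, "slave3": 990}
print(out_of_sync(counts))
```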
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Origi
Hi,
Jetty on Ubuntu has been working well for us and a bunch of our customers.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
> From: Rosa (Anuncios)
> To: solr-user@lucene.apache.org
> Sent:
Managed to get this working. Changed my solrconfig for the one provided in
velocity dir, repackaged the war file and redeployed on tomcat.
Although this seems like a ridiculously obvious thing to do, I somehow
overlooked the repackaging aspect, this was where the problem was.
Thanks for the hel
Hello,
I saw taxonomy faceting on these slides:
http://www.lucidimagination.com/solutions/webcasts/faceting
and I have a question:
I have many taxonomies and each document can belong to some of them. I don't
know how many taxonomies there are, so I can't define a field in the schema
for each taxonomy (
It looks like a log4j issue:
java.lang.NoClassDefFoundError: org/apache/log4j/jmx/HierarchyDynamicMBean
at
org.apache.zookeeper.jmx.ManagedUtil.registerLog4jMBeans(ManagedUtil.java:51)
at
org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:114)
2011/2/16 Yonik Seeley
> On Wed, Feb 16, 2011 at 3:57 AM, Thorsten Scherler
> wrote:
> > On Tue, 2011-02-15 at 09:59 -0500, Yonik Seeley wrote:
> >> On Mon, Feb 14, 2011 at 8:08 AM, Thorsten Scherler
> wrote:
> >> > Hi all,
> >> >
> >> > I followed http://wiki.apache.org/solr/SolrCloud and ever
It only works on FileDataSource, right?
Bill Bell
Sent from mobile
On Feb 16, 2011, at 2:17 AM, Stefan Matheis
wrote:
> What about using
> http://wiki.apache.org/solr/DataImportHandler#XPathEntityProcessor ?
>
> On Wed, Feb 16, 2011 at 10:08 AM, Bill Bell wrote:
>> I am using DIH.
>>
>> I
On Wednesday 16 February 2011 16:49:51 Tod wrote:
> I have a couple of semi-related questions regarding the use of the Term
> Vector Component:
>
>
> - Using curl is there a way to query a specific document (maybe using
> Tika when required?) to get a distribution of the terms it contains?
No Ti
Hello Ravish, Erick,
I'm facing the same issue with solr-trunk (as of r1071282)
- Field configuration :
positionIncrementGap="100">
- Schema configuration :
In my test index, I have documents with sparse values : Some documents
may or may not have a value for f1, f2 and/or f3
The
I have a couple of semi-related questions regarding the use of the Term
Vector Component:
- Using curl is there a way to query a specific document (maybe using
Tika when required?) to get a distribution of the terms it contains?
- When I set the termVector on a field do I need to reindex? I
(11/02/17 0:17), alexw wrote:
Hi everyone,
I am trying to get Synonyms working with CJKAnalyzer. Search works fine but
synonyms do not work as expected. Here is my field definition in the schema
file:
When testing on the analysis page, the synonym filter does not
I think you can get far by just optimizing how often you do commits (as seldom
as possible), as well as MergeFactor, to get a good balance between indexing
and query efficiency. It may be that you're looking for fewer segments on
average - not always one fully optimized segment.
If you still fe
Hi everyone,
I am trying to get Synonyms working with CJKAnalyzer. Search works fine but
synonyms do not work as expected. Here is my field definition in the schema
file:
When testing on the analysis page, the synonym filter does not kick in at
all.
My que
The documents don't have the same uniqueKey; only the reason is the same.
I cannot show the exact search request because of our privacy policy...
the query is like that:
reason_1: firstname lastname,
reason_2: 1234, 02.02.2011
--> in field reason: firstname lastname, 1234, 02.02.2011
the search reques
hm okay, reasonable :)
never used it, but maybe a pointer into the right direction?
http://wiki.apache.org/solr/DataImportHandler#Scheduling
On Wed, Feb 16, 2011 at 2:27 PM, Renaud Delbru wrote:
> Mainly technical administration effort.
>
> We are trying to have a solr packaging that
> - minimis
It looks like you are trying to use a function query on a multi-valued field?
-Yonik
http://lucidimagination.com
On Tue, Feb 15, 2011 at 8:34 AM, Ezequiel Calderara wrote:
> Hi, im having a problem while trying to do a dismax search.
> For example i have the standard query url like this:
> It
Mainly technical administration effort.
We are trying to have a solr packaging that
- minimises the effort to deploy the system on a machine.
- reduces errors when deploying
- centralises the logic of the Solr system
Ideally, we would like to have a central place (e.g., solrconfig) where
the lo
Hi,
We would like to trigger an optimise every x hours. From what I can see,
there is nothing in Solr (3.1-SNAPSHOT) that enables such a thing.
We have a master-slave configuration. The masters are tuned for fast
indexing (large merge factor). However, for the moment, the master index
is
the fieldType is textgen.
-
--- System
One Server, 12 GB RAM, 2 Solr Instances, 7 Cores,
1 Core with 31 Million Documents other Cores < 100.000
- Solr1 for Search-Requests - commit every Minute - 4GB Xmx
- Solr2 for Upd
Renaud,
just because I'm interested: what are your concerns about using
cron for that?
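For context, a cron-driven optimize could be as small as the sketch below, which prepares an HTTP POST of an `<optimize/>` message to the update handler. The Solr URL, the script name, and the six-hour schedule are assumptions for illustration; the request is only built here, not sent.

```python
import urllib.request


def build_optimize_request(solr_url):
    """Prepare (but do not send) a POST of <optimize/> to the update handler."""
    return urllib.request.Request(
        url=f"{solr_url}/update",
        data=b"<optimize/>",
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


# Hypothetical master URL.
req = build_optimize_request("http://localhost:8983/solr")
print(req.get_method(), req.full_url)

# To actually trigger the optimize, a script would call:
#   urllib.request.urlopen(req)
# and a hypothetical crontab entry running it every six hours could be:
#   0 */6 * * * python3 /path/to/optimize.py
```

This keeps the scheduling outside Solr, which is exactly the administrative overhead the original poster wants to avoid, so it is shown only to make the trade-off concrete.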
Stefan
On Wed, Feb 16, 2011 at 2:12 PM, Renaud Delbru wrote:
> Hi,
>
> We would like to trigger an optimise every x hours. From what I can see,
> there is nothing in Solr (3.1-SNAPSHOT) that enables to d
Nishant,
correct me if i'm wrong .. but spatial search normally requires
geo-information, like latitude and longitude to work? so you would
need to fetch this information before putting them into solr. the
google maps api offers
http://code.google.com/intl/all/apis/maps/documentation/geocoding/#Re
Regarding the Wiki-Page .. since 1.2 .. so, yes, should :)
On Wed, Feb 16, 2011 at 1:55 PM, Leonardo Souza wrote:
> Hi Stefan,
>
> LukeRequestHandler could be a good solution, there's a lot of useful info.
> This handler works with version 1.4x?
>
> thanks
>
> [ ]'s
> Leonardo da S. Souza
> °v°
On Wed, Feb 16, 2011 at 3:57 AM, Thorsten Scherler wrote:
> On Tue, 2011-02-15 at 09:59 -0500, Yonik Seeley wrote:
>> On Mon, Feb 14, 2011 at 8:08 AM, Thorsten Scherler
>> wrote:
>> > Hi all,
>> >
>> > I followed http://wiki.apache.org/solr/SolrCloud and everything worked
>> > fine till I tried
Hi,
I have a very typical problem. From one of my applications I get data in the
format
Some Address
1
How can I implement a spatial search for this data?
Any ideas are welcome
Regards,
Nishant Anand
Hi Stefan,
LukeRequestHandler could be a good solution, there's a lot of useful info.
This handler works with version 1.4x?
thanks
[ ]'s
Leonardo da S. Souza
°v° Linux user #375225
/(_)\ http://counter.li.org/
^ ^
On Wed, Feb 16, 2011 at 10:41 AM, Stefan Matheis <
matheis.ste...@google
What does the admin page show you as the contents of
your index for reason_1?
I suspect you don't really have two documents with the same
value. Perhaps you give them both the same uniqueKey and
one overwrites the other. Perhaps you didn't commit the second.
Perhaps
But you haven't provided
Maybe the http://wiki.apache.org/solr/LukeRequestHandler ?
On Wed, Feb 16, 2011 at 1:34 PM, Savvas-Andreas Moysidis
wrote:
> There is probably a better and more robust way of doing this, but you could
> make a request to /solr/admin/file/?file=schema.xml and parse the returned
> xml?
>
> Does any
There is probably a better and more robust way of doing this, but you could
make a request to /solr/admin/file/?file=schema.xml and parse the returned
xml?
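A minimal sketch of the "fetch schema.xml and parse it" idea, working on a schema document that has already been downloaded. The sample schema below is invented for the example; a real schema.xml has many more elements, but the `field` and `dynamicField` elements with a `name` attribute are what matter here.

```python
import xml.etree.ElementTree as ET

# Invented, heavily reduced schema used only to exercise the function.
SAMPLE_SCHEMA = """<schema name="example" version="1.2">
  <fields>
    <field name="id" type="string" indexed="true" stored="true"/>
    <field name="title" type="text" indexed="true" stored="true"/>
    <dynamicField name="*_s" type="string" indexed="true" stored="true"/>
  </fields>
</schema>"""


def declared_fields(schema_xml):
    """Return (explicit field names, dynamic field patterns) from schema XML."""
    root = ET.fromstring(schema_xml)
    fields = [f.get("name") for f in root.iter("field")]
    dynamic = [f.get("name") for f in root.iter("dynamicField")]
    return fields, dynamic


fields, dynamic = declared_fields(SAMPLE_SCHEMA)
print(fields, dynamic)
```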
Does anyone else know of a better way to query Solr for its schema?
Thanks,
- Savvas
On 16 February 2011 11:34, Leonardo Souza wrote:
> Hi
Hi,
We do have a validation layer for other purposes, but this layer does not know
about the fields, and I would not like to replicate this configuration. Is there
any way to query the Solr core about declared fields?
thanks,
[ ]'s
Leonardo da S. Souza
°v° Linux user #375225
/(_)\ http://coun
Hello.
I have the fields reason_1 and reason_2. These two fields are defined in my schema
by one dynamicField:
I copy this field into my text-default search field:
And into a new field, reason:
---> If I have two documents with exactly the same value in the reason_1
field, Solr can only find ONE document, not
My error is that Solr is not reachable with a ping.
ping over php-HttpRequest ...
-
--- System
One Server, 12 GB RAM, 2 Solr Instances, 7 Cores,
1 Core with 31 Million Documents other Cores < 100.000
- Solr1 for Search-
Hi,
If you have an Application layer and are not directly hitting Solr then
maybe this functionality could be implemented in Validation layer prior to
making the Solr call ?
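One way such a validation layer might flag misspelled field names is sketched below. It is a deliberately naive check, not the Solr query parser: it strips quoted phrases, pulls out anything that looks like a `name:` prefix, and compares it with a set of known fields. The field names and queries are made up, and dynamic fields and other parser features are ignored.

```python
import re

# A word followed by a colon, not preceded by a word character or backslash.
FIELD_PREFIX = re.compile(r'(?<![\w\\])([A-Za-z_][\w.]*):')


def unknown_fields(query, known_fields):
    """Return field prefixes used in `query` that are not in `known_fields`."""
    # Blank out quoted phrases so colons inside them are not mistaken for fields.
    unquoted = re.sub(r'"(?:\\.|[^"\\])*"', '""', query)
    used = FIELD_PREFIX.findall(unquoted)
    return sorted(set(used) - set(known_fields))


# Hypothetical schema fields and a query with a typo in one field name.
known = {"title", "id"}
print(unknown_fields("titel:solr AND id:42", known))
```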
Cheers,
- Savvas
On 16 February 2011 10:23, Leonardo Souza wrote:
> Hi,
>
> We are using solr 1.4 in a big project. Now
I have no idea, seems you haven't compiled Carrot2 or haven't included all
jars.
On Wednesday 16 February 2011 11:29:30 Isha Garg wrote:
> On Wednesday 16 February 2011 03:32 PM, Markus Jelsma wrote:
> > What distro are you using? On at least Debian systems you can put the -
> > Dsolr.clustering.
Hi,
There are a couple of Solr 1.4.1 slaves, all doing the same. Pulling some
snaps, handling some queries, nothing exciting. But can anyone explain a
sudden nightly occurrence of this error?
2011-02-16 01:23:04,527 ERROR [solr.handler.ReplicationHandler] - [pool-238-thread-1] - : SnapPull fail
On Wednesday 16 February 2011 03:32 PM, Markus Jelsma wrote:
What distro are you using? On at least Debian systems you can put the
-Dsolr.clustering.enabled=true system property in /etc/default/tomcat6.
You can also, of course, remove all occurrences of ${solr.clustering.enabled}
from you s
Hi,
We are using solr 1.4 in a big project. Now it's time to make some
improvements.
We use the standard query parser and we would like to handle the misspelled
field names.
The problem is that SolrException can not help to flag the problem
appropriately because
this exception is used for other pr
What distro are you using? On at least Debian systems you can put the
-Dsolr.clustering.enabled=true system property in /etc/default/tomcat6.
You can also, of course, remove all occurrences of ${solr.clustering.enabled}
from your solrconfig.xml
On Wednesday 16 February 2011 10:52:35 Isha Gar
On Wednesday 16 February 2011 02:41 PM, Markus Jelsma wrote:
On Debian you can edit /etc/default/tomcat6
hi,
i am using solr1.4 with apache tomcat. to enable the
clustering feature
i follow the link
http://wiki.apache.org/solr/ClusteringComponent
Plz help me how to add-Dsolr.
What about using
http://wiki.apache.org/solr/DataImportHandler#XPathEntityProcessor ?
On Wed, Feb 16, 2011 at 10:08 AM, Bill Bell wrote:
> I am using DIH.
>
> I am trying to take a column in a SQL Server database that returns an XML
> string and use Xpath to get data out of it.
>
> I noticed that
On Debian you can edit /etc/default/tomcat6
> hi,
> i am using solr1.4 with apache tomcat. to enable the
> clustering feature
> i follow the link
> http://wiki.apache.org/solr/ClusteringComponent
> Plz help me how to add-Dsolr.clustering.enabled=true to $CATALINA_OPTS.
> after that w
I am using DIH.
I am trying to take a column in a SQL Server database that returns an XML
string and use Xpath to get data out of it.
I noticed that Xpath works with external files, how do I get it to work with
a database?
I need something like "//insur[5][@name='Blue Cross']"
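To make the goal concrete, the sketch below applies an attribute-based path expression to an XML string of the kind a database column might return. It uses Python's standard library rather than DIH, and the XML structure is invented apart from the `insur` element and `name` attribute mentioned in the question; the positional `[5]` predicate is left out to keep the example small.

```python
import xml.etree.ElementTree as ET

# Invented column value; only <insur name="..."> comes from the question.
xml_column_value = """<plans>
  <insur name="Aetna"><id>1</id></insur>
  <insur name="Blue Cross"><id>2</id></insur>
</plans>"""


def find_insurers(xml_text, name):
    """Return all <insur> elements whose name attribute equals `name`."""
    root = ET.fromstring(xml_text)
    return root.findall(f".//insur[@name='{name}']")


matches = find_insurers(xml_column_value, "Blue Cross")
print([m.findtext("id") for m in matches])
```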
Thanks.
Greg,
a few things, i noticed while reading your post:
1) you don't need an -assignment for fields where the name does
not change, you can just skip that. - just to name one example
2) TemplateTransformer
(http://wiki.apache.org/solr/DataImportHandler#TemplateTransformer)
has no name-attribute,
Well, you need to specify a path, relative or absolute, that points to the
directory where the Velocity JAR file resides.
I'm not sure, at this point, exactly what you're missing. But it should be
fairly straightforward. Solr startup logs the libraries it loads, so maybe
that is helpful info.