Hi Tq very much its solved my problem ,
having multiple Request Handlers will not degrade the performance ... unless
we are sending parallel requests? am i right ?
Thansk,
Prasad
--
View this message in context:
http://lucene.472066.n3.nabble.com/2-solr-dataImport-requests-on-a-single-core-
Solr does not, yet, at least not simply, as far as I know, but there are ideas
and some JIRA's with maybe some patches:
http://wiki.apache.org/solr/HierarchicalFaceting
From: rajini maski [rajinima...@gmail.com]
Sent: Friday, July 23, 2010 12:34 AM
To: s
I am also looking out for same feature in Solr and very keen to know whether
it supports this feature of tree faceting... Or we are forced to index in
tree faceting formatlike
1/2/3/4
1/2/3
1/2
1
In-case of multilevel faceting it will give only 2 level tree facet is what
i found..
If i give
Hoss, thank you for your helpful response!
: i think what's confusing you is that you are using the
: MappingCharFilterFactory with that file in your "text" field type to
: convert any ISOLatin1Accent characters to their "base" characters
The problem is that a large range of characters are not g
: You are correct, first of all i haven't move yet to the TrieDateField, but i
: am still waiting to find out a bit more information about it, and there's
: not a lot of info, other then in the xml file.
In general TrieFields are a way of trading disk space for range query
speed. they are expla
Ok, it was a runaway SQL query which isn't using an index.
@tommychheng
Programmer and UC Irvine Graduate Student
Find a great grad school based on research interests: http://gradschoolnow.com
On 7/22/10 4:26 PM, Tommy Chheng wrote:
Hi,
When I run my DIH script, it says it's "busy" but the "
Hi,
When I run my DIH script, it says it's "busy" but the "Total Requests
made to DataSource" and "Total Rows Fetched" remain unchanged at 4 and
6. It hasn't reported a failure.
How can I debug what is blocking the DIH?
--
@tommychheng
Programmer and UC Irvine Graduate Student
Find a great
If the field is a single token, just define the uniqueKey on it in your
schema.
Otherwise, this may be of interest:
http://wiki.apache.org/solr/Deduplication
Haven't used it myself though...
best
Erick
On Thu, Jul 22, 2010 at 6:14 PM, Pavel Minchenkov wrote:
> Hi,
>
> Is it possible to remove
This seems to work from SolrJ now:
ModifiableSolrParams params = new ModifiableSolrParams();
params.set("qt", "/clustering");
params.set("q", query);
params.set("carrot.title", "filename_s");
params.set("rows","100");
params.se
Yeah, my results count is 151 and only 21 documents appear in 6
clusters.
This is true whether I use URL or SolrJ.
When I use carrot workbench and point to my Solr using local clustering,
the workbench
has numerous clusters and all documents are placed
On Thu, 2010-07-22 at 18:06 +0200, Stan
I blieve this cam up on IRC, and the end result wsa that the bq was
working fine, Justin just wasn't noticing because he added it to his
solrconfig.xml (and not to the query URL) and his browser was still
caching the page -- so he didn't see his boost affect anything)
(but i may be confusing j
: > being returned (consider the case where we are sorting in term order - once
: > we have collected counts for ${facet.limit} constraints, we can stop
: > iterating over terms -- but to compute the total umber of constraints (ie:
: > terms) we would have to keep going and test every one of them a
Hi,
Is it possible to remove duplicates in search results by a given field?
Thanks.
--
Pavel Minchenkov
: It looks I can
: call coreA.getCoreDescriptor().getCoreContainer().getCore("coreB"); and then
get
: the Searcher and release it when I am done.
:
: Is there a better way?
not really ... not unless you want to do it via HTTP to "localhost"
: And it also appears that during the inform or init
Hi All,
A question about the WordDelimiterFilter and position increments /
phrase queries:
I have a string like: 3-diphenyl-propanoic
When indexed gets it is broken up into the following tokens:
pos token offset
1 3 0-1
2 diphenyl 2-10
3 propanoic 11-20
3 diphenylpropanoic 2-20
The WordDelimit
Chris Hostetter wrote:
computing the number: in some algorithms it's relatively cheap (on a
single server) but in others it's more expensive then computing the facet
counts being returned (consider the case where we are sorting in term
order - once we have collected counts for ${facet.limit} c
Paul Dlug wrote:
On Thu, Jul 22, 2010 at 4:01 PM, Jonathan Rochkind wrote:
The synonym approach won't work as I need to provide them in a file.
The variants may be more dynamic and not known in advance, the process
creating the documents to index does have that logic and could easily
put th
: I am new to Solr and seeking your help to change filter from
: ISOLatin1AccentFilterFactory to ASCIIFoldingFilterFactory files. I am not
According to the files you posted, you aren't using the
ISOLatin1AccentFilterFactory -- so problem solved w/o making any changes.
: sure what change is to
Hi,
I am not sure why some commits take very long time. I have a batch indexing
which commits just once after it completes the indexing.
I tried to index just 36 rows but the total time taken to index was like 12
minutes. The indexing time was very less just some 30 seconds but it took
the remai
: I want to load full text into an external cache, So I added so codes
: in newSearcher where I found the warm up takes place. I add my codes
...
: public void newSearcher(SolrIndexSearcher newSearcher,
: SolrIndexSearcher currentSearcher) {
: warmTextCache(newSearcher,warmTextCache,
: I cannot find any info on how to get the list of current fields in an index
: (possibly sharded). With dynamic fields, I cannot simply parse the schema to
there isn't one -- the LukeRequestHandler can tell you what fields
*actually* exist in your index, but you'd have to query it on each shard
:
: When I use the stats component on a field that has no values in the result set
: (ie, stats.missing == rowCount), I'd expect that 'min'and 'max' would be
: blank.
:
: Instead, they seem to be the smallest and largest float values or something,
: min = 1.7976931348623157E308, max = 4.9E-324 .
: > I would like get the total count of the facet.field response values
:
: I'm pretty sure there's no way to get Solr to do that -- other than not
: setting a facet.limit, getting every value back in the response, and
: counting them yourself (not feasible for very large counts). I've
: look
On Thu, Jul 22, 2010 at 4:01 PM, Jonathan Rochkind wrote:
> I think the Synonym filter should actually do exactly what you want, no?
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory
>
> Hmm, maybe not exactly what you want as you describe it. It comes close,
: As the comments suggest, it's not a bug, but just the best we can do
: for now since our priority queues don't support removal of arbitrary
FYI: I updated the DistributedSearch wiki to be more clear about this --
it previously didn't make it explicitly clear that docIds were suppose to
be uni
I think the Synonym filter should actually do exactly what you want, no?
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory
Hmm, maybe not exactly what you want as you describe it. It comes close,
maybe good enough. Do you REALLY need to support "I Business M
I have a multi-core environment and a custom request handler. However, I have
one place where I would like to have my request handler on coreA query to
coreB.
This is not distributed search. This is just an independent query to get some
additional data.
I am also guaranteed that each server wi
Is there a tokenizer that supports providing variants of the tokens at
index time? I'm looking for something that could take a syntax like:
International|I Business|B Machines|M
Which would take each pipe delimited token and preserve its position
so that phrase queries work properly. The above wo
Hi Stephan,
On a lark, I hacked up solr running under a small-footprint servlet engine
on my jailbroken iPad. You can see the console here: http://imgur.com/tHRh3
It's not a particularly practical solution, though, since Apple would never
approve a Java-based app for the App Store. Or a backgrou
As the comments suggest, it's not a bug, but just the best we can do
for now since our priority queues don't support removal of arbitrary
elements. I guess we could rebuild the current priority queue if we
detect a duplicate, but that will have an obvious performance impact.
Any other suggestions?
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Stephan Schwab wrote:
> Dear Solr community,
>
> does anyone know whether it may be possible or has already been done to
> bring Solr to the Apple iPad so that applications may use a local search
> engine?
huh?
Solr requires Java. iPad does not supp
Dear Solr community,
does anyone know whether it may be possible or has already been done to
bring Solr to the Apple iPad so that applications may use a local search
engine?
Greetings,
Stephan
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-on-iPad-tp987655p987655.htm
Thank you for the link.
I was not aware of the multifaceting syntax - this will enable me to run 1
less query on the main page!
However this is not a tree faceting feature.
Thanks
Eric
On Thu, Jul 22, 2010 at 4:51 PM, SR wrote:
> Perhaps the following article can help:
> http://www.craftyf
I'm using Solr to index data from our data warehouse. The data is imported
through text files. I've written a custom FileImportDataImportHandler that
extends DataSource and it works fine - I've tested it with 280,000 records
and it manages to build the index in about 3 minutes. My problem is that
Hi,
In my SolrJ, I used ModifiableSolrParams and I set ("rows",50) but it
> still returns less than 10 for each cluster.
>
Oh, the number of documents per cluster very much depends on the
characteristics of your documents, it often happens that the algorithms
create larger numbers of smaller clus
Perhaps the following article can help:
http://www.craftyfella.com/2010/01/faceting-and-multifaceting-syntax-in.html
-S
On Jul 22, 2010, at 5:39 PM, Eric Grobler wrote:
> Hi Solr Community
>
> If I have:
> COUNTRY CITY
> Germany Berlin
> Germany Hamburg
> Spain Madrid
>
> Can I do faceting
Hi Solr Community
If I have:
COUNTRY CITY
Germany Berlin
Germany Hamburg
Spain Madrid
Can I do faceting like:
Germany
Berlin
Hamburg
Spain
Madrid
I tried to apply SOLR-792 to the current trunk but it does not seem to be
compatible.
Maybe there is a similar feature existing in the latest
Just an update to say that the only way I figured out to include my 2
tags was via the element() scheme :
* http://www.w3.org/2001/XInclude";>
*
obviously this is not desirable and clean at all even if it can make the
trick if the number of fields is very small.
Any ot
Hi Savannah,
A few comments below, scattered in-line...
-- Ken
On Jul 21, 2010, at 3:08pm, Savannah Beckett wrote:
And I will have to recompile the dom or sax code each time I add a
job board for
crawling. Regex patten is only a string which can be stored in a
text file or
db, and retrie
Informational
Hi,
This information is for anyone who might be running into problems when
performing explicit periodic backups of Solr indexes. I encountered this
problem, and hopefully this might be useful to others.
A related Jira issue is: SOLR-1475.
The issue is: When you execute a 'command=b
I'm trying to use dspace to search across a range of index created and stored
using Dsindexer.java class. I have seen where Solr can be use to perform
numerical range queries using either TrieIntField,
TrieDoubleField,TrieLongField, etc.. classes defined in Solr's api or
SortableIntField.java, So
I'm using Solr 1.4.1
-Original Message-
From: Justin Lolofie
To: solr-user@lucene.apache.org
Sent: Thu, Jul 22, 2010 2:57 pm
Subject: Re: Dismax query response field number
scrapy what version of solr are you using?
I'd like to do "fq=city:Paris" but it doesnt seem to work
scrapy what version of solr are you using?
I'd like to do "fq=city:Paris" but it doesnt seem to work for me (solr
1.4) and the docs seem to suggest its a feature that is coming but not
there yet? Or maybe I misunderstood?
On Thu, Jul 22, 2010 at 6:00 AM, wrote:
>
> Thanks,
>
> That was the pr
I set the rows=50 on my clustering URL in a browser and it returns more.
In my SolrJ, I used ModifiableSolrParams and I set ("rows",50) but it
still returns less than 10 for each cluster.
Is there a way to set rows wanted with ModifiableSolrParams?
thanks and sorry for the double post.
Darren
Staszek,
Thank you. The cluster response has a maximum of 10 documents in each
cluster. I didn't set this limit and the query by itself returns 500+
documents. There should be many more than 10 in each cluster. Does it
default to 10 maybe? Or is there a way to say, cluster every result in
the que
Thanks,
That was the problem!
select?q=moto&qt=dismax& fq =city:Paris
-Original Message-
From: Chantal Ackermann
To: solr-user@lucene.apache.org
Sent: Thu, Jul 22, 2010 12:47 pm
Subject: Re: Dismax query response field number
is this a typo in your query or in your
is this a typo in your query or in your e-mail?
you have the "q" parameter twice.
use "fq" for query inputs that mention a field explicitly when using
dismax.
So it should be:
select?q=moto&qt=dismax& fq =city:Paris
(the whitespace is only for visualization)
chantal
On Thu, 2010-07-22 at 11:
Hi,
I am trying to do a similar thing within the schema.xml (using Solr 1.4.1),
having a (super)schema that is common to 2 instances and specific fields I
would like to include (with XInclude).
Something like this:
*
...
...
*
and it works with the sp
maybe its too simple, but did you try the rows=20 or sth. greater as
Lance suggested?
=>
select?rows=20&qt=dismax
Regards,
Peter.
> Yes i've data... maybe my query is wrong?
>
> select?q=moto&qt=dismax&q=city:Paris
>
> Field city is not showing?
>
>
>
>
>
>
>
>
> -Original Message
Are you using the same instance of CommonsHttpSolrServer for all the
requests?
On Wed, Jul 21, 2010 at 4:50 PM, Sharp, Jonathan wrote:
>
> Some further information --
>
> I tried indexing a batch of PDFs with the client and Solr CELL, setting
> the credentials in the httpclient. For some reason
Yes i've data... maybe my query is wrong?
select?q=moto&qt=dismax&q=city:Paris
Field city is not showing?
-Original Message-
From: Grijesh.singh
To: solr-user@lucene.apache.org
Sent: Thu, Jul 22, 2010 10:07 am
Subject: Re: Dismax query response field number
Do u have data
DataImportHandler does not support parallel execution of several
requests. You should either send your requests sequentially or
register several DIH handlers in solrconfig and use them in parallel.
On Thu, Jul 22, 2010 at 11:20 AM, kishan wrote:
>
> please help me
> --
> View this message in con
Hi,
I am attempting to cluster a query. It kinda works, but where my
> (regular) query returns 500 results the cluster only shows 1-10 hits for
> each cluster (5 clusters). Never more than 10 docs and I know its not
> right. What could be happening here? It should be showing dozens of
> documents
Do u have data in that field also,Solr returns field which have data only.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Dismax-query-response-field-number-tp985567p986417.html
Sent from the Solr - User mailing list archive at Nabble.com.
No, i'm talking about fields.
In my schema i've got about 15 fields with: stored="true"
Like this:
But when i run a query it return me only 10 fields, the last 4 or 5 are not the
the response??
-Original Message-
From: Lance Norskog
To: solr-user@lucene.apache.org
Sent:
please help me
--
View this message in context:
http://lucene.472066.n3.nabble.com/2-solr-dataImport-requests-on-a-single-core-at-the-same-time-tp978649p986351.html
Sent from the Solr - User mailing list archive at Nabble.com.
56 matches
Mail list logo