You can create your own "update processor" that gets control between the
output of Tika and the indexing of the document.
See:
http://wiki.apache.org/solr/UpdateRequestProcessor
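A minimal sketch of the kind of field cleanup such a processor might do, written as plain Java so it runs without Solr on the classpath. The class name FieldNormalizer and its normalize method are hypothetical and not part of Solr. In a real processor the same logic would presumably sit in a processAdd override of an UpdateRequestProcessor subclass, before the document is handed to the next processor in the chain. The trim-and-lowercase rule is only one example of normalization.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not a Solr class: shows the post-Tika cleanup step only.
class FieldNormalizer {

    // Trims and lower-cases every extracted value, then lets explicitly
    // passed literal values replace extracted values for the same field.
    static Map<String, String> normalize(Map<String, String> extracted,
                                         Map<String, String> literals) {
        Map<String, String> out = new HashMap<>();
        for (Map.Entry<String, String> e : extracted.entrySet()) {
            String value = e.getValue();
            if (value != null) {
                out.put(e.getKey(), value.trim().toLowerCase(Locale.ROOT));
            }
        }
        // Literals are copied unchanged and win over extracted values.
        out.putAll(literals);
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> extracted = new HashMap<>();
        extracted.put("author", "  Jane DOE ");
        extracted.put("title", "Untitled");

        Map<String, String> literals = new HashMap<>();
        literals.put("title", "Quarterly Report");

        System.out.println(normalize(extracted, literals));
    }
}
```

Keeping the rule in one small method makes it easy to test on its own before wiring it into an update chain.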
-- Jack Krupansky
-Original Message-
From: Raphaël
Sent: Sunday, May 27, 2012 6:24 PM
To: solr-user@lucene
Yes, I am also interested in good performance with 2 billion docs. How
many search nodes do you use? What are the average response time and
QPS?
another question: where can I find related paper or resources of your
algorithm which explains the algorithm in detail? why it's better than
google site(b
Hi,
I use Tika through the Solr ExtractingRequestHandler and I face a very
common use case, namely postprocessing fields from Tika in order to normalize
their values or override them with explicitly passed "literal" values.
With the exception of some vague statements about "ContentHandler", I
failed
Please start separate threads for separate questions; you have a better
chance of getting an answer.
On Sun, May 27, 2012 at 1:55 PM, Lance Norskog wrote:
> Please start separate threads for separate q
>
> On Fri, May 25, 2012 at 6:30 AM, Nicholas Ball
> wrote:
>>
>> Hey all,
>>
>> I have another q
Please start separate threads for separate q
On Fri, May 25, 2012 at 6:30 AM, Nicholas Ball
wrote:
>
> Hey all,
>
> I have another question with regards to this thread.
>
> Does anyone know what the state is of the rollback command in 4.0 and how
> it works with both replicas (i.e. distributed r
The Solr 3.x branch ended development with 3.6. So, there won't be a
3.7 and there's no point doing infrastructure work like this.
On Fri, May 25, 2012 at 9:23 AM, William Bell wrote:
> Let's just wait until SOLR 4.0 is out in a couple months.
>
> On Fri, May 25, 2012 at 9:06 AM, Maciej Lisiewski
Let me see if I can reference the performance with RankingAlgorithm.
Regards,
Nagendra Nagarajayya
http://solr-ra.tgels.org
http://rankingalgorithm.tgels.org
On 5/27/2012 10:06 AM, Darren Govoni wrote:
I think people on this list would be more interested in your approach to
scaling 2 billion
Is there a simple way to get solr to maintain shards as rolling partitions by
date, e.g., the last day's documents in one shard, the week before yesterday
in the next shard, the month before that in the next shard, and so on? I
really don't need querying to be fast on the entire index, but it is
cr
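One way to read the layout described above, as a sketch only: route each document to a shard by its age in days. The shard names, the 30-day length of the "month" bucket, and the RollingShards class are all assumptions for illustration. Nothing here is a built-in Solr feature, and documents would still have to be moved between shards as they age.

```java
// Hypothetical routing helper, not a Solr API.
class RollingShards {

    // Maps a document's age in whole days to an assumed shard name.
    //   0        -> the last day
    //   1..7     -> the week before yesterday
    //   8..37    -> the (assumed 30-day) month before that
    //   38+      -> everything older
    static String shardFor(long ageInDays) {
        if (ageInDays < 0) {
            throw new IllegalArgumentException("age must not be negative");
        }
        if (ageInDays < 1) {
            return "shard-day";
        }
        if (ageInDays < 8) {
            return "shard-week";
        }
        if (ageInDays < 38) {
            return "shard-month";
        }
        return "shard-older";
    }

    public static void main(String[] args) {
        long[] ages = {0, 3, 20, 400};
        for (long age : ages) {
            System.out.println(age + " days -> " + shardFor(age));
        }
    }
}
```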
I think people on this list would be more interested in your approach to
scaling 2 billion documents than modifying solr/lucene scoring (which is
already top notch). So given that, can you share any references or
otherwise substantiate good performance with 2 billion documents?
Thanks.
On Sun, 20
Solr with RankingAlgorithm is free (for test/development/production).
Here is the license page
http://tgels.com/docs/SoftwareLicenseAgreement.txt.
RankingAlgorithm is also free (for test/development/production).
Regards,
- NN
On 5/27/2012 8:57 AM, Radim Kolar wrote:
My company is thinking t
On Sun, May 27, 2012 at 12:42 PM, Radim Kolar wrote:
>> What reference page are you referring to?
>
> http://tgels.com/wiki/en/Sites_using/downloaded_RankingAlgorithm_or_Solr-RA
Ah, ok "sites using/downloaded"
So someone with a .oracle email / domain checked it out - that
certainly doesn't me
What reference page are you referring to?
http://tgels.com/wiki/en/Sites_using/downloaded_RankingAlgorithm_or_Solr-RA
On Sun, May 27, 2012 at 11:57 AM, Radim Kolar wrote:
> but I see RankingAlgorithm has fantastic results too and looking at its
> reference page it even powers sites like oracle.com and ebay.com.
What reference page are you referring to?
-Yonik
http://lucidimagination.com
My company is thinking of buying a search algorithm from a famous search
expert, Petr Hejl - http://www.milionovastranka.net/
but I see RankingAlgorithm has fantastic results too and, looking at its
reference page, it even powers sites like oracle.com and ebay.com.
Actually, RankingAlgorithm 1.4.2 has been scaled to more than 2 billion
docs. With RankingAlgorithm 1.4.3, using the age=latest&docs=number
parameters, you can retrieve the NRT inserted
documents in milliseconds from such a huge index improving query and
faceting performance and using v
Hi,
Have you tested this with a billion documents?
Darren
On Sun, 2012-05-27 at 07:24 -0700, Nagendra Nagarajayya wrote:
> Hi!
>
> I am very excited to announce the availability of Solr 3.6 with
> RankingAlgorithm 1.4.2.
>
> This NRT support now works with both RankingAlgorithm and Lucene.
Hi!
I am very excited to announce the availability of Solr 3.6 with
RankingAlgorithm 1.4.2.
This NRT support now works with both RankingAlgorithm and Lucene. The
insert/update performance should be about 5000 docs in about 490 ms with
the MbArtists Index.
RankingAlgorithm 1.4.2 has multip
Wait, I thought the fuzzy match is invoked with a ~. I am not invoking any ~
but expanding my query terms with the synonyms at query time. So from what I
understand, when I query for James, internally, Solr would expand using
synonym search to James, Jim, Games, Jameson. So I guess, the original
i