Re: Document Update performances Improvement

2019-10-23 Thread Nicolas Paris
what is your current performance? > > Once this is clear further architecture aspects can be derived, such as > number of spark executors, number of Solr instances, sharding, replication, > commit timing etc. > > > Am 19.10.2019 um 21:52 schrieb Nicolas Paris : > >

Re: POS Tagger

2019-10-25 Thread Nicolas Paris
ey.lorberf...@ibm.com wrote: > > > > > Hi All, > > > > > > Does anyone use a POS tagger with their Solr instance other than > > > OpenNLP’s? We are considering OpenNLP, SpaCy, and Watson. > > > > > > Thanks! > > > > > > -- > > > Audrey Lorberfeld > > > Data Scientist, w3 Search > > > IBM > > > audrey.lorberf...@ibm.com > > > > > > > > > > > > -- nicolas

Re: POS Tagger

2019-10-25 Thread Nicolas Paris
ey Lorberfeld - audrey.lorberf...@ibm.com wrote: > Nicolas, > > Do you use the POS tagger at query time, or just at index time? > > We are thinking of using it to filter the tokens we will eventually perform > ML on. Basically, we have a bunch of acronyms in our corpus. Howev

Re: POS Tagger

2019-10-25 Thread Nicolas Paris
solr/guide/7_3/language-analysis.html#opennlp-part-of-speech-filter On Fri, Oct 25, 2019 at 06:25:36PM +0200, Nicolas Paris wrote: > > Do you use the POS tagger at query time, or just at index time? > > I have the POS tagger pipeline ready but nothing done yet on the solr > part. Rig

Re: Solr Ref Guide Changes - now HTML only

2019-10-28 Thread Nicolas Paris
within the html documentation I admit. Hope this is not solr under the hood :S -- nicolas

CloudSolrClient - basic auth - multi shard collection

2019-11-18 Thread Nicolas Paris
have created a Premptive Auth Interceptor which should add the credential information for every http call. Thanks for any pointer, solr:8.1 spring-data-solr:4.1.0 -- nicolas

Re: CloudSolrClient - basic auth - multi shard collection

2019-11-20 Thread Nicolas Paris
2 - both of those > bugs are fixed in that version. > > Hope that helps, > > Jason > > > On Mon, Nov 18, 2019 at 8:26 AM Nicolas Paris > wrote: > > > > Hello, > > > > I am having trouble with basic auth on a solrcloud instance. When the >

Re: A Last Message to the Solr Users

2019-12-01 Thread Nicolas Paris
>> > need me. I was never actually part of the package. To the many, many > >> people > >> > that offered me private notes of encouragement and future help - thank > >> you > >> > so much. Your help will be needed. > >> > > >> > You will reset. You will fix this. Or I will be back. > >> > > >> > Mark > >> > > >> > > >> > -- > >> > - Mark > >> > > >> > http://about.me/markrmiller > >> > > -- > > - Mark > > > > http://about.me/markrmiller > > > -- > - Mark > > http://about.me/markrmiller -- nicolas

does copyFields increase indexe size ?

2019-12-24 Thread Nicolas Paris
ndexed but not stored. This is a great surprise but is this behavior expected ? -- nicolas

Re: does copyFields increase indexe size ?

2019-12-24 Thread Nicolas Paris
On Tue, Dec 24, 2019 at 10:59:03AM -0700, Shawn Heisey wrote: > On 12/24/2019 10:45 AM, Nicolas Paris wrote: > > From my understanding, copy fields creates an new indexes from the > > copied fields. > > From my tests, I copied 1k textual fields into _text_ with copyFields. &g

Re: does copyFields increase indexe size ?

2019-12-25 Thread Nicolas Paris
e same ! (while the _text_ field is working correctly) On Tue, Dec 24, 2019 at 05:32:09PM -0700, Shawn Heisey wrote: > On 12/24/2019 5:11 PM, Nicolas Paris wrote: > > Do you mean "copy fields" is only an action of changing the schema ? > > I was thinking it was adding a

Re: does copyFields increase indexe size ?

2019-12-25 Thread Nicolas Paris
ith/without the _text_ field > > > On Dec 25, 2019, at 3:07 AM, Nicolas Paris wrote: > > > >  > >> > >> If you are redoing the indexing after changing the schema and > >> reloading/restarting, then you can ignore me. > > > > I am s

Re: does copyFields increase indexe size ?

2019-12-26 Thread Nicolas Paris
Anyway, that´s good news copy field does not increase indexe size in some circumstance: - the copied fields and the target field share the same datatype - the target field is not stored this is tested on text fields On Wed, Dec 25, 2019 at 11:42:23AM +0100, Nicolas Paris wrote: > > On We

Re: does copyFields increase indexe size ?

2019-12-26 Thread Nicolas Paris
parate part of the relevant files (.tim, .pos, > etc). Term frequencies are kept on a _per field_ basis for instance. > > So this pretty much has to be small sample size or other measurement error. > > Best, > Erick > > > On Dec 26, 2019, at 9:27 AM, Nicolas Paris wrote:

Re: does copyFields increase indexe size ?

2019-12-28 Thread Nicolas Paris
behavior is perfect for my needs. On Fri, Dec 27, 2019 at 05:28:25PM -0700, Shawn Heisey wrote: > On 12/26/2019 1:21 PM, Nicolas Paris wrote: > > Below a part of the managed-schema. There is 1k section* fields. The > > second experience, I removed the copyField, droped the collect

replica's of same shard have different file contents

2020-01-14 Thread Nicolas Franck
I noticed a - in my opinion - strange behavior in Solr Cloud. I have a collection that has 1 shard and two replica's. When I look at the directory structure, both have the same file names in "data/index" .. BUT the contents of those files are different. So when I query this collection, and sort

Re: Coming back to search after some time... SOLR or Elastic for text search?

2020-01-16 Thread Nicolas Paris
e low (max 5/second at peak). > We have implemented the content ingestion and processing pipelines already > in python and SPARK, so most of the data will be pushed in using APIs. > > I would really appreciate any guidance from the community !! -- nicolas

Re: Getting error "Bad Message 414 reason: URI Too Long"

2021-01-14 Thread Nicolas Franck
I believe you can also access this path in a HTTP POST request. That way you do no hit the URI size limit cf. https://stackoverflow.com/questions/2997014/can-you-use-post-to-run-a-query-in-solr-select I think some solr libraries already use this approach (e.g. WebService::Solr in perl) On 14

Re: Getting error "Bad Message 414 reason: URI Too Long"

2021-01-14 Thread Nicolas Franck
Euh, sorry: I did not read your message well enough. You did actually use a post request, with the parameters in the body (your example suggests otherwise) > On 14 Jan 2021, at 10:37, Nicolas Franck wrote: > > I believe you can also access this path in a HTTP POST request. > That w

Solr Benchmarks

2006-11-06 Thread Nicolas St-Laurent
Hello, Is there any Solr benchmarks available somewhere ? I would like to know how well it performs. I understand that it depends on the hardware config and on the application server used. Just to got an idea... Thank you, Nicolas

Re: Solr Benchmarks

2006-11-06 Thread Nicolas St-Laurent
The performance data on the wiki (http://wiki.apache.org/solr/ SolrPerformanceData) are a little short to get a good idea. Le 06-11-06 à 09:28, Nicolas St-Laurent a écrit : Hello, Is there any Solr benchmarks available somewhere ? I would like to know how well it performs. I understand

Re: Solr Benchmarks

2006-11-06 Thread Nicolas St-Laurent
of execution. It's a good idea to user JMeter to get a performance report. I will try this. Nicolas

Re: Solr Benchmarks

2006-11-06 Thread Nicolas St-Laurent
Le 06-11-06 à 12:21, Kevin Lewandowski a écrit : As of today Solr is running under Tomcat on a single dedicated box. It's a 2.66Ghz P4, with 1 gig ram. The index has about 1.2 million documents and is 1.2 gigs in size. This machine handles 250,000 queries per day with no problem. CPU load stays

Solr and Oracle

2006-11-23 Thread Nicolas St-Laurent
e can give me a hint on how to do this, I will appreciate. Thanks, Nicolas St-Laurent

Re: Solr and Oracle

2006-11-24 Thread Nicolas St-Laurent
Thank you Bertrand. The documentation on Solr is still sparse. I've already looked in SolrResources, find some idea, but not exactly what I need. When my solution will work, I will document it into the wiki. Nicolas Le 06-11-24 à 02:48, Bertrand Delacretaz a écrit : On 11/

Solr index on Amazon S3

2012-09-05 Thread Nicolas de Saint-Aubert
Hi, We currently share a single solr read index on an nfs accessed by various solr instances from various devices which gives us a high performant cluster framework. We would like to migrate to Amazon or other cloud. Is there any way (compatibility) to have solr index on Amazon S3 file cloud syste

<    1   2