Re: Phrase Query Issue

2009-04-03 Thread Nicolas
ed as plus (/+/) signs." Nicolas. dabboo a écrit : Hi, I am implementing phrase search query using my application. I am trying to search "Glorious Revolution", I am expecting all the records which contains this as whole string in any of its field should display. Below is my

empty index, with simple texts

2015-03-13 Thread Nicolas Turenne
//txt/Ovid//*.txt 5 files seems to be indexed but searching by keywords gives nothing when i type someting in http://localhost:8983/solr/tika/browse?q= anyone can help me ? thank you Nicolas --

Performance questions

2017-11-28 Thread Nicolas Bélisle
ork : http://events.linuxfoundation.org/sites/events/files/slides/HighPerformanceSolr.pdf Nicolas

SOLR Atomic update of custom stored metadata clears full-text index! How to add metadata without losing full-text search

2017-03-08 Thread Nicolas Bouillon
Dear SOLR friends, I developed a small ERP. I produce PDF documents linked to objects in my ERP: invoices, timesheets, contracts, etc... I have also the possibility to attach documents to a particular object and when I view an invoice for instance, I can see the attached documents. Until now, I

Re: SOLR Atomic update of custom stored metadata clears full-text index! How to add metadata without losing full-text search

2017-03-08 Thread Nicolas Bouillon
MMV of course). Without any particular > tuning. Without sharding. Very often the most expensive part of > indexing is acquiring the data in the first place, i.e. getting it > from a DB or extracting it from Tika. Solr will handle quite a load. > > And, if you're using the Extract

Re: SOLR Atomic update of custom stored metadata clears full-text index! How to add metadata without losing full-text search

2017-03-08 Thread Nicolas Bouillon
ou've mentioned so far looks like it should take > anything except getting your configurations to be what you need, so > don't make more work for yourself than you need to ;). > > After that, see the link Shawn provided... > > Best, > Erick > > On Wed, Mar 8, 201

Re: SOLR Atomic update of custom stored metadata clears full-text index! How to add metadata without losing full-text search

2017-03-08 Thread Nicolas Bouillon
Guys A BIG thank you, it works perfectly!!! After so much research I finally got my solution working. That was the trick, _text_ is stored and it’s working as expected. Have a very nice day and thanks a lot for your contribution. Really appreciated Nico > On 8 Mar 2017, at 18:26, Nico

How to get the fields that match the request?

2011-09-22 Thread Nicolas Martin
Hi everyBody, I need your help to get more information in my solR query's response. i've got a simple input text which allows me to query several fields in the same query. So my query looks like this "q=email:martyn+OR+name:martynn+OR+commercial:martyn ..." Is it possible in the response to

Re: How to get the fields that match the request?

2011-09-22 Thread Nicolas Martin
lts to the default search field, but it would helpful to know what field(s) match the query term. On Thu, Sep 22, 2011 at 3:29 AM, Nicolas Martinwrote: Hi everyBody, I need your help to get more information in my solR query's response. i've got a simple input text which allows

A fieldType for a address street

2011-09-23 Thread Nicolas Martin
Hi solR users! I'd like to make research on my client database, in particular, i need to find client by their address (ex : "100 avenue des champs élysée") Does anyone know a good fieldType to store my addresses to enable me to search client by address easily ? thank you all On 23/09/2

A fieldType for a address street

2011-09-23 Thread Nicolas Martin
Hi solR users! I'd like to make research on my client database, in particular, i need to find client by their address (ex : "100 avenue des champs élysée") Does anyone know a good fieldType to store my addresses to enable me to search client by address easily ? thank you all

spatial search performance - latlontype vs pointtype?

2012-02-01 Thread Nicolas Flacco
I've switched my index to use pointtype instead of latlontype of spatial search queries. Unfortunately I'm seeing much worse performance, and I was wondering if anybody else knew of any issues between the two types. I would expect a flat space calculation of pointtype to be better than the spher

Re: Improving performance for SOLR geo queries?

2012-02-08 Thread Nicolas Flacco
I compared locallucene to spatial search and saw a performance degradation, even using geohash queries, though perhaps I indexed things wrong? Locallucene across 6 machines handles 150 queries per second fine, but using geofilt and geohash I got lots of timeouts even when I was doing only 50 querie

solr warmup and reading the index into memory on startup?

2012-02-24 Thread Nicolas Flacco
I'm seeing some problems warming up solr on startup. Currently warmup consists of two parts- running queries on startup programmatically, and then running a script to perform queries. The programmatic warmup seems to warm up Solr fine in terms of making queries via the Solr admin tool, but when I d

Extract terms of a query to do highlighting

2012-03-16 Thread Nicolas Labrot
t with the Analyzer. I do not find any methods to do it. When I was using pure Lucene I was using Query.extractTerms. Is there any methods to answer my need ? Thanks for your help, Nicolas

SOLR-792 (hierarchical faceting) issue when only 1 document should be present in the pivot

2010-11-24 Thread Nicolas Peeters
can I make sure that Solr would find in the tree the single document when I facet on this "location" hierarchy? Thank you very much for your help. Nicolas import java.io.IOException; import java.net.MalformedURLException; import java.util.ArrayList; import java.util.List; import java.u

Re: SOLR-792 (hierarchical faceting) issue when only 1 document should be present in the pivot

2010-11-24 Thread Nicolas Peeters
ch Centre (2) level3_loc_s=Value_that_cant_be_matched (1) level1_loc_s=Groot Amsterdam (5) level2_loc_s=Amsterdam (3) level3_loc_s=Jordaan (2) level3_loc_s=Centrum (1) level2_loc_s=Amstelveen (2) level3_loc_s=Centrum (2) Any expert advice on why this is the case is more than we

Download of old solr releases

2012-07-25 Thread Nicolas Dietrich
Hi there, it looks like the old releases have been thrown out of the download servers, for example http://apache.mirrors.tds.net/lucene/solr/1.4.1/apache-solr-1.4.1.tgz Is this on purpose or a mistake, or have I overseen something? Thanks for clarification. Cheers, Nicolas

Re: Download of old solr releases

2012-07-25 Thread Nicolas Dietrich
files that most people > don't care about. > > All historic apache releases are available from the apache archive... > > http://lucene.apache.org/solr/downloads.html > http://archive.apache.org/dist/lucene/solr/ Oh right, I overlooked that. Thanks for telling! Nicolas

Iso accents and wildcards

2009-10-30 Thread Nicolas Leconte
Hi all, I have a field that contains accentuated char in it, what I whant is to be able to search with ignore accents. I have set up that field with : generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1" /> words="stopwords.txt" /> In

Re: Iso accents and wildcards

2009-11-01 Thread Nicolas Leconte
nge, mangé mang -> mange, manger mangu-> mangue mang*-> manger, mangé, mangue mang?-> mangue (and not mangé) mangé* -> nothing Jean-François - "Nicolas Leconte" a écrit : | Hi all, | | I have a field that contains accentuated char in

Re: Iso accents and wildcards

2009-11-01 Thread Nicolas Leconte
t; as an untokenized field. While querying, you can use both the fields at the same time. e.g. q=(title:écon* title_orginal:écon*). In any case, you would get desired matches. Cheers Avlesh On Fri, Oct 30, 2009 at 9:19 PM, Nicolas Leconte wrote: Hi all, I have a field that contains accentuate

Re: Highlighting is very slow

2009-11-09 Thread Nicolas Dessaigne
Hi Andrew, Alternatively, you could use a copyfield with a maxChars limit as your highlighting field. Works well in my case. See https://issues.apache.org/jira/browse/SOLR-538 Nicolas 2009/11/5 Andrew Clegg > > > Indeed -- it actually went slightly slower but only by a few s

Re: Highlighting is very slow

2009-11-10 Thread Nicolas Dessaigne
7;t find all documents. I think that a pretty large copyfield (maybe 50k?) is usually enough for most documents to be highlighted, but that depends on your corpus. Good luck ;) Nicolas 2009/11/9 Andrew Clegg > > > Nicolas Dessaigne wrote: > > > > Alternatively, you could

QueryElevationComponent : hot update of elevate.xml

2009-04-10 Thread Nicolas Pastorino
t whether this would be possible, how this would be achievable ( which type of plugin for instance ) or even be valid ? Thanks a lot in advance for your thoughts, -- Nicolas

Re: QueryElevationComponent : hot update of elevate.xml

2009-05-06 Thread Nicolas Pastorino
Hi, On Apr 10, 2009, at 16:51 , Ryan McKinley wrote: On Apr 10, 2009, at 7:48 AM, Nicolas Pastorino wrote: Hello ! Browsing the mailing-list's archives did not help me find the answer, hence the question asked directly here. Some context first : Integrating Solr with a CMS

Re: QueryElevationComponent : hot update of elevate.xml

2009-05-06 Thread Nicolas Pastorino
for the current core, as you suggested above, to reload the Elevate configuration. -- Nicolas On Fri, Apr 10, 2009 at 5:18 PM, Nicolas Pastorino wrote: Hello ! Browsing the mailing-list's archives did not help me find the answer, hence the question asked directly here. Some co

Re: QueryElevationComponent : hot update of elevate.xml

2009-05-06 Thread Nicolas Pastorino
levation system, and then did not remove this flexibility. Thanks for your expert eye on this ! On Wed, May 6, 2009 at 6:43 PM, Nicolas Pastorino wrote: Hello, On May 6, 2009, at 15:02 , Noble Paul നോബിള്‍ नोब्ळ् wrote: The elevate.xml is loaded from conf dir when the core is reloaded

Re: large index vs multicore

2009-05-07 Thread Nicolas Pastorino
fficiently distinct (this also means 2 queries and manual merging/joining). Where would this manual merging/joining occur? At the client-side or inside Solr, before returning the results ? I was wondering what relevancy, sorting, etc. would become. -- Nicolas Otis -- Sematext -- http://se

Re: QueryElevationComponent : hot update of elevate.xml

2009-05-12 Thread Nicolas Pastorino
tips and recommendations. I attached a new version of this requestHandler here : https:// issues.apache.org/jira/browse/SOLR-1147. Would this requestHandler be of any general use and could be part of Solr's trunk ? Thanks in advance, -- Nicolas Pastorino - eZ Labs On Wed, May 6, 2009

Re: QueryElevationComponent : hot update of elevate.xml

2009-05-20 Thread Nicolas Pastorino
Hi, On May 12, 2009, at 12:33 , Nicolas Pastorino wrote: Hi, On May 7, 2009, at 6:03 , Noble Paul നോബിള്‍ नोब्ळ् wrote: going forward the java based replication is going to be the preferred means replicating index. It does not support replicating files in the dataDir , it only supports

Yankee's Solr integration

2010-01-06 Thread Nicolas Kern
appy new year ! Nicolas

Save a search

2010-01-21 Thread Nicolas Kern
Hi, Is there a way to save a search performed on Solr, like it's possible to do on here : http://www.yankeegroup.com/search.do?searchType=advancedSearch ? I'd like to add the same functions to my company's Solr. Best Regards and thanks, Nicolas

[JOB] Full-time opportunity in Paris, France

2007-09-26 Thread nicolas . dessaigne
ning and NLP. Familiarity with C++, SOLR and Eclipse is also desired. If you are available and interested, please contact me directly at nicolas.dessaigne_at_arisem.com Nicolas Dessaigne Chief Technical Officer ARISEM

Proposition of a new feature: Dynamic Field Types

2008-02-29 Thread nicolas . dessaigne
uot;language". It is configured with a config similar to the following: ... ... The last condition is used as a catch-all if preceding conditions are not met. What do you think of this feature? Best regards, Nicolas Dessaigne

RE: Proposition of a new feature: Dynamic Field Types

2008-02-29 Thread nicolas . dessaigne
de me think a bit more on multilingual problems. There may be a way to tune the idf of some fields depending on others? Another idea would be to boost documents in the language of the request. This may be actually much simpler. If you have any idea on the subject I'm very interested! Nicolas

RE: Proposition of a new feature: Dynamic Field Types

2008-03-03 Thread nicolas . dessaigne
to the O(log n) complexity of queries: a query on a index of 10,000,000 docs is only slightly slower than the same query on a index of 5,000,000 docs. Thanks for your advice, I'll try to report back on our tests. Nicolas -Message d'origine- De : Grant Ingersoll [mailto:[EMAIL

RE: Proposition of a new feature: Dynamic Field Types

2008-03-03 Thread nicolas . dessaigne
fied problem of cross-lingual retrieval in Lucene. Sorry if it is not as clear as I would like (English is not my mother tongue). Nicolas -Message d'origine- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] De la part de Yonik Seeley Envoyé : dimanche 2 mars 2008 03:45 À : solr-u

RE: Illegal xml/html character; unicode problems near solr

2008-03-07 Thread nicolas . dessaigne
I think Tomcat defaults to the operating system default, e.g. cp1252 on a classic windows. You need to add an attribute URIEncoding="UTF-8" to the Connector you use in the server.xml conf. Nicolas -Message d'origine- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] De l

Slow Highlighting -> CopyField maxSize property

2008-03-19 Thread nicolas . dessaigne
course, if the only terms that are matched by a query are after the limit, no highlight is possible. What do you think of this feature? Best regards, Nicolas

RE: Language support

2008-03-20 Thread nicolas . dessaigne
You may be interested in a recent discussion that took place on a similar subject: http://www.mail-archive.com/solr-user@lucene.apache.org/msg09332.html Nicolas -Message d'origine- De : David King [mailto:[EMAIL PROTECTED] Envoyé : mercredi 19 mars 2008 20:07 À : solr

RE: Slow Highlighting -> CopyField maxSize property

2008-03-25 Thread nicolas . dessaigne
Hi Koji, It needs a bit of polishing first, but we'll provide a patch if you're interested. I'll keep you informed as soon as it is available. Nicolas -Message d'origine- De : Koji Sekiguchi [mailto:[EMAIL PROTECTED] Envoyé : vendredi 21 mars 2008 16:50 À : solr-us

RE: Slow Highlighting -> CopyField maxSize property

2008-04-15 Thread Nicolas DESSAIGNE
Koji, The patch is now available at https://issues.apache.org/jira/browse/SOLR-538 Tell me if it fits your needs. Nicolas -Message d'origine- De : Koji Sekiguchi [mailto:[EMAIL PROTECTED] Envoyé : vendredi 21 mars 2008 16:50 À : solr-user@lucene.apache.org Objet : Re: Slow Highlig

DataImportHandler : How to mix XPathEntityProcessor and TemplateTransformer

2008-06-10 Thread Nicolas Pastorino
lus the full examples ( slashdot RSS feed ). I would gladly have feedback from anyone who tried to index HTTP/XML sources, and got it to work smoothly. Thanks a million in advance, Regards, Nicolas -- Nicolas Pastorino eZ Systems ( Western Europe ) | http://ez.no

Re: DataImportHandler : How to mix XPathEntityProcessor and TemplateTransformer

2008-06-10 Thread Nicolas Pastorino
a lot for your time already ! Regards, Nicolas Pastorino On Jun 10, 2008, at 14:55 , Noble Paul നോബിള്‍ नोब्ळ् wrote: The configuration is fine but for one detail The documents are to be created for the entity 'oldsearchcontent' not for the root entity . so add an attribute roo

Re: DataImportHandler : How to mix XPathEntityProcessor and TemplateTransformer

2008-06-10 Thread Nicolas Pastorino
more Regards, Nicolas Pastorino On Jun 10, 2008, at 17:38 , Nicolas Pastorino wrote: Thanks a lot, it works fine now, fetching subelements properly. The only issue left is that the XPath syntax passed in the data- config.xml does not seem to work properly. As an example, processing the

Re: DataImportHandler : How to mix XPathEntityProcessor and TemplateTransformer

2008-06-11 Thread Nicolas Pastorino
o have the TemplateTransformer to not throw Null Pointer exceptions :) Cheers ! -- Nicolas Pastorino On Jun 10, 2008, at 18:05 , Noble Paul നോബിള്‍ नोब्ळ् wrote: It is a bug, nice catch there needs to be a null check there in the method can us just try replacing the method with the following? pr

RE: never desallocate RAM...during search

2008-06-20 Thread Nicolas DESSAIGNE
problem. Nicolas -Message d'origine- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Envoyé : mercredi 18 juin 2008 15:49 À : solr-user@lucene.apache.org Objet : RE: never desallocate RAM...during search Hi Otis, Thank you for your attention. I've read for days the mail list

Synonyms relationships

2018-10-31 Thread Nicolas Paris
ng the terms of the research or not. Thanks , -- nicolas

Pagination Graph/SQL

2018-12-02 Thread Nicolas Paris
bove SQL : SELECT id FROM table LIMIT 10 would both return 10 id's together with the number of document within the table. Thanks, -- nicolas

Highlighting Parent Documents

2018-12-09 Thread Nicolas Paris
ound":1,"start":0,"docs":[ { "id":"1"}] }, "highlighting":{ "2":{ "content_txt":["apache"]}}} [1] http://lucene.472066.n3.nabble.com/Fwd-Standard-highlighting-doesn-t-work-for-Block-Join-td4260784.html [2] http://lucene.472066.n3.nabble.com/highlighting-on-child-document-td4238236.html Thanks by advance, -- nicolas

Search only for single value of Solr multivalue field (part 2)

2018-12-16 Thread Nicolas Paris
]: http://lucene.472066.n3.nabble.com/Search-only-for-single-value-of-Solr-multivalue-field-td4309850.html#a4309893 -- nicolas

Re: Search only for single value of Solr multivalue field (part 2)

2018-12-16 Thread Nicolas Paris
e field. Problem with copyField from multiple fields acts as a MV field. So the problem remains: dealing with MV fields. Isn't ? Thanks -- nicolas

Re: Search only for single value of Solr multivalue field (part 2)

2018-12-17 Thread Nicolas Paris
he MV limitation a *solr parser* limitation, or a *lucene* limitation. If it is the latter, writing my own parser would be an option isn't ? -- nicolas

MoreLikeThis & Synonyms

2018-12-26 Thread Nicolas Paris
together with the MoreLikeThis handler. Is there any reason for the MoreLikeThis feature not compatible with Multiword Synonyms ? Thanks -- nicolas

Re: MoreLikeThis & Synonyms

2018-12-27 Thread Nicolas Paris
onyms. So it is not possible to use query time synonyms nor mutli-word. Only index time is possible with the limitations it has [1] > What version of Solr are you using? I am running solr 7.6. [1] https://lucidworks.com/2017/04/18/multi-word-synonyms-solr-adds-query-time-support/ -- nicolas

edismax: sorting on numeric fields

2019-02-14 Thread Nicolas Paris
be kind:animal, weight:xxx How to deal with numeric fields ? Thanks -- nicolas

Re: edismax: sorting on numeric fields

2019-02-16 Thread Nicolas Paris
tanding your question here. if your query is > > q=kind:animal weight:50 you will get no results, as nothing matches > > (assuming a q.op of AND) > > > > > > On Thu, Feb 14, 2019 at 4:06 PM Nicolas Paris > > wrote: > > > > > Hi > > > > &g

Re: edismax: sorting on numeric fields

2019-02-17 Thread Nicolas Paris
which is a whole different animal and something I don’t > > think many have experience with, including myself > > > >> On Feb 16, 2019, at 10:10 AM, Nicolas Paris > >> wrote: > >> > >> Hi > >> > >> Thanks. > >> To clarify, I

"dismax" parameter "bq" filters instead of boosting

2019-03-05 Thread Nicolas Franck
I noticed a change in the behaviour of the regular "dismax" parser. At least in version 7.4: when you add "bq", it filters the results (like "fq" does), instead of boosting the matches. e.g. defType=dismax bq=format:periodical^30 gives only records with format "periodical". removing the param

Collection API Snapshots and Restore

2018-02-09 Thread Nicolas Bélisle
s XML description and manually delete files not part of that snapshot on each core. Is there another solution ? Nicolas

Re: Collection API Snapshots and Restore

2018-02-11 Thread Nicolas Bélisle
to this strategy ? - Could it be done without shutting down Solr ? * If so, we would like to contribute this feature. Regards, Nicolas On Sat, Feb 10, 2018 at 1:33 PM, Erick Erickson wrote: > I really have no idea how that would work. If you don't copy the index > somewhere, you s

query bag of word with negation

2018-04-22 Thread Nicolas Paris
Hello I wonder if there is a plain text query syntax to say: give me all document that match: wonderful pizza NOT peperoni all those in a 5 distance word bag then pizza are wonderful -> would match I made a wonderful pasta and pizza -> would match Peperoni pizza are so wonderful -> would not ma

Re: query bag of word with negation

2018-04-22 Thread Nicolas Paris
1. Query terms containing other than just letters or digits may be placed >> within double quotes so that those other characters do not separate a term >> into many terms. A dot (period) and white space are neither letter nor >> digit. Examples: "Now is the time for all good men" (spaces, quote

Re: query bag of word with negation

2018-04-22 Thread Nicolas Paris
w=false" in order to combine with multi-word synonyms ? 2018-04-22 21:11 GMT+02:00 Markus Jelsma : > Hello Nicolas, > > Yes you can! Check out ComplexPhaseQParser > https://lucene.apache.org/solr/guide/6_6/other-parsers.html#OtherParsers- > ComplexPhraseQueryParser > > R

Re: Is anybody using UIMA with Solr?

2018-06-19 Thread Nicolas Paris
Hi Not realy a direct answer - Never used it, however this feature have been attractive to me while first looking at uima. Right now, I would say UIMA connectors in general are by design a pain to maintain. Source and target often do have optimised way to bulk export/import data. For example, usi

Re: Is anybody using UIMA with Solr?

2018-06-19 Thread Nicolas Paris
sorry thought I was on UIMA mailing list. That being said, my position is the same : let UIMA folks load data into SolR by using the most optimized way. (what would be the best way ? Loading jsons ?) 2018-06-19 22:48 GMT+02:00 Nicolas Paris : > Hi > > Not realy a direct answer - Neve

Re: Highlighting the search keywords

2018-07-31 Thread Nicolas Franck
Nope, that is how it works. It is not in place. > On 31 Jul 2018, at 21:57, Renuka Srishti wrote: > > Hi All, > > I was using highlighting in solr, solr gives highlighting results within > the response but not included within the documents. > Am i missing something? Can i configure so that it c

Re: Storage/Volume type for Kubernetes Solr POD?

2020-02-07 Thread Nicolas PARIS
and any files transmitted with it are confidential and > may be legally privileged, and intended solely for the use of the individual > or entity to whom they are addressed. If you have received this email in > error please notify the sender. This email message has been swept for the > presence of computer viruses. -- nicolas paris

use highlighting on multivalued fields with positionIncrementGap 0

2020-02-14 Thread Nicolas Franck
I'm trying to use highlighting on a multivalued text field (analysis not so important) .. { text: [ "hello", "world" ], id: 1 } but I want to match across the string boundaries: q=text:"hello world" This works by setting the attribute positionIncrementGap to 0, but then the hightlighting

Re: Lemmatizer for Solr

2020-02-14 Thread Nicolas Franck
Try also looking at the HunspellFilter: https://lucene.apache.org/solr/guide/6_6/filter-descriptions.html dictionaries ( .dic and .aff ) can be found here: https://cgit.freedesktop.org/libreoffice/dictionaries or via the git repo: https://anongit.freedesktop.org/git/libreoffice/dictionaries.gi

Re: A question about solr filter cache

2020-02-17 Thread Nicolas Franck
If 1GB would make solr go out of memory by using a filter query cache, then it would have already happened during the initial upload of the solr documents. Imagine the amount of memory you need for one billion documents.. A filter cache would be the least of your problems. 1GB is small in comparis

multivalue faceting term optimization

2020-03-09 Thread Nicolas Paris
faster performances for the brute task, I guess I could artificially limit the FQ under 2M for all queries by getting a sample (I don't really care having more than 2M documents to build the word cloud). I am wondering how I could filter the documents to get approximate facets ? Thanks ! -- nicolas paris

Re: multivalue faceting term optimization

2020-03-09 Thread Nicolas Paris
maybe better than subsetting with extra random fields -- nicolas paris

Re: multivalue faceting term optimization

2020-03-09 Thread Nicolas Paris
https://lucene.apache.org/solr/guide/8_4/the-stats-component.html#local-parameters-with-the-stats-component is about hll and facets, but I am not sure that really meet the use case. I also have to admit that part is quite cryptic to me. -- nicolas paris

Re: Should I index the field that use in fq field?

2020-03-13 Thread Nicolas Franck
Yes, every field you query has to be "indexed" every field you need to be returned in the response has to be "stored" the parameter "fl" can only return fields that are "stored". Other fields given are simply ignored. > On 13 Mar 2020, at 13:15, GTHell wrote: > > I'm doing a lot of filter q

Re: How do *you* restrict access to Solr?

2020-03-16 Thread Nicolas Franck
IPtables seems like the way to go, at least for me. Even if this basic-auth-plugin works, then you'll have to deal with denial-of-service attacks (although these can also happen indirectly, by hitting the website that uses Solr). > On 16 Mar 2020, at 15:44, Ryan W wrote: > > How do you, personal

Re: Limit Solr Disk IO

2020-06-04 Thread Nicolas Franck
The real questions are: * how much often do you commit (either explicitly or automatically)? * how much segments do you allow? If you only allow 1 segment, then that whole segment is recreated using the old documents and the updates. And yes, that requires reading the old segment. It is com

Re: Questions about Solr Search

2020-07-04 Thread Nicolas Franck
Short answer: no Neither Solr nor ElasticSearch have such capabilities out of the box. Solr does have a plugin infrastructure that enables you to provide better tokenization based on language rules, and some are better than others. I saw for example integration of openNLP here: https://lucene.a

Re: "dismax" parameter "bq" filters instead of boosting

2019-04-16 Thread Nicolas Franck
any update on this? > On 5 Mar 2019, at 09:06, Nicolas Franck wrote: > > I noticed a change in the behaviour of the regular "dismax" parser. > At least in version 7.4: > > when you add "bq", it filters the results (like "fq" does), instead of &

Re: "dismax" parameter "bq" filters instead of boosting

2019-04-16 Thread Nicolas Franck
I agree, but I thought my thread was lost in the long list of issues. I prepared a simple case for solr 8.0: basic_dismax_set/config: schema.xml and solrconfig.xml basic_dismax_set/data: records_pp.json Total 6 records: http://localhost:8983/solr/test/select?echoParams=all 5

Re: "dismax" parameter "bq" filters instead of boosting

2019-04-16 Thread Nicolas Franck
than dismax. > > Regards, > Alex > P.s. My suggestion was actually to give the queries against STOCK > examples. That would have made all these parameters explicit and more > obvious. And perhaps would have allowed you to discover the minimum > parameter set causing the issue with

Re: local paramas only with defType=lucene?

2019-04-17 Thread Nicolas Franck
Yup Changes in Solr 7.2: local parameters only parsed when defType is either "lucene" or "func" cf. https://lucene.apache.org/solr/guide/7_3/solr-upgrade-notes.html#solr-7-2 cf. https://issues.apache.org/jira/browse/SOLR-11501 On 17 Apr 2019, at 10:35, Michael Aleythe, Sternwald mailto:michae

Re: Problem while indexing DATE field in SOLR.

2019-04-26 Thread Nicolas Franck
Dates need to be send in UTC format: -mm-ddTHH:MM:SSZ or if you want fractional seconds too: -mm-ddTHH:MM:SS.NNNZ See https://lucene.apache.org/solr/guide/6_6/working-with-dates.html#WorkingwithDates-DateFormatting There is no automatic conversion for dates On 26 Apr 2019, at 09:50,

Status of solR / HDFS-v3 compatibility

2019-05-02 Thread Nicolas Paris
Hi solr doc [1] says it's only compatible with hdfs 2.x is that true ? [1]: http://lucene.apache.org/solr/guide/7_7/running-solr-on-hdfs.html -- nicolas

Re: Status of solR / HDFS-v3 compatibility

2019-05-02 Thread Nicolas Paris
. > > Kevin Risden > > > On Thu, May 2, 2019 at 9:32 AM Nicolas Paris > wrote: > > > Hi > > > > solr doc [1] says it's only compatible with hdfs 2.x > > is that true ? > > > > > > [1]: http://lucene.apache.org/solr/guide/7_7/running-solr-on-hdfs.html > > > > -- > > nicolas > > -- nicolas

Re: Does Solr support retrieve a string text and get its filename accordingly?

2019-05-23 Thread Nicolas Franck
In that case you'll have to duplicate that field: id: $name_of_file id_t: $name_of_file The first field should be marked as "string", and set to be the key field. Id-fields cannot be tokenized. The second field is a derivative (you can just copy the contents, or use copyField), and should be se

Re: Urgent help on solr optimisation issue !!

2019-06-07 Thread Nicolas Franck
In that case, hard optimisation like that is out the question. Resort to automatic merge policies, specifying a maximum amount of segments. Solr is created with multiple segments in mind. Hard optimisation seems like not worth the problem. The problem is this: the less segments you specify during

Re: Solr 7.6.0: PingRequestHandler - Changing the default query (*:*)

2019-08-05 Thread Nicolas Franck
If the ping request handler is taking too long, and the server is not recovering automatically, there is not much you can do automatically on that server. You have to intervene manually, and restart Solr on that node. First of all: the ping is just an internal check. If it takes too long to respon

Re: Searches across Cores

2019-08-09 Thread Nicolas Franck
He's right. You can use the parameter "shards" for a very long time, even before the whole solr cloud existed. e.g. http://localhost:8983/solr/core0/select with parameters: shards = localhost:8983/solr/core0,example.com:8983/solr/core0 q = *:* defType =

Document Update performances Improvement

2019-10-15 Thread Nicolas Paris
lad to implement it in that library. Also, I have split the collection over multiple shard, and I admit this faster the update process, but who knows ? Thoughts ? -- nicolas

Solr-Cloud, join and collection collocation

2019-10-15 Thread Nicolas Paris
collection. I wonder if there is a way to overcome this limitation. For example, by replicating the joined collection to every shard - or other method I am ignoring. Any thought ? -- nicolas

Re: Solr-Cloud, join and collection collocation

2019-10-15 Thread Nicolas Paris
a of that shard must be co-located with every > replica of the “to” collection. > > Have you looked at streaming and “streaming expressions"? It does not have > the same problem, although it does have its own limitations. > > Best, > Erick > > > On Oct 15, 2019,

Re: Solr-Cloud, join and collection collocation

2019-10-16 Thread Nicolas Paris
is 12M or 1 document in size. So the performance of join looks correlated to size of joined collection and not the kind of filter applied to it. I will explore the streaming expressions On Wed, Oct 16, 2019 at 08:00:43AM +0200, Nicolas Paris wrote: > > You can certainly replicate the

Re: Solr-Cloud, join and collection collocation

2019-10-16 Thread Nicolas Paris
ing score=none as a local param. Turns another algorithm dragging > by from side join. > > On Wed, Oct 16, 2019 at 11:37 AM Nicolas Paris > wrote: > > > Sadly, the join performances are poor. > > The joined collection is 12M documents, and the performances are 6k ms &

Re: Document Update performances Improvement

2019-10-19 Thread Nicolas Paris
Hi community, Any advice to speed-up updates ? Is there any advice on commit, memory, docvalues, stored or any tips to faster things ? Thanks On Wed, Oct 16, 2019 at 12:47:47AM +0200, Nicolas Paris wrote: > Hi > > I am looking for a way to faster the update of documents. > >

Re: Document Update performances Improvement

2019-10-19 Thread Nicolas Paris
instances, sharding, replication, > commit timing etc. > > > Am 19.10.2019 um 21:52 schrieb Nicolas Paris : > > > > Hi community, > > > > Any advice to speed-up updates ? > > Is there any advice on commit, memory, docvalues, stored or any tips to >

Re: Document Update performances Improvement

2019-10-22 Thread Nicolas Paris
. Thanks Paras On Tue, Oct 22, 2019 at 01:00:10PM +0530, Paras Lehana wrote: > Hi Nicolas, > > Have you tried playing with values of *IndexConfig* > <https://lucene.apache.org/solr/guide/6_6/indexconfig-in-solrconfig.html> > (merge factor, segment size, maxBufferedDoc

Re: Document Update performances Improvement

2019-10-23 Thread Nicolas Paris
ze > the index and the index size comes below 13 GB! I guess I get the idea: "put the dollars as fast as possible in the bag, we will clean-up when back home" Thanks On Wed, Oct 23, 2019 at 11:34:44AM +0530, Paras Lehana wrote: > Hi Nicolas, > > What kind of chan

Re: Document Update performances Improvement

2019-10-23 Thread Nicolas Paris
dates might be faster. The documents are stored within parquet files without any processing needed. In this case, the atomic update is not likely to faster things. Thanks On Wed, Oct 23, 2019 at 07:49:44AM -0600, Shawn Heisey wrote: > On 10/22/2019 1:12 PM, Nicolas Paris wrote: > > &

  1   2   >