Re: Boosting fields at query time in Standard Request Handler from Solrconfig.xml

2011-05-31 Thread Jan Høydahl
Hi, You need to add edismax -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 31. mai 2011, at 18.08, Vignesh Raj wrote: > Hi, > > I am developing a search engine app using Asp.Net, C# and Solrnet. I use the &

ExtractingRequestHandler - renaming tika generated fields

2011-06-09 Thread Jan Høydahl
ield generated by Tika/SolrCell. If I do fmap.title=tika_title then my literal.title also also changes name. Any ideas? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Boost or sort a query with range values

2011-06-09 Thread Jan Høydahl
Check the new if() function in Trunk, SOLR-2136. You could then use it in &bf= or &boost= -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. juni 2011, at 13.05, jlefebvre wrote: > thanks it's ok > > ano

Re: Boost or sort a query with range values

2011-06-09 Thread Jan Høydahl
Btw. your example is a simple boolean query, and this will also work: &bq=(myfield1:0 AND myfield2:1)^100.0 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. juni 2011, at 13.31, Jan Høydahl wrote: > Check the new if() f

Re: ExtractingRequestHandler - renaming tika generated fields

2011-06-09 Thread Jan Høydahl
not rename those. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. juni 2011, at 11.26, Jan Høydahl wrote: > Hi, > > I post a PDF from a CMS client, which has metadata about the document. One of > those metadata is th

Re: Using Edismax

2011-06-14 Thread Jan Høydahl
similar thins for other special cases you wish to match. I assume that the normal case is that you want to match whole words like sharepoint or pptx, and that the _ matching is a special case. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtrai

Re: Copying few field using copyField to non multiValued field

2011-06-15 Thread Jan Høydahl
How would the resulting single-valued field look like? Concatenate all input fields into one long string? If that's what you need, I've written a FieldCopy UpdateProcessor which can do that. I'll contribute it in https://issues.apache.org/jira/browse/SOLR-2599 -- Jan Høydahl,

Re: Controlling Tika's metadata

2011-06-17 Thread Jan Høydahl
=tika_title literal.title=My client provided title In this way we can cleanly rename or ignore TIKA-generated metadata. Perhaps also an option to add a prefix to all Tika generated fields? tika.prefix=tika_ -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Solr Highlighting and fqs

2011-06-17 Thread Jan Høydahl
Hi, Vote for this feature https://issues.apache.org/jira/browse/SOLR-1926 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. juni 2011, at 21.48, Jamie Johnson wrote: > I've read in several places that the Solr High

Re: solr highliting feature

2011-06-19 Thread Jan Høydahl
Hi, First, you should consider SolrJ API if you're working from Java/JSP. Then, say you want to highlight title. In you loop across the N hits, instead of pulling the title from the hits themselves, check if you find a highlighted result with the same ID in the section. -- Jan Hø

Re: solr highliting feature

2011-06-19 Thread Jan Høydahl
Perhaps I don't understand your question right, but if you're working with the json response format, yes, you need to pull the highlighted version of the field from the highlighting section. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Sol

Re: Request handle solrconfig.xml Spellchecker

2011-06-20 Thread Jan Høydahl
Try http://localhost:8983/solr/spellcheck?q=komputer&spellcheck=true If you want the spellchecker to be enabled in your default search handler, you need to include this in your default search-hanlder config: spellcheck -- Jan Høydahl, search solution architect Cominven

Re: How to highlight a field in searchresult using solr

2011-06-20 Thread Jan Høydahl
Hi, You have asked the question before in another thread, and I have tried to answer http://search-lucene.com/m/586GL1pMaEc1 You DO need to get the highlighted result from the highlighting tag separately. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Advice wanted on approach/architecture

2011-06-26 Thread Jan Høydahl
ch re-indexes matching lines with a field exclude=true. This is more efficient than running all the regexes query-time. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 24. juni 2011, at 14.54, Js wrote: > > > Hi L

Re: Indexing binary files from database issue (no errors)

2012-07-30 Thread Jan Høydahl
Hi, Did you get a reply on this? I'd guess that it is your JDBC driver which does not handle the response to your CONCAT_WS() SQL. Try without it and see. Then try to upgrade your mysql JDBC driver to a newer version and see if it helps. -- Jan Høydahl, search solution architect Cominve

Re: SolrCloud replication question

2012-07-30 Thread Jan Høydahl
Hi, Interesting article in your link. What servlet container do you use and how is it configured wrt. threads etc? You should be able to utilize all CPUs with a single Solr index, given that you are not I/O bound.. Also, what is your mergeFactor? -- Jan Høydahl, search solution architect

Re: auto completion search with solr using NGrams in SOLR

2012-08-04 Thread Jan Høydahl
Have a look at my blog post http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/ for a walkthrough of how it could be done, as a separate Solr core. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 1. aug

Re: highlighting tint fields

2012-08-05 Thread Jan Høydahl
Hi, That is by design, you can only highlight text fields. The solution is to copy your "age" field to a "age_t" field and highlight on that instead. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 5. aug.

Re: Problems with EDisMax field aliases for multiple fields on Solr 3.6.1

2012-08-08 Thread Jan Høydahl
bug looks like: foo:drive +DisjunctionMaxQuery((features:drive^2.0 | name:drive)) You see that the query string is being parsed correctly, and we get three hits (vs 2 in name and 1 in features alone). -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrt

Re: Your opinion please concerning prod' installation

2012-08-19 Thread Jan Frühwacht
production systems) -Is there any data which you will index which the end-user should not be able to see? (distinction between indexed and stored fields) Hope this helps to start. If you have more detailed questions on securing the server please be more a bit more concise. Kind regards, Jan

Re: Need Help - Solr - Sitecore integration

2012-08-19 Thread Jan Frühwacht
sitecore. Kind regards, Jan 2012/8/16 Samuthira Pandi S > Hi, > > Currently I am working as a Sitecore Developer. > My client would like to implement SOLR search integration on my sitecore > application. > I don't have idea to implement, if you have any document related

Re: ExtractingRequestHandler causes Out of Memory Error

2012-09-27 Thread Jan Høydahl
Arrays.copyOf() call.. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 27. sep. 2012 kl. 11:22 skrev Shigeki Kobayashi : > Hi guys, > > > I use Manifold CF to crawl files in Windows file server and index them to > Solr u

Re: ExtractingRequestHandler causes Out of Memory Error

2012-10-03 Thread Jan Høydahl
Hi, If you like, you can open a JIRA issue on this and provide as much info as possible. Someone can then look into (potential) memory optimization of this part of the code. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 28. sep

Re: Solr/Lucene courses and training

2012-10-04 Thread Jan Høydahl
Please have a look at the "Support" wiki page: http://wiki.apache.org/solr/Support Search for "Training" and you'll find a number of companies providing Solr/Lucene training in Europe. -- Jan Høydahl, search solution architect Cominvent AS - www.co

Re: How to make SOLR manipulate the results?

2012-10-04 Thread Jan Høydahl
into Grouping http://wiki.apache.org/solr/FieldCollapsing to try to get this as one single query, perhaps through two distict &group.query requests... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 4. okt. 2012 kl. 08:14 skre

Re: SolrCloud - replication factor

2012-10-05 Thread Jan Høydahl
guess you can set replicationFactor in solr.xml also... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 5. okt. 2012 kl. 18:37 skrev Tomás Fernández Löbbe : > I think that's correct, but only when creating a new collection.

Re: I don't understand

2012-10-08 Thread Jan Høydahl
Hi, Please describe your environemnt better * How do you "crawl", using which crawler? * To which RequestHandler do you send the docs? * Which version of Solr * Can you share your schema and other relevant config with us? -- Jan Høydahl, search solution architect Co

Re: Wildcards and fuzzy/phonetic query

2012-10-09 Thread Jan Høydahl
. You may also be interested in a MeetUp talk held in Oslo last month: http://www.meetup.com/Oslo-Solr-Community/events/67253692/ You'll find the link to Mats' talk about Norwegian phonetics if you scroll down that page. -- Jan Høydahl, search solution architect Cominvent AS - ww

Re: BlockPostingsFormat

2012-10-13 Thread Jan Høydahl
Have not tested, but have you read this page? http://wiki.apache.org/solr/SchemaXml -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 13. okt. 2012 kl. 02:42 skrev varun srivastava : > Hi, > How can I enable BlockPostingsForm

Re: Solr4 without slf4j bindings -- apparent catch-22

2012-10-16 Thread Jan Høydahl
uff. With Jetty you can then configure your preferred logging by putting slf4j classes in jetty/lib/ext/. With Tomcat you'd probably put them in system/lib or common/lib. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 15. okt.

Re: solr 4 tika config

2012-10-17 Thread Jan Høydahl
Hi, Try the new post.jar in version 4.0.0 It will allow you to say java -Dauto -Drecursive -Dfiletypes=ppt -jar post.jar "d:\myfiles" You can inspect your Solr log file to see what ExtractingRequestHandler URLs are actually called for each -- Jan Høydahl, search solution architect

Re: how solr4.0 and zookeeper run on weblogic

2012-10-17 Thread Jan Høydahl
Did it work for you? You probably also have to set -Djetty.port=8080 in order for local ZK not to be started on port 9983. It's confusing, but you can also edit solr.xml to achieve the same. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Tra

dismax + wildcard

2009-11-04 Thread Jan Kammer
Hi there, what is the best way to search all fields AND use wildcards? Somewhere I read that there are problems with this combination... (dismax + wildcard) Thanks, Jan smime.p7s Description: S/MIME Cryptographic Signature

Wrong sort by score

2008-08-27 Thread Yuri Jan
Hi, I have encountered a weird problem in solr. In one of my queries (dismax, default sorting) I noticed that the results are not sorted by score (according to debugQuery). The first 150 results are tied (with score 12.806474), and after those, there is a bunch of results with higher score (12.962

Re: Wrong sort by score

2008-08-27 Thread Yuri Jan
Actually, no... The score in the fl are 12.806475 and 10.386531 respectively, so the results according to that are sorted correctly. Is it just a problem with the debugQuery? On Wed, Aug 27, 2008 at 9:21 AM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Wed, Aug 27, 2008 at 9:10 AM,

Re: Wrong sort by score

2008-08-27 Thread Yuri Jan
tself is wrong (the one that I get from fl)? Thanks, Yuri On Wed, Aug 27, 2008 at 11:44 AM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Wed, Aug 27, 2008 at 9:38 AM, Yuri Jan <[EMAIL PROTECTED]> wrote: > > Actually, no... > > The score in the fl are 12.806475 an

Re: Wrong sort by score

2008-08-29 Thread Yuri Jan
There was a bug in my implementation of tf(float freq). It was always 1, even when the frequency was 0. This caused a discrepancy between the score and the debug info score - it seems like in the debug all the dismax fields got score, while in the "real" score fields with document frequency 0 were

Different tokenizing algorithms for the same stream

2008-11-06 Thread Yuri Jan
Hello all, I'm trying to implement a tokenizer that will behave differently on different parts of the incoming stream. For example, for the first X words in the stream I would like to use one tokenizing algorithm, while for the rest of the stream a different tokenizing algorithm will be used. Wha

Re: Different tokenizing algorithms for the same stream

2008-11-07 Thread Yuri Jan
t your personalized tokenizer in a way it > >> changes its behaviour after it has delivered X tokens. > >> > >> This implies a new tokenizer instance is build from the factory for > >> every string analyzed, which I believe is true. > >> > &g

Re: Preparing the ground for a real multilang index

2009-07-07 Thread Jan Høydahl
query language. You may then want to do a copyfield from all your text_ -> text for convenient one- field-to-rule-them-all search. -- Jan Høydahl Gründer & senior architect Cominvent AS, Stabekk, Norway www.cominvent.com +20 100930908 On 3. juli. 2009, at 08.43, Michael Lackhoff wrot

Re: Preparing the ground for a real multilang index

2009-07-11 Thread Jan Høydahl
Michael, you're of course right, copyfield would copy from source. The lack of built-in language awareness in Solr is unfortunate :( I have not tried Lucid's BasisTech lemmatizer implementation, but check with them whether they can support multi languages in the same field. -- Jan Hø

DutchStemFilterFactory reducing double vowels bug ?

2009-07-12 Thread Jan Murre
. are indexed as 'bas', 'pas', 'man' and 'bom'. Those wordt have a meaning of their own. Am I missing something, or has this to be considered as a bug? Regards, Jan

mac osx server + tomcat + solr

2010-04-21 Thread Jan Kammer
i click on "Solr Admin", the described error appears. I asked google, but didn't find the right answer to solve my problem. thanks in advance. Greetz, Jan

Re: mac osx server + tomcat + solr

2010-04-21 Thread Jan Kammer
hi, thanks for your answer. I defined the path in an xml-file in Tomcat/conf/Catalina/localhost/ I read that this must be the right place. Could there be anywhere an other mistake? greetz, Jan Abdelhamid ABID schrieb: You are missing the "solr/home" context paramater , which poi

Re: mac osx server + tomcat + solr

2010-04-21 Thread Jan Kammer
e solr web-app. solr/home may be set in your xml-file like this PATH_TO_SOLR_HOME must contains "conf" and "data" folders On 4/21/10, Jan Kammer wrote: hi, thanks for your answer. I defined the path in an xml-file in Tomcat/conf/Catalina/localhost/ I read that this mus

Re: mac osx server + tomcat + solr

2010-04-21 Thread Jan Kammer
Is also correct. Abdelhamid ABID schrieb: Well, check your solr config file if the folder "Data" is correctly defined, you may set it like this: ${solr.data.dir:} On 4/21/10, Jan Kammer wrote: Exactly that i got in the file but it doesn't work :-( Abdelhamid ABID

SpellChecking

2010-05-03 Thread Jan Kammer
s, but not for 10 or more... My question is, what the best practice is to enable spellchecking on many fields. thanks. greetz, Jan

Re: SpellChecking

2010-05-03 Thread Jan Kammer
f i define more, the count for failed documents in dataimporthandler gets higher and higher the more i copy into "spell". 16444 So my question is, if this is the right way to use the spellchecker with many fields, or is there an other "better" way... thanks. greetz,

Re: SpellChecking

2010-05-03 Thread Jan Kammer
Hi, i build the index with ...&spellcheck.build=true It worked fine with a normal field. There must something wrong with copyfield, or why does dataimporthandler add/update no more documents? Can somebody paste the code for copyfield with many fields? Greetz, Jan Am 03.05.2010 1

Re: SpellChecking

2010-05-04 Thread Jan Kammer
Hi, thanks, exactly that i forgot. Now it works fine. :-) Am 03.05.2010 16:50, schrieb Michael Kuhlmann: Am 03.05.2010 16:43, schrieb Jan Kammer: Hi, It worked fine with a normal field. There must something wrong with copyfield, or why does dataimporthandler add/update no more documents

Re: defaultSearchField

2010-05-19 Thread Jan Kammer
There is something called dismax-requesthandler. I think this is what you are looking for. greetz, Jan Am 19.05.2010 15:47, schrieb Antonello Mangone: Hi to everyone, I'd like to know if it's possible to use the * defaultSearchField* on more fields ??? i.e. field1, fiel

[slightly ot] Looking for Lucene/Solr consultant in Germany

2007-08-08 Thread Jan Miczaika
search patterns used to query the data - test and evaluate the results The requirements: deep knowledge of Lucene/Solr, examples of implemented working search engines, theoretical knowledge Is anyone interested? Please feel free to circulate this offer. Thanks in advance Jan -- Geschäftsführer

Re: WELCOME to solr-user@lucene.apache.org

2008-01-08 Thread Jan Buelens
Hi, We are currently using Solr as search engine. To add an existing website to our search engine, we are investigating Nutch. Does anyone have more information / experience about an integration between Solr and Nutch? Thanks in advance ! Best regards, Jan

Re: SOLR 7.x stable version

2018-08-13 Thread Jan Høydahl
Getting the latest release is almost always the best ting to do, unless it is a 7.x.0 release fresh out the gate. As soon as a few weeks has passed without a bugfix release being announced, that normally means it is solid :) Which means that 7.4.0 should be your go-to version right now. -- Jan

Re: Schema Change for Solr 7.4

2018-08-13 Thread Jan Høydahl
Aliases are like pointers to collections that can be used in-place anywhere you'd use the collection name. See https://lucene.apache.org/solr/guide/7_4/collections-api.html#createalias <https://lucene.apache.org/solr/guide/7_4/collections-api.html#createalias> -- Jan Høydahl, sear

Re: Metrics for a healthy Solr cluster

2018-08-16 Thread Jan Høydahl
ory.html <https://lucene.apache.org/solr/guide/7_4/metrics-history.html> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 16. aug. 2018 kl. 17:24 skrev Greenhorn Techie : > > Hi, > > Solr provides numerous JMX metrics for monitoring the health

Re: Solr unable to start up after setting up SSL in Solr 7.4.0

2018-08-20 Thread Jan Høydahl
efault configurations, just to sort out any custom changes you may have introduced? If that works, then you can try to change things step by step until you find the difference in config causing your issue. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 20. aug. 2018

Re: Solr unable to start up after setting up SSL in Solr 7.4.0

2018-08-21 Thread Jan Høydahl
Hi, Now, the zkcli.bat error may in fact be a real bug, perhaps caused by SOLR-7887. I think you should file that one in a JIRA. As a workaround you may attempt using the equivalent (from memory): bin/solr zk upconfig -c collection1 -d /path/to/conf -z localhost:2181 -- Jan Høydahl, search

Re: 7.3.1: Query of death - all nodes ran out of memory and had to be shut down

2018-08-21 Thread Jan Høydahl
The solution is to move to cursors, but you may as a safety net try to apply the RequestSanitizerComponent to disallow large offsets, see https://github.com/cominvent/request-sanitizer-component <https://github.com/cominvent/request-sanitizer-component> -- Jan Høydahl, search so

Re: SOLRJ 7.x library fails ejb deployment with weblogic

2018-08-21 Thread Jan Høydahl
-lang-arrayindexoutofboundsexception-while-deploying-app-in-wls-12 <https://stackoverflow.com/questions/19152655/java-lang-arrayindexoutofboundsexception-while-deploying-app-in-wls-12> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 21. aug. 2018 kl. 13:28 skrev

Re: Solr unable to start up after setting up SSL in Solr 7.4.0

2018-08-23 Thread Jan Høydahl
s/zkcli.bat <https://github.com/apache/lucene-solr/blob/branch_7x/solr/server/scripts/cloud-scripts/zkcli.bat> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 23. aug. 2018 kl. 05:51 skrev Shawn Heisey : > > On 8/22/2018 8:31 PM, Zheng Lin Edwin Yeo

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread Jan Høydahl
then you get more verbose logging in solr.log which may help Can you share with us how your query looks like, including all parameters from the section with echoParams=all enabled? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 23. aug. 2018 kl. 11:09 skrev zheny

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread Jan Høydahl
Count on filterCache, find alternatives to wildcard query and more. But all in all, I'd be very very satisfied with those low response times given the size of your data. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 23. aug. 2018 kl. 15:05 skrev Shawn Heisey : >

Re: Authorization Non- Admin user - SOLR

2018-08-24 Thread Jan Høydahl
Please share your security.json for us to be able to tell whether you configured something wrong -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 24. aug. 2018 kl. 21:28 skrev Rathor, Piyush (US - Philadelphia) > : > > Hi Team, > > W

Re: passing opts to all java calls in solr/solr.cmd

2018-08-24 Thread Jan Høydahl
Hi Sounds like it would be worth a low-pri issue. And it would be great if you guys could experiment to find the necessary changes and then contribute a patch back to the project, which we will then commit :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 25.

Re: Authorization Non- Admin user - SOLR

2018-08-27 Thread Jan Høydahl
Hi, The mailing list does not accept attachments, please copy/paste or use a file sharing service. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 27. aug. 2018 kl. 05:05 skrev Rathor, Piyush (US - Philadelphia) > : > > Hi Jan, > > Pl

Re: Multiple solr instances per host vs Multiple cores in same solr instance

2018-08-27 Thread Jan Høydahl
OS. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 27. aug. 2018 kl. 05:09 skrev Wei : > > Thanks Shawn. When using multiple Solr instances per host, is there any way > to prevent solrcloud from putting multiple replicas of the same shard on > same h

Re: Multiple solr instances per host vs Multiple cores in same solr instance

2018-08-27 Thread Jan Høydahl
What was your bottleneck when maxing on 30QPS on 3 node cluster? I expect such tests to vary quite much between use cases, so a good approach is to do just as you did: benchmark on your specific data and usage. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 27.

Re: Solr LTS and EOL

2018-08-28 Thread Jan Høydahl
Term Patching", avoiding the word "Support" :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 28. aug. 2018 kl. 15:20 skrev Dan Untenzu : > > Hey Shawn, > > thanks a lot for your clarification, all questions answered. > > Your mes

Boost matches occurring early in the field (offset)

2018-08-29 Thread Jan Høydahl
text in a new field and boost on that, but that is kind of "binary". I could also add the term offset as payload for every term and boost on that, but this should not be necessary since offset info is already part of the index? -- Jan Høydahl, search solution architect Co

Re: Solr indexing Duplicate URL's ending with /

2018-08-29 Thread Jan Høydahl
SOlr. See URLClassifyProcessor https://lucene.apache.org/solr/guide/7_2/update-request-processors.html#general-use-updateprocessorfactories <https://lucene.apache.org/solr/guide/7_2/update-request-processors.html#general-use-updateprocessorfactories> which may perhaps help. -- Jan H

Re: Boost matches occurring early in the field (offset)

2018-08-29 Thread Jan Høydahl
I have seen that one. But as I understand spanFirst, it only allows you to define a boost if your span matches, i.e. not a gradually lower score the further down in the document the match is? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 29. aug. 2018 kl. 12

Re: Boost matches occurring early in the field (offset)

2018-08-29 Thread Jan Høydahl
gorithm? Would you need to implement your own Scorer/Weight impl? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 29. aug. 2018 kl. 15:37 skrev Doug Turnbull > : > > You can also insert a token at the beginning of the query during analysis > using a

Re: Solr Stale pages

2018-08-30 Thread Jan Høydahl
then the crawler needs to explicitly send a delete request for that doc. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 30. aug. 2018 kl. 08:48 skrev kunhu0...@gmail.com: > > Hello All, > > I would like to know how Solr will handle the stale page

Re: [EXTERNAL] - Re: join works with a core, doesn't work with a collection

2018-08-31 Thread Jan Høydahl
Hi, You can have multiple nodes as long as you make sure that your collection has only one shard, then the joins will work. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 30. aug. 2018 kl. 19:51 skrev Steve Pruitt : > > Shawn, > > You are corr

Re: Split on whitespace parameter doubt

2018-08-31 Thread Jan Høydahl
> I am not sure why field centric field is not used all the time or at least > why there is no parameter to force it. Yea, we should have a parameter to force a field/term centric mode if possible. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: Solrcloud collection file location on zookeeper

2018-08-31 Thread Jan Høydahl
Once Solr 7.5 is released you will have a new "Cloud -> ZK Status" tab that will among other things show the data path on each ZK server. Until then, log in to the ZK server, locate zoo.cfg and check. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com >

Streaming timeseries() and buckets with no docs

2018-09-03 Thread Jan Høydahl
from search() and replace values? Or is there perhaps a function that can loop through the column produced by col() and replace non-numeric values with 0? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: solr how to show different documents for different users in a better way

2018-09-04 Thread Jan Høydahl
ManifoldCF (https://manifoldcf.apache.org) which is a framework that can both crawl multiple repos with ACLs and it also has a Solr plugin available for applying the filter.: https://www.slideshare.net/francelabs/integrate-manifoldcf-with-solr -- Jan Høydahl, search solution architect Co

Re: Streaming timeseries() and buckets with no docs

2018-09-05 Thread Jan Høydahl
I have tested this with latest released ver 7.4.0 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 4. sep. 2018 kl. 16:32 skrev Joel Bernstein : > > Which version are you using? > > I remember addressing this issue, but it may have been in Alfresco

Re: Streaming timeseries() and buckets with no docs

2018-09-05 Thread Jan Høydahl
26.04.2017, 17:17 SOLR-10566: Add timeseries Streaming Expression Joel Bernstein 26.04.2017, 16:57 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 5. sep. 2018 kl. 16:12 skrev Jan Høydahl : > > I have tested this with latest released ver 7.4.0 > > -

Re: Streaming timeseries() and buckets with no docs

2018-09-06 Thread Jan Høydahl
Created https://issues.apache.org/jira/browse/SOLR-12749 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 5. sep. 2018 kl. 23:48 skrev Jan Høydahl : > > Checked git history for TimeSeriesStream on master, and I cannot see any > commits related to thi

Re: Streaming timeseries() and buckets with no docs

2018-09-06 Thread Jan Høydahl
Thanks! -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 6. sep. 2018 kl. 15:09 skrev Joel Bernstein : > > I found the ticket you created and commented on it. I'll work on this today. > > > Joel Bernstein > http://joelsolr.blogspot.com/ &

Re: Solr range faceting

2018-09-06 Thread Jan Høydahl
Try facet.minCount=0 Jan > 7. sep. 2018 kl. 01:07 skrev Dwane Hall : > > Good morning Solr community. I'm having a few facet range issues for which > I'd appreciate some advice when somebody gets a spare couple of minutes. > > Environment > Solr Cloud (

Re: Docker and Solr Indexing

2018-09-11 Thread Jan Høydahl
give to Solr out of those 4G and many other factors. Jan > 11. sep. 2018 kl. 08:05 skrev solrnoobie : > > So we have a dockerized aws environment with the solr docker container having > only 4 gigs for max ram. > > Our problem is whenever we index, the container containing the l

How secure is Zookeeper digest auth?

2018-09-16 Thread Jan Høydahl
over the network, so that anyone who can sniff the network can also pick up the password, and connect and read/write nodes in ZK? We'll of course add all the firewall and IP filtering we can. Do you have any other tricks you use to increase ZK security? -- Jan Høydahl, search solution arch

Re: How secure is Zookeeper digest auth?

2018-09-16 Thread Jan Høydahl
read the traffic but I do care if the auth credentials can easily be sniffed and replayed, thus enable write acces to zk, which lets you disable all Solr security. Jan > 16. sep. 2018 kl. 22:45 skrev Christopher Schultz > : > > -BEGIN PGP SIGNED MESSAGE- > Hash:

Re: Apache zookeeper jar upgrade for SOLR

2018-09-17 Thread Jan Høydahl
ews.html> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 17. sep. 2018 kl. 08:46 skrev padmanabhan1616 : > > Hi Team, > > We are using Apache SOLR-5.2.1 and Zookeeper-3.4.5 > > is it good idea to upgrade zookeeper-3.4.5.jar to zookeeper-3.4

Re: 20180917-Need Apache SOLR support

2018-09-17 Thread Jan Høydahl
requirements you have etc, instead of a lot of very specific max/min questions. There are very seldom hard limits, and if there are, it is usually not a good idea to approach them :) Jan

Re: [OT] 20180917-Need Apache SOLR support

2018-09-18 Thread Jan Høydahl
to investigate an alternate backup command, which instructs each shard leader to stream all documents to JSON inside the backup folder, in parallell. But you may still get issues with the Zookeeper part if restoring to a very different version. -- Jan Høydahl, search solution architect

Re: Modify Schema for Solr Cloud

2018-09-18 Thread Jan Høydahl
reload collection in UI -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 18. sep. 2018 kl. 07:30 skrev Rathor, Piyush (US - Philadelphia) > : > > Hi All, > > I am new to solr cloud. > > Can you please let me know on how to update

Re: Unnecessary Components

2018-09-20 Thread Jan Høydahl
27;t have a sponsor either. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 19. sep. 2018 kl. 22:25 skrev Shawn Heisey : > > On 9/19/2018 1:48 PM, oddtyme wrote: >> I am helping implement solr for a "downloadable library" of sorts. The >

Re: Query with exact number of tokens

2018-09-21 Thread Jan Høydahl
I have made a FieldType specially for this https://github.com/cominvent/exactmatch/ <https://github.com/cominvent/exactmatch/> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 21. sep. 2018 kl. 18:14 skrev Steve Rowe : > > Link correction - wrong frag

Re: Solrcloud admin not asking for login

2018-09-22 Thread Jan Høydahl
requests will require authentication. Jan > 22. sep. 2018 kl. 20:11 skrev Sushant Vengurlekar > : > > Any one have any ideas as to what could be going wrong. > Thank you > > On Fri, Sep 21, 2018 at 10:43 PM Sushant Vengurlekar < > svengurle...@curvolabs.com> wrote: &

Re: Solr index clearing

2018-09-25 Thread Jan Høydahl
Hi, Solr does not do anything automatically, so I think this is a question for the Nutch community - http://nutch.apache.org/mailing_lists.html -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 24. sep. 2018 kl. 20:06 skrev Bineesh : > > Team, > > We

Re: SOLR Index Time Running Optimization

2018-09-26 Thread Jan Høydahl
With DIH you are doing indexing single-threaded. You should be able to configure multiple DIH's on the same collection and then partition the data between them, issuing slightly different SQL to each. But I don't exactly know what that would look like. -- Jan Høydahl, search solution

Re: Solr Stream vs Export Request Handlers

2018-09-27 Thread Jan Høydahl
cs as you stream them from Solr, then streaming expressions lets you enrich the docs, modify, join etc on the fly. PS: When the /export docs says it uses a streaming tecnique, it does NOT mean that it has uses the solr feature streaming expressions :) -- Jan Høydahl, search solution arch

Re: Modify the log directory for dih

2018-10-02 Thread Jan Høydahl
figuring-logging.html#permanent-logging-settings> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 2. okt. 2018 kl. 10:25 skrev lala : > > Hi, > Is there a way to set the log directory for a dih request and the log file > name? > thanks in advance

Re: data-import-handler for solr-7.5.0

2018-10-02 Thread Jan Høydahl
> url="C:/Users/z6mhq/Desktop/data_import/nh_test.xml" Have you tried url="C:\\Users\\z6mhq/Desktop\\data_import\\nh_test.xml" ? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 2. okt. 2018 kl. 17:15 skrev Martin Frank Hansen (MHQ) : >

Re: Migrate cores from 4.10.2 to 7.5.0

2018-10-03 Thread Jan Høydahl
y than full re-index. So better plan for a full re-index from the beginning. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 4. okt. 2018 kl. 00:23 skrev Shawn Heisey : > > On 10/3/2018 3:17 PM, Pure Host - Wolfgang Freudenberger wrote: >> Is there any

<    1   2   3   4   5   6   7   8   9   10   >