Re: Recommendation for integration test framework

2013-05-04 Thread Jan Morlock
erver in order to set up a Solr server inside our test cases. The tests are performed during the integration-test phase of the Maven lifecycle. Best regards Jan Am 24.03.2013 23:14, schrieb kamaci [via Lucene]: > Unrelated about your question you said that: "We are utilizing Apache Mav

List of Solr Query Parsers

2013-05-06 Thread Jan Høydahl
Hi, I just added a Wiki page to try to gather a list of all known Solr query parsers in one place, both those which are part of Solr and those in JIRA or 3rd party. http://wiki.apache.org/solr/QueryParser If you known about other cool parsers out there, please add to the list. -- Jan

Re: List of Solr Query Parsers

2013-05-06 Thread Jan Høydahl
ok at your code. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 6. mai 2013 kl. 15:41 skrev Roman Chyla : > Hi Jan, > Please add this one http://29min.wordpress.com/category/antlrqueryparser/ > - I can't edit the wiki > > This parser is written w

Re: update to 4.3

2013-05-06 Thread Jan Høydahl
CHANGES.TXT for more info http://lucene.apache.org/solr/4_3_0/changes/Changes.html#4.3.0.upgrading_from_solr_4.2.0 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 6. mai 2013 kl. 16:50 skrev Arkadi Colson : > Hi > > After update to 4.3 I got this error: &g

Re: Solr Cloud with large synonyms.txt

2013-05-06 Thread Jan Høydahl
See discussion here http://lucene.472066.n3.nabble.com/gt-1MB-file-to-Zookeeper-td3958614.html One idea was compression. Perhaps if we add gzip support to SynonymFilter it can read synonyms.txt.gz which would then fit larger raw dicts? -- Jan Høydahl, search solution architect Cominvent AS

Re: List of Solr Query Parsers

2013-05-06 Thread Jan Høydahl
Added. Please try editing the page now. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 6. mai 2013 kl. 19:58 skrev Roman Chyla : > Hi Jan, > My login is RomanChyla > Thanks, > > Roman > On 6 May 2013 10:00, "Jan Høydahl" wrote: >

Re: Solr Cloud with large synonyms.txt

2013-05-07 Thread Jan Høydahl
ZkSolresourceLoader for large files could be one way forward, i.e. store synonyms.txt as e.g. __001_synonyms.txt __002_synonyms.txt Feel free to open a JIRA issue for this so we can get a proper resolution. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 7. mai 2013

Re: Search performance: shards or replications?

2013-05-07 Thread Jan Høydahl
well. But if your QPS is moderate and you have tons of documents, you may gain better performance both for indexing latency and search latency by sharding. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 7. mai 2013 kl. 13:09 skrev Stanislav Sandalnikov : >

Re: Solr 4.3 fails in startup when dataimporthandler declaration is included in solrconfig.xml

2013-05-08 Thread Jan Høydahl
Why did you place solr.war in tomcat/lib? Can you detail the specific errors you get when you place your DIH jars in solr-home/lib or instanceDir/lib? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 8. mai 2013 kl. 21:15 skrev William Pierce : > Thanks, Alex.

Re: Solr 4.3 fails in startup when dataimporthandler declaration is included in solrconfig.xml

2013-05-09 Thread Jan Høydahl
om scratch -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 9. mai 2013 kl. 01:54 skrev William Pierce : > The reason I placed the solr.war in tomcat/lib was -- I guess -- because > that's way I had always done it since 1.3 days. Our tomcat instance(s) run

Re: 4.3 logging setup

2013-05-09 Thread Jan Høydahl
perties (either place it on classpath or point to it with startup opt) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 9. mai 2013 kl. 20:41 skrev richardg : > Thanks for responding. My issue is I've never changed anything w/ logging, I > have always use

Re: 4.3 logging setup

2013-05-09 Thread Jan Høydahl
I've updated the WIKI: http://wiki.apache.org/solr/SolrLogging#Switching_from_Log4J_logging_back_to_Java-util_logging -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 9. mai 2013 kl. 21:57 skrev Shawn Heisey : > On 5/9/2013 1:41 PM, richardg wrote: >>

Re: Index compatibility from 4.0 to 4.3

2013-05-10 Thread Jan Høydahl
converted to 4.3 format, so keep a backup in case you wish to downgrade again, since 4.0 will not read this index. I'm not 100% sure if this auto conversion will compressed stored fields for you or if you need a re-index for that. -- Jan Høydahl, search solution architect Cominve

Re: Install Solr on EC2

2013-05-10 Thread Jan Høydahl
If you find a good recipe, perhaps update the outdated WIKI page http://wiki.apache.org/solr/SolrOnAmazonEC2 ? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 10. mai 2013 kl. 13:55 skrev Rajesh Nikam : > Hi All, > > I am looking for steps to run Solr

Re: Index compatibility from 4.0 to 4.3

2013-05-10 Thread Jan Høydahl
You're probably right, that it's only new and merged segments that will be of the new format. Solr 4.3 will be able to search the old segments. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 10. mai 2013 kl. 15:27 skrev Shawn Heisey : > On 5/10/2013

Re: Install Solr on EC2

2013-05-10 Thread Jan Høydahl
Done -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 10. mai 2013 kl. 22:57 skrev Michael Della Bitta : > I can update this page if someone adds me as a contributor: > MichaelDellaBitta. > > > Mich

Re: Request to be added to Contributor Group

2013-05-14 Thread Jan Høydahl
Added -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 14. mai 2013 kl. 06:19 skrev "Dao Xuan, Hoang" : > Hi Admins, > > My name is Eric. I got an account at http://wiki.apache.org/solr/ with user > name is Eric D. Please add me to the Contribut

Sorting facets by relevance

2013-05-15 Thread Jan Morlock
. Best regards Jan [1]: http://wiki.apache.org/solr/SimpleFacetParameters#facet.sort [2]: http://wiki.apache.org/solr/FieldCollapsing -- View this message in context: http://lucene.472066.n3.nabble.com/Sorting-facets-by-relevance-tp4063649.html Sent from the Solr - User mailing list archive at

Re: Solr 4.3: node is seen as active in Zk while in recovery mode + endless recovery

2013-05-23 Thread Jan Høydahl
Huge tlogs seems to be a common problem. Should we make it flush automatically on huge file size? Could be configurable on the tag? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 23. mai 2013 kl. 14:03 skrev Erick Erickson : > Tangential to the issue you raise

Wiki pages for Solr releases

2013-05-28 Thread Jan Høydahl
Hi, I have added the missing WIKI pages for https://wiki.apache.org/solr/Solr4.1 https://wiki.apache.org/solr/Solr4.2 https://wiki.apache.org/solr/Solr4.3 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: Solr Composite Unique key from existing fields in schema

2013-05-28 Thread Jan Høydahl
The cleanest is to do this from the outside. Alternatively, it will perhaps work to populate your uniqueKey in a custom UpdateProcessor. You can try. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 28. mai 2013 kl. 17:12 skrev Rishi Easwaran : > Hi

Re: Entire query is stopwords

2013-06-08 Thread Jan Høydahl
Remove the stopFilter from the "index" section of your fieldType, only keep it in the "query" section. This way your stopwords will always be indexed and edismax will be able to selectively remove stopwords from the query depending on whether all words are stopwords or

Re: Adding pdf/word file using JSON/XML

2013-06-11 Thread Jan Høydahl
. You do not need to use extracting request handler then, since you do the extraction on the client side. PS: Evem if you use /extract, note that you can pass the literal.* params as POST if you choose, using 100% standards-based HTTP multipart post. -- Jan Høydahl, search solution architect

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Jan Høydahl
ing* Solr. This is a skill not even seasoned SQL developers have. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 15. juni 2013 kl. 21:39 skrev Jack Krupansky : > [My apologies to Roland for "hijacking" his original thread for this rant! > Look what you st

Re: update solr.xml dynamically to add new cores

2013-06-24 Thread Jan Høydahl
/Solr.xml%204.4%20and%20beyond -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 21. juni 2013 kl. 19:03 skrev smanad : > Gr8! thanks a lot! > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/update-solr-xml-dynamic

Re: String field does not yield partial match result using qf parameter

2013-06-25 Thread Jan Høydahl
fieldType "string" is not tokenized, so your observation is correct. You need to use a fieldType with analysis and tokenization to get the behavior you want. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. juni 2013 kl. 02:35 skrev "Mugoma Joseph

Re: Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Jan Høydahl
probably zkCli shipping with Solr. See http://wiki.apache.org/solr/SolrCloud#Command_Line_Util This means you will NOT need to start Solr with -Dboostrap_confdir at all. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. juni 2013 kl. 10:29 skrev Utkarsh Sengar : > He

Re: Solr Document inside the document

2013-06-25 Thread Jan Høydahl
uery time join as demonstrated in the Tutorial data (manufacturer sub documents as separate docs) * block-joins (faster way to index/search parent-child relationship sub docs) Please elaborate on your exact need -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. j

Re: commitWithin + SolrJ

2011-09-03 Thread Jan Høydahl
Many customers have asked me how to do this, so it's not easy enough. Therefore I opened SOLR-2742 to add even more convenience :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 25. aug. 2011, at 01:38, Chris Hostetter

Re: Best way to anchor solr searches?

2011-09-03 Thread Jan Høydahl
der. Lucene has something called commit-point which - if kept constant for all queries will give you the same index version. But Solr does not expose this as a feature, so you'd have to roll your own caching someway. Perhaps increasing queryResultWindowSize will help. -- Jan Høydahl, search solution

Re: Field type change / copy field

2011-09-03 Thread Jan Høydahl
put into a real "date" field, then send it in from your client twice or write a UpdateProcessor to construct the date field based on the year field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 24. aug. 2011, at 11

Re: Indexing Lotus Notes database using API

2011-09-05 Thread Jan Høydahl
Hi, You should be able to index Notes databases through JDBC, either with DIH or ManifoldCF. Have not tried myself though. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 5. sep. 2011, at 12:28, Tobias Berg wrote: >

Re: Date Faceting on Solr 3.1

2011-09-06 Thread Jan Høydahl
Hi, Note that if you want more control over the buckets, you may use facet.query instead. Also, under development is SOLR-2366 which will eventually give a more powerful gap specification to range facets. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Stemming and other tokenizers

2011-09-11 Thread Jan Høydahl
u set langid.override=false, languid.map=true and populate your "language" field with the known language, you will probably get the desired effect. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 10. sep. 2011, at 03:24, Patr

Re: Running solr on small amounts of RAM

2011-09-11 Thread Jan Høydahl
isallowing certain type of queries from the clients (such as wildcard, sorting, fuzzy etc) to avoid getting int high-mem situations. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 11. sep. 2011, at 17:59, Erick Erickson wrote: &g

Re: Solr and DateTimes - bug?

2011-09-11 Thread Jan Høydahl
, which version of Solr do you use? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. sep. 2011, at 00:28, Nicklas Overgaard wrote: > Hi everybody, > > I just started playing around with solr, however i'm facin

Re: Stemming and other tokenizers

2011-09-12 Thread Jan Høydahl
Hi Everybody else use dedicated field per language, so why can't you? Please explain your use case, and perhaps we can better help understand what you're trying to do. Do you always know the query language in advance? -- Jan Høydahl, search solution architect Cominvent AS - www.com

Re: select query does not find indexed pdf document

2011-09-12 Thread Jan Høydahl
Hi, What do you get from a query http://www/SearchApp/select/?q=*:* or http://www/SearchApp/select/?q=id:vpn ? You may not have mapped the fields correctly to your schema? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. sep

Re: Stemming and other tokenizers

2011-09-12 Thread Jan Høydahl
s multiple languages within the same field, e.g. "body", say one paragraph of English and the next is Norwegian, then we currently do not have any capability in Solr to apply different analysis (tokenization, stemming etc) to each paragraph. -- Jan Høydahl, search solution ar

Re: Slowdown on adds with Solr 3.4 compared to 1.4

2011-10-07 Thread Jan Høydahl
Hi, Have you tried to do a commit after the deleteByQuery only? Also, what seems to cause the slowdown? Any hints from the logs? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 7. okt. 2011, at 10:04, Willem Basson wrote: >

Re: Slowdown on adds with Solr 3.4 compared to 1.4

2011-10-07 Thread Jan Høydahl
mem/cpu usage. Report back what you find. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 7. okt. 2011, at 15:38, Jan Høydahl wrote: > Hi, > > Have you tried to do a commit after the deleteByQuery only? > Also, what seems

Re: Please help - Solr Cell using 'stream.url'

2011-10-07 Thread Jan Høydahl
Hi, What Solr version? How often do you do commits, or do you use autocommit? What kind and size of docs? Do you feed from a Java program? Where is the read timeout occurring? Can you paste in some logs? How much RAM on your server, and how much did you give to the JVM? -- Jan Høydahl, search

In-document highlighting DocValues?

2011-10-10 Thread Jan Høydahl
what code needs to be written? I don't currently see how to get a complete list of matches for a particular document. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Filter Query highlight

2011-10-10 Thread Jan Høydahl
Hi, The highlighter will only highlight words from your main query. So to get highlighting for your example, add a query in "q" with the words you need highlighted: .../solr/select?fq=type:cat&q=type:cat&hl=on&hl.fl=type -- Jan Høydahl, search solution

Re: Please help - Solr Cell using 'stream.url'

2011-10-10 Thread Jan Høydahl
onsider upgrading to latest Solr... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 10. okt. 2011, at 21:02, Tod wrote: > On 10/07/2011 6:21 PM, � wrote: >> Hi, >> >> What Solr version? > > Solr Impl

Re: Pls help :-) ! calling external ws/db to fetch field instead of own index?

2011-10-11 Thread Jan Høydahl
y. The idea is that these will be updateable without reindexing, but I don't think they are yet. See http://www.slideshare.net/lucenerevolution/willnauer-simon-doc-values-column-stride-fields-in-lucene for more. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr T

Re: In-document highlighting DocValues?

2011-10-11 Thread Jan Høydahl
to iterate through every single matching term in a field and pull out the payloads? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 10. okt. 2011, at 16:19, Jan Høydahl wrote: > Hi, > > We index structured documents, wit

Re: Please help - Solr Cell using 'stream.url'

2011-10-12 Thread Jan Høydahl
Latest version is 3.4, and it is fairly compatible with 1.4.1, but you have to reindex. First step migration can be to continue using your 1.4 schema on new solr.war (and SolrJ), but I suggest you take a few hours upgrading your schema and config as well. -- Jan Høydahl, search solution

Re: updating schema.xml in production solr, multiple cores

2011-10-13 Thread Jan Høydahl
=coreB - See http://wiki.apache.org/solr/CoreAdmin#SWAP or alternatively do the name swap in your frontend and feeder and keep track of whether A or B is live. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. okt. 2011, at 19

Re: In-document highlighting DocValues?

2011-10-14 Thread Jan Høydahl
only need offset info, not the snippets/fragments from the stored field. But I have not looked at the Highlighter code. Perhaps we could fork it into a new search component which pulls out only the necessary meta info and payloads for us and returns it to client? -- Jan Høydahl, search solution

Re: Filter Question

2011-10-14 Thread Jan Høydahl
currently need separate fields for. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 14. okt. 2011, at 09:17, Steven A Rowe wrote: > Hi Monica, > > AFAIK there is nothing like the filter you've described, an

Re: Callback on starting solr?

2011-10-16 Thread Jan Høydahl
Hi, This depends on your application server and config. A very simple option is to let your client poll with a ping request http://localhost:8983/solr/admin/ping/ until it succeeds. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Callback on starting solr?

2011-10-16 Thread Jan Høydahl
give your VM more memory and see if indexing succeeds then. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 16. okt. 2011, at 20:38, Jithin wrote: > I am doing something similar to that. checking netstat for any connection o

Re: millions of records problem

2011-10-17 Thread Jan Høydahl
t to fail or best case be veery slow. If you have a beefy server I'd first try putting in 64Gb RAM, slim down your schema and perhaps even switch to Solr4.0(trunk) which is more RAM efficient. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.sol

Re: OS Cache - Solr

2011-10-17 Thread Jan Høydahl
/SolrPerformanceFactors ? You should also slim down your index maximally by setting stored=false and indexed=false wherever possible. I would also upgrade to a more current Solr version. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. okt

Re: Callback on starting solr?

2011-10-17 Thread Jan Høydahl
Check your SolrConfig whether your ping handler is not configured http://wiki.apache.org/solr/SolrConfigXml#The_Admin.2BAC8-GUI_Section -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. okt. 2011, at 20:07, Jithin wrote

Re: How to change default operator in velocity?

2011-10-18 Thread Jan Høydahl
tp://localhost:8983/solr/browse?q=samsung+maxtor&mm=0 For more about mm, see http://wiki.apache.org/solr/DisMaxQParserPlugin#mm_.28Minimum_.27Should.27_Match.29 NB: In trunk (4.0), even dismax will respect the defaultOperator from schema. -- Jan Høydahl, search solution arch

Re: How to change default operator in velocity?

2011-10-18 Thread Jan Høydahl
fferent fields. See http://wiki.apache.org/solr/DisMaxQParserPlugin to learn more about DisMax. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. okt. 2011, at 15:56, hadi wrote: > thanks for your reply,i delete the dismax conf from so

Re: case insensitive searches

2011-10-31 Thread Jan Høydahl
stead of the lowercasing tokenizer. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 31. okt. 2011, at 02:23, Mark Juszczec wrote: > Hello all > > According to the docs, I need to use solr.LowerCaseTokenizerFactory > &

Re: [Profiling] How to profile/tune Solr server

2011-10-31 Thread Jan Høydahl
ng service from Sematext which will give you graphs and all. So far it's free evaluation: http://sematext.com/spm/index.html Do you have a clue for why the indexing is slow? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 31

Re: Solr real-time update taking time

2011-11-02 Thread Jan Høydahl
d RAM to JVM? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 2. nov. 2011, at 03:58, vijay.sampath wrote: > Hi All, > > I recently started working on SOLR 3.3 and would need your expertise to > provide a solution.

Re: Deleting documents not shown in response?

2011-11-02 Thread Jan Høydahl
. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 2. nov. 2011, at 07:50, kiran.bodigam wrote: > I am trying to delete the document from index by using id. > http://myserver/solr/update?stream.body=id:2009-11-04\13\:51\:07.

Re: shard indexing

2011-11-02 Thread Jan Høydahl
name ONE of your cores as ".", meaning it would be the "default" core living at /solr/update, perhaps that is what you're looking for? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 2. nov. 2011, at

Re: shard indexing

2011-11-02 Thread Jan Høydahl
and once you get to december, you create a "november" core, and do a SWAP between "current"<->"november". Then your new core would now be "current" and you don't need to change URLs on the index client side. -- Jan Høydahl, search solution architec

Re: SOLRJ commitWithin inconsistent

2011-11-10 Thread Jan Høydahl
size, feed speed & frequency etc. What is happening during the 25 seconds when you experience the largest delays? Can you somehow profile your application or look at the logs? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 3.

Re: Solr Score Normalization

2011-11-16 Thread Jan Høydahl
Perhaps you can solve your usecase by playing with the new eDismax "boost" parameter, which multiplies the functions with the other score instead of adding. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 5. nov. 2011

Re: [ANNOUNCEMENT] Second Edition of the First Book on Solr

2011-11-22 Thread Jan Høydahl
Congratulations! Feel free to write a shorter version of the announcement text, suitable as a news teaser on the Solr site, and we'll try to update the site with new thumb and all. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtrainin

Re: date range in solr 3.1

2011-11-22 Thread Jan Høydahl
Hi, Long shot: Try f.date.facet.range.gap=%2B1DAY instead, in case your "+" was interpreted as space by your browser... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 22. nov. 2011, at 12:57, do3do3 wrote: >

solr-user@lucene.apache.org

2011-11-22 Thread Jan Høydahl
Why do you need spaces in the replacement? Try pattern="\+" replacement="plus" - it will cause the transformed charstream to contain as many tokens as the original and avoid the highlighting crash. -- Jan Høydahl, search solution architect Cominvent AS - www.cominve

Re: Solr 3.5 very slow (performance)

2011-11-29 Thread Jan Høydahl
Hi, Perhaps you could try to remove the luceneMatchVersion from the 3.5 solrconfig again and use same schema version as for 1.4; to more closely emulate behavior of 1.4. LuceneMatchVersion will modify several defaults. Quick way to see if any of these new defaults make a change. -- Jan

Re: VelocityResponseWriter's future

2011-12-27 Thread Jan Høydahl
ject. I've also contributed patches to fix bugs and make it more usable. So unless a new and better alternative is in already in place (I love the idea of AJAX-ifying things), I vote for keeping VRW, but lazy loading it not to annoy people copying example/ around. -- Jan Høydahl, search solution ar

Re: Looking for a good commit/merge strategy

2011-12-27 Thread Jan Høydahl
Have a look at http://wiki.apache.org/solr/NearRealtimeSearch which will help you (in TRUNK/4.0) with an efficient in-memory handling of NRT changes. Combine this with CommitWithin for persisting to disk: http://wiki.apache.org/solr/CommitWithin. -- Jan Høydahl, search solution architect

Re: Custom content extractor for Solr Cell

2011-12-27 Thread Jan Høydahl
contents or similar. If you want to override the behaviour of Tika's parsing of certain MIME types, you can do this by specifying -Dtika.config= when starting Solr (3.5 or later), and it will obey your config. See Tika's web page for how to write your own parsers. -- Jan Høydahl, searc

Re: lower score for synonyms

2011-12-27 Thread Jan Høydahl
Hi, Also see discussion in https://issues.apache.org/jira/browse/LUCENE-3130 for possible future way to do this with one field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 6. des. 2011, at 13:47, Marc SCHNEIDER wrote

Re: spellcheck-index is rebuilt on commit

2012-01-02 Thread Jan Høydahl
Olivier, your log snippets did not make it into the mail. I think the mailing list strips attachments. Did you reload core or restart Jetty/Tomcat after your changes? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 2. jan. 2012

Re: spellcheck-index is rebuilt on commit

2012-01-02 Thread Jan Høydahl
. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 2. jan. 2012, at 22:22, Simon Willnauer wrote: > hey, is it possible that during those commits nothing has changed in > the index? I mean are you committing nevertheless the

Re: Using SOLR Autocomplete for addresses (i.e. multiple terms)

2012-01-03 Thread Jan Høydahl
idual tokens and weigh these as you choose, as well as implement phonetic normalization and other useful tricks. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 3. jan. 2012, at 00:52, Dave wrote: > Hi, > > I&

Re: Sorting results within the fields

2012-01-17 Thread Jan Høydahl
scoring at all or only by "external application"? Hope you see that being concrete is necessary for such questions. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. jan. 2012, at 19:38, aronitin wrote: > It&#x

Re: Facet auto-suggest

2012-01-17 Thread Jan Høydahl
. In the Audi example, you would disable buttons with 0 hits in the facet count. For more in depth, see http://java.dzone.com/news/complex-solr-faceting -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. jan. 2012, at 23:38, J

Re: "index-time" over boosted

2012-01-18 Thread Jan Høydahl
effect of Norms on your score will be that shorter fields score higher than long fields. Perhaps you instead can try to tell us your use-case. What kind of raning are you trying to achieve? Then we can help suggest how to get there. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Takes a while to see changes in data even after comit

2012-01-18 Thread Jan Høydahl
Hi, What Solr version? How many docs? What do you use as qutowarm count? If it's too high, it may take time. Do you use spellcheck and buildOnCommit? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. jan. 2012, at

Re: How to boost the relevancy of a field

2012-01-18 Thread Jan Høydahl
And using dismax query parser makes this easier: http://wiki.apache.org/solr/DisMaxQParserPlugin Example: q=solr&defType=edismax&qf=title^10 body^0.5 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012,

Re: conditional field weighting

2012-01-19 Thread Jan Høydahl
e because it has slightly higher weight :) Because of the TIE parameter, other matching fields will score 0.5X -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 05:00, Jack Kanaska wrote: > Hi Tim, > >

Re: "index-time" over boosted

2012-01-19 Thread Jan Høydahl
Hi, Can you paste exactly both and definitions from your schema? omitNorms="true" should kill norms. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 08:18, remi tassing wrote: > Hi, > >

Re: Tika0.10 language identifier in Solr3.5.0

2012-01-19 Thread Jan Høydahl
the search box. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 09:22, nibing wrote: > > Hi, all, > > > > I am using Solr3.5.0 which applies Tika0.10 to do language detection, > a

Re: "index-time" over boosted

2012-01-19 Thread Jan Høydahl
ome time in designing a schema, working with your queries, perhaps exploring DisMax query parser etc etc. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 13:01, remi tassing wrote: > Hello Jan, > > My schem

Re: Takes a while to see changes in data even after comit

2012-01-19 Thread Jan Høydahl
Hi, Try lowering your autowarm to, say, 25, and see if it helps. How often do you call commit? If you have too much warming so it takes longer time than time between commits, you're lost... You can check the stats admin page to see the autowarm time. -- Jan Høydahl, search solution arch

Re: 3.5.0 troubles

2012-01-19 Thread Jan Høydahl
Shouldn't it be literal.uid=foo, not ext.literal.uid ?? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 23:08, Wayne W wrote: > HI, > > I'm trying to setup the latest version of Solr. Curren

Re: How to import data from xml files to solr

2012-01-20 Thread Jan Høydahl
search result XML through XSLT update handler. See http://dl.dropbox.com/u/20080302/updateXml.xsl The curl command is: curl "http://localhost:8983/solr/update/xslt?commit=true&tr=updateXml.xsl"; -H "Content-Type: text/xml; charset=utf-8" --data-binary @searchres.xml -- Jan

Re: Tika0.10 language identifier in Solr3.5.0

2012-01-20 Thread Jan Høydahl
stead of reduction or stemming - then you can easily support full linguistics for 100 languages, indexed in the same field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 20. jan. 2012, at 18:15, Ted Dunning wrote: > I t

Re: Filtering search results by an external set of values

2012-01-23 Thread Jan Høydahl
the query. You may do it yourself or have some framework/plugin to it for you, see http://wiki.apache.org/solr/SolrSecurity#Document_Level_Security -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 23. jan. 2012, at 11:49, John

Re: ExractionHandler/Cell ignore just 2 fields defined in schema 3.5.0

2012-01-23 Thread Jan Høydahl
ct_id and company_id :) http://wiki.apache.org/solr/ExtractingRequestHandler#Input_Parameters -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 23. jan. 2012, at 22:26, Wayne W wrote: > Hi, > > Im been trying to figure th

Re: hot deploy of newer version of solr schema in production

2012-01-24 Thread Jan Høydahl
to do a more thorough upgrade of schema including new functionality and breaking changes, and then do a full reindex. The exception is if my index is huge and the reason for Solr upgrade or schema change is to fix a bug, not to use new functionality. -- Jan Høydahl, search solution architect

Re: "index-time" over boosted

2012-01-24 Thread Jan Høydahl
That looks right. Can you restart your Solr, do a new search with &debugQuery=true and copy/paste the full EXPLAIN output for your query? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 24. jan. 2012, at 13:22, remi tas

Re: "index-time" over boosted

2012-01-24 Thread Jan Høydahl
atures". Then change solr/example/solr/conf/schema.xml and add omitNorms="true" to these two fields. Then stop Solr, delete your index, start Solr, re-index the docs and try again. fieldNorm is now 1.0. Once you get that working you can start debugging where you got it wrong in

Re: Difference between #indexed documents and #results in *:* query

2012-01-25 Thread Jan Høydahl
Hi, No, it's not normal :) Have you tried to hit SHIFT-F5 to make sure you're not getting tricked by browser caching? Or try a slightly different query like id:* You can also visit the "Schema browser" page of Solr admin and check the stats on how many docs are in the in

Re: Using SOLR Autocomplete for addresses (i.e. multiple terms)

2012-01-25 Thread Jan Høydahl
ploaded an example to GitHub. See http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/ -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 3. jan. 2012, at 20:41, Dave wrote: > I've got another question for

Re: Commit and sessions

2012-01-27 Thread Jan Høydahl
developed, but not there yet: https://issues.apache.org/jira/browse/SOLR-2700 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 27. jan. 2012, at 13:05, Per Steffensen wrote: > Hi > > If I have added some document to

Highlighting: When alternateField does not exist

2013-01-09 Thread Jan Høydahl
selected Or do you have other workarounds for this problem on the solr side? In this case we cannot control the source DB to make sure the "teaser" exists. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: indexing Text file in solr

2013-01-29 Thread Jan Høydahl
index it -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 27. jan. 2013 kl. 11:23 skrev hadyelsahar : > i have a large Arabic Text File that contains Tweets each line contains one > tweet , that i want to index in solr such tha

<    1   2   3   4   5   6   7   8   9   10   >