Re: solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-07 Thread Jan Høydahl
solr start (or bin/solr start [options]) Your client would talk to Solr on typically http://host.name:8983/solr/ as a standalone server, not as one out of many webapps on 8080. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 7. okt. 2016 kl. 02.32 skrev Alexan

Re: Upgrading from Solr cloud 4.1 to 6.2

2016-10-07 Thread Jan Høydahl
As Erick suggests, you should setup an empty 6.x environment, create an empty collection with shards=5 replicationFactor=3 and then re-index all your content from your data source. Once that is in, you can decommission your old cluster. -- Jan Høydahl, search solution architect Cominvent AS

Re: solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-10 Thread Jan Høydahl
ething you’d want to consider for upgrade when you upgrade Solr, so they fit better in INSTALL_DIR than in SOLR_HOME. But there may be other libs that provide some business logic and fit well in SOLR_HOME/lib. Jan

Re: [Solr 5.1.0] - Ignoring Whitespaces as delimiters

2016-10-13 Thread Jan Høydahl
Have you tried PatternTokenizer? Sendt fra min iPhone > Den 13. okt. 2016 kl. 04.03 skrev deniz : > > Hello, > > Are there any built-in tokenizers which will do sth like StandardTokenizer, > but will not tokenize on whitespace? > > e.g field:abc cde-rfg will be tokenized as "abc cde" and "rfg

Re: Public/Private data in Solr :: Metadata or ?

2016-10-18 Thread Jan Høydahl
https://wiki.apache.org/solr/SolrSecurity#Document_Level_Security <https://wiki.apache.org/solr/SolrSecurity#Document_Level_Security> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 18. okt. 2016 kl. 23.00 skrev John Bickerstaff : > > I have a

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread Jan Høydahl
docs, the change is immediate. I suppose ManifoldCF could start using DocValues for the ACL info and update those atomically much faster than re-indexing the content of every document. Anyone know if that would be feasible? -- Jan Høydahl, search solution architect Cominvent AS

Re: Spread SolrCloud across two locations

2017-06-07 Thread Jan Høydahl
was not proposing an *automatic* triggering of a reconfiguration script, but rather to have a script that someone runs manually in order to make sure one does not mess up the reconfiguration -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 2. jun. 2017 kl. 14

Re: Solr 6.5.1 crashing when too many queries with error or high memory usage are queried

2017-07-07 Thread Jan Høydahl
limiting for you, such as https://getkong.org/ <https://getkong.org/> You really need to debug what is the bottleneck in your case and try to fix that. Can you share your key numbers here so we can do a qualified guess? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.co

Re: help on implicit routing

2017-07-07 Thread Jan Høydahl
http://lucene.apache.org/solr/guide/6_6/shards-and-indexing-data-in-solrcloud.html <http://lucene.apache.org/solr/guide/6_6/shards-and-indexing-data-in-solrcloud.html> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 6. jul. 2017 kl. 03.15 skrev im...@

Re: index new discovered fileds of different types

2017-07-07 Thread Jan Høydahl
field as correct type without messing with field name prefixes. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 5. jul. 2017 kl. 15.23 skrev Thaer Sammar : > > Hi, > We are trying to index documents of different types. Document have different > fie

Re: uploading solr.xml to zk

2017-07-07 Thread Jan Høydahl
> ERROR: cp is not a valid command! Can you write the exact command you typed again? Once solr.xml is in zookeeper, solr will find it automatically. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 7. jul. 2017 kl. 21.31 skrev im...@elogic.pk: > > The d

Re: index new discovered fileds of different types

2017-07-09 Thread Jan Høydahl
points out, that’s probably where in your code that you should use Managed Schema REST API in order to 1. Query Solr for what fields are defined 2. If you need to index a field that is not yet in Solr, add it, using the correct field type (your app should know) 3. Push the data 4. Repeat -- Jan

Re: index new discovered fileds of different types

2017-07-10 Thread Jan Høydahl
adding a field to the managed schema on the fly once an unknown field is detected should work just fine! -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 10. jul. 2017 kl. 02.08 skrev Rick Leir : > > Jan > > I hope this is not off-topic, but I am cur

Re: Solr Analyzer for Vietnamese

2017-07-13 Thread Jan Høydahl
Cao, did you see this email from Eirik? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 1. jun. 2017 kl. 13.33 skrev Eirik Hungnes : > > Thanks Erick, > > Dat: > > Do you have more info about the subject? > > 2017-05-22 17:

Re: Creating a custom auth plugin for solr

2017-07-13 Thread Jan Høydahl
ck and ask questions when you’re stuck. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 13. jul. 2017 kl. 06.34 skrev srshaik : > > Hi, > > I am new to solr security and wanted some help / reference on how to > implement custom authentication and aut

[POLL] Solr Plugin Improvements - request for feedback

2017-07-21 Thread Jan Høydahl
65 [3] https://lucenesolrrevolution2017.sched.com/event/BAwj DISCLAIMER: This poll is conducted by Jan Høydahl, a Lucene/Solr committer and PMC member, for my LuceneRev talk, and not officially on behalf of the Lucene PMC. - Jan

Re: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE)

2017-10-15 Thread Jan Høydahl
I think Config API came in 5.0 through https://issues.apache.org/jira/browse/SOLR-6533 <https://issues.apache.org/jira/browse/SOLR-6533> -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 15. okt. 2017 kl. 15:29 skrev Florian Gleixner : > > On 13.10.2

[HelpWanted] Improve the PublicServers wiki page

2017-01-12 Thread Jan Høydahl
ebcity.fr/restaurants/recherche-lieu http://www.talenttube.co/JobSearch.do http://www.webcity.fr/ http://www.talenttube.co/CandidateSearch.do http://www.talenttube.co/StudentSearch.do http://www.talenttube.co/ -- Jan Høydahl, search solution architect Com

Re: retrieve ids of all indexed docs efficiently

2017-01-18 Thread Jan Høydahl
Don't know why you have mem problems. Can you paste in examples of full query strings during cursor mark querying? Sounds like you may be using it wrong. Or try exporting https://cwiki.apache.org/confluence/display/solr/Exporting+Result+Sets -- Jan Høydahl > Den 18. jan. 2017 kl. 21.

Re: Will Solr flush docs to disk when ram buffer is full (time of auto commit is not reached yet)?

2017-01-19 Thread Jan Høydahl
It will flush buffer to disk as a new segment without opening a new searcher. I guess trans-log will be rotated too, but not sure. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 19. jan. 2017 kl. 21.16 skrev Ziyuan Qin : > > Hi All, > > I'm

Re: Solr Cloud - How to maintain the addresses of the zookeeper servers

2017-01-26 Thread Jan Høydahl
all nodes 2. restart all ndoes -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 26. jan. 2017 kl. 14.30 skrev David Michael Gang : > > Hi all, > > I want to set up a solr cloud with x nodes and have 3 zookeepers servers. > As i understand the followi

bin/post and self-signed SSL

2017-02-05 Thread Jan Høydahl
against a CA signed Solr... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: bin/post and self-signed SSL

2017-02-13 Thread Jan Høydahl
Thanks for your answers. I was also able to work around it using cURL, but we should obviously fix bin/post to be as smart as bin/solr in parsing env.variables related to SSL and auth. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 7. feb. 2017 kl. 01.23 sk

[SECURITY] CVE-2017-3163 Apache Solr ReplicationHandler path traversal attack

2017-02-15 Thread Jan Høydahl
CVE-2017-3163: Apache Solr ReplicationHandler path traversal attack Severity: Moderate Vendor: The Apache Software Foundation Versions Affected: Solr 1.4 to 6.4.0 Description: When using the Index Replication feature, Solr nodes can pull index files from a master/leader node using an HTTP API w

Fwd: block join - search together at parent and childern

2017-03-16 Thread Jan Nekuda
p;defType=edismax&qf=id_number date_from date_to has fields first_name surname birthdate street_name house_number city&stopwords=true& lowercaseOperators=true But I believe it can be solved much better. X | A |\ | \ B \ \ C \ \ \ D Thanks for your help Jan

Re: block join - search together at parent and childern

2017-03-16 Thread Jan Nekuda
h /word/ in query make fq={!parent which=type:car}/word// / //and edismax with qf=color, first_country, power, name, country Thank you for your time:) Jan Dne 16.03.2017 v 20:00 Mikhail Khludnev napsal(a): Hello, It's hard to get into the problem. but you probably want to have dismax on ch

Re: block join - search together at parent and childern

2017-03-19 Thread Jan Nekuda
nd also parent documents in one condition with children. How I wrote I'm able solve it with another parent and then also doc A will be child and everything will work fine - but I would like to solve it better. Do you have or someone else another idea?:) Thanks Jan 2017-03-16 21:51 GMT+01:

Re: block join - search together at parent and childern

2017-03-22 Thread Jan Nekuda
Hi Mikhail, thank you very much - it's exactly what I need. When I have tried it first a had problem with spaces and it seems that it doesn't work, but now it works great. Thanks and have a nice day Jan 2017-03-21 10:11 GMT+01:00 Mikhail Khludnev : > Hello Jan, > If I get you

Spread SolrCloud across two locations

2017-05-23 Thread Jan Høydahl
intervention, re-configuring of ZK followed by a restart of the surviving Solr nodes in order to point to the “new” ZK. How have you guys solved such setups? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: Spread SolrCloud across two locations

2017-05-23 Thread Jan Høydahl
standalone (or setup two more) and then start Solr again with only one ZK. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 23. mai 2017 kl. 11.14 skrev Markus Jelsma : > > I would probably start by renting a VM at a third location to run Zookeeper. &g

Re: Spread SolrCloud across two locations

2017-05-24 Thread Jan Høydahl
Of course, they would also then need to make sure that DC1 does not boot up again before compatible change has been done there too. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 23. mai 2017 kl. 18.56 skrev Shawn Heisey : > > On 5/23/2017 10:12 AM, Sush

Re: Spread SolrCloud across two locations

2017-05-25 Thread Jan Høydahl
ould also be a showstopper for certain users who have a requirement to self-host things due to data sensitivity, and only have 2 DCs But if none of these are a concern then go for it! -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: Spread SolrCloud across two locations

2017-05-25 Thread Jan Høydahl
up, zk1dc1 and zk2dc1 are now not a majority and need to sync up from the others - After all is stable again, you flip two DC2 ZKs back to observers I may have misunderstood the observer thing here, but if this is at all doable, this should be scriptable with ssh or ansible quite easily? -- Jan Høy

Re: Spread SolrCloud across two locations

2017-05-29 Thread Jan Høydahl
already have such redundancy within one of the DCs, placing a 3rd ZK there is an ideal solution with probably good enough HA. If not, I’m looking for the 2nd best low-friction approach with software-only. Thanks for the input all! -- Jan Høydahl, search solution architect Cominvent AS - www.cominve

Re: Spread SolrCloud across two locations

2017-05-29 Thread Jan Høydahl
C1 starts operating alone once it comes up again, or will it try repeatedly to contact the Zk nodes in DC2 and then discover that they disagree about the voting authority of some nodes or what? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: document migrate

2014-03-25 Thread Jan Høydahl
Migrate is new in Solr 4.7. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. mars 2014 kl. 10:51 skrev Cihat güzel : > hi all, > > I have a test for document migrate. I followed this url: > https://cwiki.apache.org/confluence/display/solr/Col

Re: solr 4.x reindexing issues

2014-03-25 Thread Jan Høydahl
generally better to ask for ALL documents in one go and stream to the end rather than increasing offsets with new queries all the time - because high offsets/start can be time consuming, especially with multiple shards. If you increase the timeout enough you should be able to retrieve all documents in

Re: document migrate

2014-03-25 Thread Jan Høydahl
r. To find the reference guide for a released version of Solr, please follow this link ." The old wiki was littered with version numbers all over the place. If people wants to know which versions support a given feature they can refer to CHANGES. Jan

Re: Multilingual indexing, search results, edismax and stopwords

2014-03-25 Thread Jan Høydahl
apache.org/jira/browse/SOLR-3085 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 23. mars 2014 kl. 19:37 skrev Jack Krupansky : > Setting the default query operator to AND is the preferred approach: q.op=AND. > > That said, I'm not sure that counting

Re: How to secure Solr admin page?

2014-03-25 Thread Jan Høydahl
container to only require auth for this path and not for e.g. http://your.host:8983/solr/collection1/select -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 19. mars 2014 kl. 09:47 skrev Tony Xue : > Hi all, > > I was following the instructions in the official wiki

Classpath for Solr Plugins

2014-05-27 Thread Jan Nehring
ache.org/solr/SolrPlugins ? Thank you very much Jan

Re: ExtractRequestHandler, skipping errors

2013-10-21 Thread Jan Høydahl
Guido, can you point us to the Commons-Compress JIRA issue which reports your particular problem? Perhaps uncompress works just fine? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 18. okt. 2013 kl. 14:48 skrev Guido Medina : > Dont, commons compress 1.5 is bro

Re: Language detection for multivalued field

2013-10-22 Thread Jan Høydahl
t all, gone, missing? If you have a requirement for detecting language per field-value and then map those into multiple language specific fields, please add a JIRA feature request which will then be considered for future inclusion. -- Jan Høydahl, search solution architect Cominvent AS - www.com

Re: Any way to dynamically rename fields in the schema?

2013-10-22 Thread Jan Høydahl
API, in which you can programmatically add fields to the schema right before you need them: https://cwiki.apache.org/confluence/display/solr/Schema+API -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 22. okt. 2013 kl. 12:53 skrev jimi.hulleg...@svensktnaringsliv.se

Re: Language detection for multivalued field

2013-10-30 Thread Jan Høydahl
nguage for values in a multiValued field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 22. okt. 2013 kl. 14:16 skrev vatuska : > *Can you elaborate on your comment "There isn't tag indexed". Are you saying > that your multiValued "tag" f

Function Query does not work properly

2014-02-26 Thread Jan Wedding
thing wrong or is there a bug in the recip function? I am currently using Solr 4.5.1. Thanks for your help, Jan

Re: SolrCloud with client ssl

2014-10-08 Thread Jan Høydahl
Jetty/Tomcat? ...but perhaps keep the conversation going here until it is a confirmed bug :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 7. okt. 2014 kl. 06:57 skrev Sindre Fiskaa : > Followed the description > https://cwiki.apache.org/confluence/displa

Re: SolrCloud with client ssl

2014-10-09 Thread Jan Høydahl
the Overseer node, we need to find what kind of IOException is happening and where. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 8. okt. 2014 kl. 16:08 skrev Sindre Fiskaa : > Yes, running SolrCloud without SSL it works fine with the createNodeSet > param. I ru

Re: Mismatch in numFound in q=*:* query

2014-10-12 Thread Jan Høydahl
If you don't want "downtime", you could add a field to your schema, reload, do a full re-index on top of your existing index, and then delete all documents that were not updated, via a delelteByQuery, e.g.: indextime:[* TO NOW-1DAY] -- Jan Høydahl, search solution archite

Re: Which Solr releases contain SOLR-4470 (Security for inter-solr-node requests)

2014-11-03 Thread Jan Høydahl
You find the answer to such questions by looking at the state of the JIRA issue https://issues.apache.org/jira/browse/SOLR-4470 Staus: Open Fix version: Trunk Which means that this feature is not included in any released Solr version (yet). -- Jan Høydahl, search solution architect Cominvent

Re: /suggest through SolrJ?

2015-02-02 Thread Jan Høydahl
dList) res.getVal( i ); extractSpellCheckInfo( _spellInfo ); } Earlier the suggester was the same as the Spell component, but now with its own component, suggestions are put in "suggest". I think we're lacking a SuggestResponse.java for parsing suggest responses..?? -- Jan Høydahl, sea

Re: Can i use a "max of" instead of "sum of" here?

2015-02-02 Thread Jan Høydahl
I developed a max-score query parser for a customer some time ago, and we gave it back. This should be what you're looking for: https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-MaxScoreQueryParser -- Jan Høydahl, search solution architect Cominve

Re: New UI for SOLR-based projects

2015-02-02 Thread Jan Høydahl
Cool. For your information, there are multiple existing Solr proxies out there, one of them being mr. Smiley's one in Java. Also in PHP, Node etc. Here is one link, there are others as well https://github.com/evolvingweb/ajax-solr/wiki/Solr-proxies -- Jan Høydahl, search solution arch

Re: Sort on multivalued attributes

2015-02-09 Thread Jan Høydahl
See https://issues.apache.org/jira/browse/SOLR-2522 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 9. feb. 2015 kl. 10.30 skrev Flavio Pompermaier : > > In my use case it could be very helpful because I use the SIREn plugin to > index arbitrary JSON

Re: XML nested structure in solr

2015-02-09 Thread Jan Høydahl
/Uploading+Structured+Data+Store+Data+with+the+Data+Import+Handler)? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 9. feb. 2015 kl. 11.05 skrev Jens Mayer : > > Hey @all, > I have a problem with nested documentsin solr. > I like to import avery big XML file

Re: Sort on multivalued attributes

2015-02-09 Thread Jan Høydahl
Sure, vote for it. Number of votes do not directly make prioritized sooner. So you better also add a comment to the JIRA, it will raise committer's attention. Even better of course is if you are able to help bring the issue forward by submitting patches. -- Jan Høydahl, search sol

Re: Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Jan Høydahl
, it's a change. Did you try updating schema.xml for a specific collection using zkCli? Any issues? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. juni 2013 kl. 11:24 skrev Utkarsh Sengar : > But as when I launch a solr instance without "-Dbootstrap_con

Re: URL search and indexing

2013-06-25 Thread Jan Høydahl
Probably a good match for the RegExp feature of Solr (given that your url is not tokenized) e.g. q=url:/.*\.it$/ -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. juni 2013 kl. 12:17 skrev Flavio Pompermaier : > Hi to everybody, > I'm quite new to Solr

Re: Several Machines Communication Failure

2013-06-25 Thread Jan Høydahl
Hi, We cannot help you based on this brief email. Please provide a much more detailed description. Version of Solr, SolrCloud or not. How exactly have you done this move? Relevant configuration snippets, relevant log snippets of what goes wrong... -- Jan Høydahl, search solution architect

Re: URL search and indexing

2013-06-25 Thread Jan Høydahl
Sure you can query the url directly. Or if you choose you can split it up in multiple components, e.g. using http://lucene.apache.org/solr/4_3_0/solr-core/org/apache/solr/update/processor/URLClassifyProcessor.html -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25

Re: Set spellcheck field on query time?

2013-07-01 Thread Jan Høydahl
Check out http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.dictionary - you can define multiple dictionaries in the same handler, each with its own source field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 1. juli 2013 kl. 11:34 skrev Timo Schmidt

Re: Solr cloud date based paritioning

2013-07-02 Thread Jan Høydahl
node, and the same with searching. You achieve implicit routing by creating the collection without a numShards parameter. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 2. juli 2013 kl. 22:11 skrev kowish.adamosh : > Sure, I'ill measure results and come back if

Simple Moving Average of Query Durations

2013-07-03 Thread Jan Morlock
getStatistics() method inside solr/core/src/java/org/apache/solr/handler/RequestHandlerBase.java. Would you agree? Thank you very much. Best regards Jan -- View this message in context: http://lucene.472066.n3.nabble.com/Simple-Moving-Average-of-Query-Durations-tp4075312.html Sent from the Solr

Re: lang.fallback doesn't work when using lang.fallbackFields

2013-07-28 Thread Jan Høydahl
existing functionality, so it would be safe to introduce. Feel free to write a JIRA issue for it. A workaround could be to write a simple UpdateProcessor which removes any illegal value from langid.fallbackFields before the LangId processor. -- Jan Høydahl, search solution architect Cominvent

Re: ALIAS feature, can be used for what?

2013-10-04 Thread Jan Høydahl
Hi, I have been asked the same question. There are only DELETEALIAS and CREATEALIAS actions available, so is there a way to achieve uninterrupted switch of an alias from one index to another? Are we lacking a MOVEALIAS command? -- Jan Høydahl, search solution architect Cominvent AS

Re: Dynamically loading synonym dictionary for solr SynonymFilter

2013-10-09 Thread Jan Høydahl
Hi, Not as I know of. You'd probably want to subclass SynonymFilter* with your own DB aware implementation, and of course contribute this back :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 9. okt. 2013 kl. 23:31 skrev ALEX PKB : > Hi, > All of our s

Re: Using Solr from Github or SVN

2013-03-21 Thread Jan Høydahl
sync, choose the official SVN. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 21. mars 2013 kl. 10:31 skrev Furkan KAMACI : > I want to branch Solr (latest version) at my local and implement some > custom codes. After some time(

Re: Slow queries for common terms

2013-03-21 Thread Jan Høydahl
"book", how much RAM, whether you utilize disk caching well enough and many other things which could affect this situation. But the pure fact that only a few common search words trigger such a delay would suggest commongrams as a possible way forward. -- Jan Høydahl, search solution arch

Re: Using Solr from Github or SVN

2013-03-21 Thread Jan Høydahl
Committers mostly use ant+ivy, so that will most likely be more stable. Both Eclipse and IDEA are frequently in use by committers so both are safe choices. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 21. mars 2013 kl. 11:10

Re: Using Solr from Github or SVN

2013-03-21 Thread Jan Høydahl
If you read the HowToContribute page you will see it described there. I's supported, you just need to apply the patch with -p1 instead of -p0 http://wiki.apache.org/solr/HowToContribute#Working_With_Patches -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Tra

Re: Slow queries for common terms

2013-03-21 Thread Jan Høydahl
ptable indexing/query performance. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 21. mars 2013 kl. 12:43 skrev David Parks : > We have 300M documents, each about a paragraph of text on average. The index > is 140GB in size. I&#

Re: Slow queries for common terms

2013-03-22 Thread Jan Høydahl
as possible. The real solution is to shard across N number of servers, until you reach the desired performance for the desired indexing/querying load. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 22. mars 2013 kl. 02:52 skrev Davi

Re: Boost query parameter with Lucid parser and using query FunctionQuery

2013-03-22 Thread Jan Høydahl
Why would you use dismax for the query() when you want to match a simple term to one field? If you share &echoParams=all the answer may lie somewhere therein? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 23. mars 2013 kl. 0

Re: NoSuchMethodError updateDocument

2013-03-22 Thread Jan Høydahl
Are you 100% sure you use the exact jars for 4.1.0 *everywhere*, and that you're not blending older versions from the Nutch distro in your classpath here? > Any ideas? BTW: What was your question here regarding Jetty vs Tomcat? -- Jan Høydahl, search solution architect Comi

Re: Boost query parameter with Lucid parser and using query FunctionQuery

2013-03-23 Thread Jan Høydahl
mple qq=title:treatment would do? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 23. mars 2013 kl. 01:26 skrev "Miller, Will Jr" : > This is the echo params... It looks like it ignores the qf in the > Functio

Re: NoSuchMethodError updateDocument

2013-03-23 Thread Jan Høydahl
just forget about it. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 23. mars 2013 kl. 14:34 skrev Furkan KAMACI : > Hi Jan; > > I will check the jar versions. By the way I think that I should create a > solr home

Recommendation for integration test framework

2013-03-24 Thread Jan Morlock
e are currently thinking about using the maven failsafe plugin. Does anybody have experiences with using it in combination with solr? Or does somebody have a better recommendation for us? Thank you very much in advance Jan -- View this message in context: http://lucene.472066.n3.nabbl

Re: Debugging Map Reduce Jobs at Solr

2013-03-26 Thread Jan Høydahl
Hi, Please elaborate your question. Solr does not have any M/R jobs, so you have to let us know your setup. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 26. mars 2013 kl. 11:34 skrev Furkan KAMACI : > Is there any easy

Re: Add fuzzy to edismax specs?

2013-04-02 Thread Jan Høydahl
Note that the "pf" field already parses this syntax as of 4.0, but then it is used as a phrase-slop value. You could probably use same parsing code for qf. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 29. mars 2013

Re: Solr Phonetic Search Highlight issue in search results

2013-04-02 Thread Jan Høydahl
If you want to highlight, you need to turn on highlighting for the actual field you search, and that field needs to be stored, i.e. &hl.fl=ContentSearchPhonetic -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 1. apr. 2013 kl. 1

Re: Question on Exact Matches - edismax

2013-04-03 Thread Jan Høydahl
Can you show us your *_ci field type? Solr does not really have a way to tell whether a match is "exact" or only partial, but you could hack around it with the fieldType. See https://github.com/cominvent/exactmatch for a possible solution. -- Jan Høydahl, search solution architect Co

Re: Solr Tika Override

2013-04-03 Thread Jan Høydahl
nt. This is possible with a customized ExtractingHandler too, but it will be uglier and harder to test. With a standalone indexer application you can write unit tests for all the special parsing requirements. see http://tika.apache.org for more. -- Jan Høydahl, search solution architect Co

Re: Time estimate Solr Installation

2013-04-07 Thread Jan Høydahl
would definitely be more than 2 days. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 7. apr. 2013 kl. 21:27 skrev Gruxter : > Hi all, > > I hope this is teh right place to post this. I want to use Apache Solr > s

Re: Where to use replicationFactor and maxShardsPerNode at SolrCloud?

2013-04-22 Thread Jan Høydahl
new node manually, don't you? Is there a JIRA to auto-balance shards? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 22. apr. 2013 kl. 13:04 skrev Erick Erickson : > 1) Imagine you have lots and lots and lots of different

Re: Export Index and Re-Index XML

2013-04-23 Thread Jan Høydahl
ich is a stylesheet for indexing response XML directly. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 23. apr. 2013 kl. 02:11 skrev Kalyan Kuram : > Thank you all very much for your help.I do have field configured as stored &g

Re: Listing Priority

2013-04-24 Thread Jan Høydahl
Hi, Check out the new RegexpBoostProcessor https://lucene.apache.org/solr/4_2_0/solr-core/org/apache/solr/update/processor/RegexpBoostProcessor.html which does exactly this based on a config file -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: How to let Solr load libs from within my JAR?

2013-04-24 Thread Jan Høydahl
Hi, Java class loader does not support JAR within JAR. You'll have to unpack both JARs and then JAR them together as one. Or simply give several JARs to Solr, that's the easiest. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraini

Re: SOLR 4.3

2013-04-24 Thread Jan Høydahl
As you can see on the issue, it is already fixed for 4.3 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 24. apr. 2013 kl. 07:02 skrev William Bell : > Can we get this in please to 4.3? > > https://issues.apache.org/ji

Re: Fields issue 4.2.1

2013-04-24 Thread Jan Høydahl
Hi, Have you tried fl=*_user ? I think fl may try to interpret the number as a function. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 24. apr. 2013 kl. 07:16 skrev William Bell : > I am getting no results when using dyna

Re: Solr 3.6.1: changing a field from stored to not stored

2013-04-24 Thread Jan Høydahl
down the cluster to do that, just re-index on top of what's there already, and your index will become smaller and smaller as merging kicks out the old data :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 24. apr. 2013 kl. 15:5

Re: Indexing PDF Files

2013-04-24 Thread Jan Høydahl
In your schema you have written > class="solr.StrField" /> Note that XML tag and param names are case sensitive, so instead of fieldtype you should use fieldType I see that you have the same error for several fieldTypes in your schema, probably resulting in other similar err

Re: Cloudspace and Solr Support Page

2013-04-25 Thread Jan Høydahl
Hi, Just give your WIKI user name and we'll give you access to edit that page to add yourself. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 25. apr. 2013 kl. 21:39 skrev Nina Talley : > Hi there, > > We offe

Re: Document is missing mandatory uniqueKey field: id for Solr PDF indexing

2013-04-26 Thread Jan Høydahl
http://wiki.apache.org/solr/post.jar -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 26. apr. 2013 kl. 13:28 skrev Furkan KAMACI : > Hi Raymond; > > Now I get that error: SimplePostTool: WARNING: IOException whil

Re: How to define a generic field to hold all undefined fields

2013-04-26 Thread Jan Høydahl
+request+handler&fc_project=Solr Reading that wiki page you'll notice the parameters uprefix and defaultField which would both be ways to solve your problem. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 26. apr. 2013 kl. 15

Re: Not In query

2013-04-26 Thread Jan Høydahl
cation, but if it is highly likely that a single user will add excludes for several thousand ids then you should perhaps consider other options and benchmark up front. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 26. apr. 2013 k

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Jan Høydahl
Hi, How many shards do you have? This is a known issue with deep paging with multi shard, see https://issues.apache.org/jira/browse/SOLR-1726 You may be more successful in going to each shard, one at a time (with &distrib=false) to avoid this issue. -- Jan Høydahl, search solution archi

Re: Not In query

2013-04-30 Thread Jan Høydahl
adding all the individual docids. Anyway, I'd start with the simple approach and then optimize once you (perhaps, perhaps not) bump into problems. Most likely it will work like a charm :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.co

Re: string field does not yield exact match result using qf parameter

2013-04-30 Thread Jan Høydahl
Hi, The "pf" feature will only kick in for phrases, i.e. multiple tokens. Per definition a "string" is one single token, so it will never kick in for strings. A workaround can be found here: https://github.com/cominvent/exactmatch -- Jan Høydahl, search solution ar

Re: Security for inter-solr-node requests

2013-05-02 Thread Jan Høydahl
This feature is not yet part of Solr, but a feature under development in SOLR-4470. We encourage you to try it out and report back what worked best for you. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 2. mai 2013 kl. 13:58

Re: string field does not yield exact match result using qf parameter

2013-05-02 Thread Jan Høydahl
Hi, You can try to increase the "pf" boost for your string field, I don't think you'll have success in having it boosted with pf since it's a string? Check explain output with &debugQuery=true and see whether you get a phrase boost. -- Jan Høydahl, search so

<    1   2   3   4   5   6   7   8   9   10   >