Re: Questions developing custom functionquery

2013-10-10 Thread Mikhail Khludnev
Hello JT, what is the field and fieldType definition for "resname"? Can you check how '/some example/data/here/2013/09/12/testing.text' is handled on the analysis page in SolrAdmin? On Fri, Oct 11, 2013 at 4:53 AM, Richard Lee wrote: > seems what u got is the terms other than the raw data.

Re: SolrCore 'collection1' is not available due to init failure

2013-10-10 Thread Liu Bo
org.apache.solr.core.SolrCore.<init>(SolrCore.java:821) ... 13 more Caused by: org.apache.lucene.store.LockObtainFailedException: Lock obtain timed out: NativeFSLock@/usr/share/solr-4.5.0/example/solr/collection1/data/index/write.lock: java.io.FileNotFoundException: /usr/share/solr-4.5.0/example/solr/co

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Bill Bell
Does this work ? I can suggest -XX:-UseLoopPredicate to switch off predicates. ??? Which version of 7 is recommended ? Bill Bell Sent from mobile > On Oct 10, 2013, at 11:29 AM, "Smiley, David W." wrote: > > *Don't* use JDK 7u40, it's been known to cause index corruption and > SIGSEGV faults

SolrCore 'collection1' is not available due to init failure

2013-10-10 Thread Jim_Armstrong
noob trying to get solr rolling. localhost_8080_/solr returns a 500. I've been googling this msg. Thinking I don't have the solr home defined. Need a hint. Using this link as a guide. http://www.thegeekstuff.com/2013/04/install-apache-solr-tomcat/ Debian7, Oracle Java 1.7_25, Tomcat 7.42, Solr-4

Re: Find documents that are composed of % words

2013-10-10 Thread shahzad73
is there a way that i build a plugin that gets all words on a single page and build a percentage to see how many words are foreign on the page (words not on the search list) -- View this message in context: http://lucene.472066.n3.nabble.com/Find-documents-that-are-composed-of-words-tp4094264

Question about plug-in update handler failure

2013-10-10 Thread Jack Park
I have an "interceptor" which grabs SolrDocument instances in the update handler chain. It feeds those documents as a JSON string out to an agent system. That system has been running fine all the way up to Solr 4.3.1 I have discovered that, as of 4.4 and now 4.5, the very same config files, agent

Re: Multiple schemas in the same SolrCloud ?

2013-10-10 Thread xinwu
Thank you,Shawn. Thank you,everyone. All your answers help me a lot.Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-same-SolrCloud-tp4094279p4094850.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Using split in updateCSV for SolrCloud 4.4

2013-10-10 Thread Yonik Seeley
Perhaps try adding echoParams=all to check that all of the input params are being parsed as expected. -Yonik On Thu, Oct 10, 2013 at 8:10 PM, Utkarsh Sengar wrote: > Didn't help. > > This is the complete data: https://gist.github.com/utkarsh2012/6927649 (see > merchantList column). > I tried thi

Re: Using split in updateCSV for SolrCloud 4.4

2013-10-10 Thread Jack Krupansky
Yeah, I noticed that too and tried that as well, and tried it in 4.4 as well, and tried a raw URL in the Chrome browser as well, and it all works fine. Did you try my example with the standard Solr example schema? I'm trying to see what's different from your example and mine, other than the

Re: Questions developing custom functionquery

2013-10-10 Thread Richard Lee
seems what u got is the terms other than the raw data. maybe u should check the api docs for more details On 2013-10-11 at 3:56 AM, "JT" wrote: > I'm running into some issues developing a custom functionquery. > > My goal is to be able to implement a custom sorting technique. > > I have a field defined call

Re: Using split in updateCSV for SolrCloud 4.4

2013-10-10 Thread Utkarsh Sengar
@Jack I just noticed in your example that feat1:feat2 is not in an encapsulator ". Was that a typo or intentional? You are passing f.features.encapsulator=%22 but don't have " around feat1:feat2. The request should look like: curl "http://localhost:8983/solr/update/csv?commit=true&f.features.split=t

Re: Using split in updateCSV for SolrCloud 4.4

2013-10-10 Thread Utkarsh Sengar
Didn't help. This is the complete data: https://gist.github.com/utkarsh2012/6927649 (see merchantList column). I tried this URL: curl 'http://localhost/solr/coll1/update/csv?commit=true&f.merchantList.split=true&f.merchantList.separator=%3A&f.merchantList.encapsulator=%22&escape=\&stream.contentT

Re: Using split in updateCSV for SolrCloud 4.4

2013-10-10 Thread Jack Krupansky
Using the standard Solr example for Solr 4.5, the following works, splitting the features CSV field into multiple values: curl "http://localhost:8983/solr/update/csv?commit=true&f.features.split=true&f.features.separator=%3A&f.features.encapsulator=%22" -H "Content-Type: text/csv" -d ' id,na
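
The per-field CSV parameters used in this thread can also be assembled programmatically, which avoids hand-encoding the separator and encapsulator. A minimal Python sketch, assuming the same host, port, and "features" field as the example; the helper name csv_update_url is hypothetical and not part of Solr.

```python
from urllib.parse import urlencode


def csv_update_url(base, field, separator=":", encapsulator='"'):
    """Build an /update/csv URL that splits one CSV column into multiple values.

    `base` and `field` are placeholders; adjust them to your own core and schema.
    """
    params = {
        "commit": "true",
        f"f.{field}.split": "true",
        f"f.{field}.separator": separator,        # ":" is sent as %3A
        f"f.{field}.encapsulator": encapsulator,  # '"' is sent as %22
    }
    return f"{base}/update/csv?{urlencode(params)}"


url = csv_update_url("http://localhost:8983/solr", "features")
print(url)
```

The CSV body itself would still be sent with Content-Type: text/csv, as in the curl command.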

Re: Find documents that are composed of % words

2013-10-10 Thread Jack Krupansky
Or if they simply copied the requirement from some other search engine or text analytics package, get the name of that other software and supply us with a URL pointing to the description of that feature. Ditto if the feature was described in an academic paper. Solr does have a lot of text anal

Using split in updateCSV for SolrCloud 4.4

2013-10-10 Thread Utkarsh Sengar
Hello, I am trying to use split: http://wiki.apache.org/solr/UpdateCSV#split while loading some csv data via updateCSV. This is the field: This is the column in CSV (merchantList): values,"16179:10950",.values.. This is the URL I call: http://localhost/solr/coll1/update/csv?commit

Questions developing custom functionquery

2013-10-10 Thread JT
I'm running into some issues developing a custom functionquery. My goal is to be able to implement a custom sorting technique. I have a field defined called resname, it is a single value str. Example: /some example/data/here/2013/09/12/testing.text I would like to do a custom sort based on this

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
Hmm, thanks for sharing this, according to the JIRA comments it seems to affect only 32 bits systems? https://issues.apache.org/jira/browse/LUCENE-5212 Guido. On 10/10/13 18:29, Smiley, David W. wrote: *Don't* use JDK 7u40, it's been known to cause index corruption and SIGSEGV faults with Luc

Re: Find documents that are composed of % words

2013-10-10 Thread shahzad73
Yes, the correct answer may be "Why?", but you cannot ask the client that. He thinks there is something interesting in this formula, and if it works we can index websites with Nutch + Solr and let users input queries that can locate documents which has % of foreign words other than list pr

Re: which jar contains org.apache.solr.request.XSLTResponseWriter ?

2013-10-10 Thread Shawn Heisey
On 10/10/2013 12:18 PM, MC wrote: Shawn thank you for your help. I had another look at my settings, and although classpath and code were OK, I found this entry in solrconfig.xml: class="org.apache.solr.request.XSLTResponseWriter"> That was the problem. So you were right, it was a relic of the

Re: which jar contains org.apache.solr.request.XSLTResponseWriter ?

2013-10-10 Thread MC
Shawn thank you for your help. I had another look at my settings, and although classpath and code were OK, I found this entry in solrconfig.xml: class="org.apache.solr.request.XSLTResponseWriter"> That was the problem. So you were right, it was a relic of the old version. My code started workin

Re: which jar contains org.apache.solr.request.XSLTResponseWriter ?

2013-10-10 Thread Shawn Heisey
On 10/10/2013 10:15 AM, MC wrote: My embedded solr server (4.4.0) is crashing when I submit a query. The reason is this: Caused by: java.lang.ClassNotFoundException: org.apache.solr.request.XSLTResponseWriter I have the following jars in my classpath: solr-core-4.4.0.jar solr-solrj-4.4.0.jar s

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Smiley, David W.
*Don't* use JDK 7u40, it's been known to cause index corruption and SIGSEGV faults with Lucene: LUCENE-5212 This has not been unnoticed by Oracle. ~ David On 10/10/13 12:34 PM, "Guido Medina" wrote: >2. Java version: There are huge performance gains between Java 5, 6 >and 7; we use Ora

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
Even if you use a newer jetty, you won't see any difference. What you should care about: 1. Jetty using NIO, which is enabled by default on Jetty coming from Ubuntu repo (I think) 2. Java version: There are huge performance gains between Java 5, 6 and 7; we use Oracle JDK 7u40. 3. JVM options

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
Why not just go straight for 4.5.0? Also, we have it running with JDK 7, we use Oracle's JVM sym-linked to /opt/java, that would be an extra step (modifying the JAVA_HOME at /etc/default/jetty), but it is worth doing. Glad I helped, cheers, Guido. On 10/10/13 16:57, Peter Schmidt wrote: SOLVED:

Re: Multiple schemas in the same SolrCloud ?

2013-10-10 Thread Liu Bo
you can try this way: start zookeeper server first. upload your configurations to zookeeper and link them to your collection using zkcli just like shawn said let's say you have conf1 and conf2, you can link them to collection1 and collection2 remove the bootstrap stuff and start solr server. a

which jar contains org.apache.solr.request.XSLTResponseWriter ?

2013-10-10 Thread MC
Hello, My embedded solr server (4.4.0) is crashing when I submit a query. The reason is this: Caused by: java.lang.ClassNotFoundException: org.apache.solr.request.XSLTResponseWriter I have the following jars in my classpath: solr-core-4.4.0.jar solr-solrj-4.4.0.jar solr-dataimporthandler-4.4.0.

Re: Multiple schemas in the same SolrCloud ?

2013-10-10 Thread Upayavira
Use $solrzip/example/cloud-scripts/zkcli.sh to upload a new set of configuration files. Upayavira On Thu, Oct 10, 2013, at 04:57 PM, maephisto wrote: > On this topic, once you've uploaded you collection's configuration in ZK, > how > can you update it? > Upload the new one with the same config na
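
The upload step mentioned here can be sketched as a command line built in Python. This assumes the upconfig command and the -zkhost, -confdir and -confname options of the zkcli.sh script shipped with Solr 4.x; the ZooKeeper address, directory and config name below are placeholders, and the helper name is hypothetical.

```python
def zkcli_upconfig(zkcli, zk_host, conf_dir, conf_name):
    """Return the argv for uploading a config directory to ZooKeeper.

    Uploading again under the same conf_name is how an existing
    configuration would be replaced.
    """
    return [
        zkcli,
        "-cmd", "upconfig",
        "-zkhost", zk_host,
        "-confdir", conf_dir,
        "-confname", conf_name,
    ]


cmd = zkcli_upconfig(
    "example/cloud-scripts/zkcli.sh",    # path relative to the unpacked Solr archive
    "localhost:2181",                    # placeholder ZooKeeper address
    "/opt/Solr_home/collection1/conf",   # placeholder config directory
    "conf1",                             # placeholder config name
)
print(" ".join(cmd))
# To actually run it: subprocess.run(cmd, check=True)
```

Collections that use the config generally need a reload before they pick up the new files.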

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
SOLVED: I reinstalled the jetty package, because i wanted to get a clean installation. And now it's working with your hint Guido :-) Thank you very much. What i did: # Download Solr wget http://archive.apache.org/dist/lucene/solr/4.4.0/solr-4.4.0.tgz tar xvfz solr-4.4.0.tgz # Extract content fro

Re: Multiple schemas in the same SolrCloud ?

2013-10-10 Thread maephisto
On this topic, once you've uploaded your collection's configuration in ZK, how can you update it? Upload the new one with the same config name? -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-same-SolrCloud-tp4094279p4094729.html Sent from the Solr -

Re: Multiple schemas in the same SolrCloud ?

2013-10-10 Thread Shawn Heisey
On 10/10/2013 2:45 AM, xinwu wrote: > I solved that problem by using the "-Dbootstrap_conf=true" configuration > instead of the "-Dbootstrap_confdir=/opt/Solr_home/collection1/conf > -Dcollection.configName=solrConfig" on startup. You do not need this bootstrap option either. Once your configurat

Re: Please help!, Highlighting exact phrases with solr

2013-10-10 Thread Koji Sekiguchi
(13/10/10 18:17), Silvia Suárez wrote: I am using solrj as client for indexing documents on the solr server I am new to solr, And I am having problem with the highlighting in solr. Highlighting exact phrases with solr does not work. For example if the search keyword is: "dulce hogar" it returns:

Re: Re: feedback on Solr 4.x LotsOfCores feature

2013-10-10 Thread Soyez Olivier
The corresponding patch for Solr 4.2.1 LotsOfCores can be found in SOLR-5316, including the new Cores options: - "numBuckets" to create a subdirectory based on a hash on the corename % numBuckets in the core Datadir - "Auto" with 3 different values: 1) false : default behaviour 2) createLo
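
To illustrate the "hash on the corename % numBuckets" idea, here is a small Python sketch. It is not the code from the SOLR-5316 patch: the hash function (CRC32) and the directory layout are assumptions chosen only to show how bucketing spreads many cores over a fixed number of subdirectories.

```python
import zlib


def bucket_dir(data_dir, core_name, num_buckets):
    """Pick a subdirectory for a core by hashing its name into num_buckets.

    CRC32 is a stand-in; the actual patch may use a different hash.
    """
    bucket = zlib.crc32(core_name.encode("utf-8")) % num_buckets
    return f"{data_dir}/{bucket}/{core_name}"


for name in ("core_0001", "core_0002", "core_0003"):
    print(bucket_dir("/var/solr/data", name, 16))
```

The same core name always maps to the same bucket, so the directory can be recomputed from the name alone.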

Re: Solr auto suggestion not working

2013-10-10 Thread Jason Hellman
Very specifically, what is the field definition that is being used for the suggestions? On Oct 10, 2013, at 5:49 AM, Furkan KAMACI wrote: > What is your configuration for auto suggestion? > > > 2013/10/10 ar...@skillnetinc.com > >> >> >> Hi, >> >> We are encountering an issue in solr sea

Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-10 Thread Jason Hellman
The best use case I see for atomic updates typically involves avoiding transmission of large documents for small field updates. If you are updating a "readCount" field of a PDF document that is 1MB in size you will avoid resending the 1MB PDF document's data in order to increment the "readCount"
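
As an illustration of how small such a request is, here is a Python sketch that builds the JSON body for an atomic increment. It assumes the unique key field is named "id" and uses the "inc" modifier of Solr 4 atomic updates; the document id is a placeholder and the helper name is hypothetical.

```python
import json


def atomic_increment(doc_id, field, amount=1):
    """Build a JSON update body that increments one field of one document."""
    return json.dumps([{"id": doc_id, field: {"inc": amount}}])


body = atomic_increment("pdf-1", "readCount")
print(body)
```

The body would be posted to the update handler with Content-Type: application/json. As discussed elsewhere in this thread listing, atomic updates depend on the other fields being stored.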

Re: Update existing documents when using ExtractingRequestHandler?

2013-10-10 Thread Jason Hellman
As an endorsement of Erick's like, the primary benefit I see to processing through your own code is better error-, exception-, and logging-handling which is trivial for you to write. Consider that your code could reside on any server, either receiving through a PUSH or PULLing the data from you

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
I added the jars you listed and recreated a solr.war 2013/10/10 Peter Schmidt > OK. I extracted the solr.war in the webapps folder. And then out.log reports > > 2013-10-10 16:47:28.184:WARN::Config error at class="java.io.PrintStream">| class="org.mortbay.util.RolloverFileOutputStream

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
OK. I extracted the solr.war in the webapps folder. And then out.log reports 2013-10-10 16:47:28.184:WARN::Config error at |/logs/_mm_dd.stderrout.logfalse90GMT| java.lang.reflect.InvocationTargetException 2013-10-10 16:47:28.191:WARN::EXCEPTION java.lang.reflect.InvocationTargetExce

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
Add the following jars inside solr.war WEB-INF/lib folder: jcl-over-slf4j-1.7.5.jar jul-to-slf4j-1.7.5.jar log4j-1.2.17.jar slf4j-api-1.7.5.jar slf4j-log4j12-1.7.5.jar Optional if you are going to use spatial search: jts-1.13.jar Guido. On 10/10/13 14:33, Peter Schmidt wrote: Sorry terminal s
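
A quick way to verify that a rebuilt solr.war really contains the logging jars listed above is to inspect the archive. A minimal Python sketch; the helper name missing_jars is hypothetical, and the demo uses an in-memory archive in place of a real war file.

```python
import io
import zipfile

# Jar names taken from the list above (jts-1.13.jar is optional, so it is left out).
REQUIRED_JARS = [
    "jcl-over-slf4j-1.7.5.jar",
    "jul-to-slf4j-1.7.5.jar",
    "log4j-1.2.17.jar",
    "slf4j-api-1.7.5.jar",
    "slf4j-log4j12-1.7.5.jar",
]


def missing_jars(war, required=REQUIRED_JARS):
    """Return the required jars that are not present under WEB-INF/lib."""
    with zipfile.ZipFile(war) as zf:
        present = {
            name.rsplit("/", 1)[-1]
            for name in zf.namelist()
            if name.startswith("WEB-INF/lib/")
        }
    return [jar for jar in required if jar not in present]


# Demo with an in-memory archive standing in for a real solr.war.
demo = io.BytesIO()
with zipfile.ZipFile(demo, "w") as zf:
    zf.writestr("WEB-INF/lib/log4j-1.2.17.jar", b"")
    zf.writestr("WEB-INF/lib/slf4j-api-1.7.5.jar", b"")
demo.seek(0)
print(missing_jars(demo))
```

For a real check, pass the path of the war file instead of the in-memory buffer.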

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Furkan KAMACI
Did you check here, as your log says: http://wiki.apache.org/solr/SolrLogging You have a NoClassDefFoundError for org/slf4j/Logger, and this is because there is a dependency mismatch in your libraries. 2013/10/10 Peter Schmidt > >>For repo Jetty (6.1.x) this is a good guide, which is v

Re: Solr Wiki Account

2013-10-10 Thread Joel Bernstein
Thanks! On Thu, Oct 10, 2013 at 9:48 AM, Stefan Matheis wrote: > Sure :) I've added it to https://wiki.apache.org/solr/AdminGroup > > -Stefan > > > On Thursday, October 10, 2013 at 3:41 PM, Joel Bernstein wrote: > > > Hi, > > > > Can the account JoelBernstein be granted permission to edit the So

Re: Solr Wiki Account

2013-10-10 Thread Stefan Matheis
Sure :) I've added it to https://wiki.apache.org/solr/AdminGroup -Stefan On Thursday, October 10, 2013 at 3:41 PM, Joel Bernstein wrote: > Hi, > > Can the account JoelBernstein be granted permission to edit the Solr Wiki? > > Thanks, > Joel > >

Solr Wiki Account

2013-10-10 Thread Joel Bernstein
Hi, Can the account JoelBernstein be granted permission to edit the Solr Wiki? Thanks, Joel

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
>>For repo Jetty (6.1.x) this is a good guide, which is very similar to what we have on our environments: >>http://www.kingstonlabs.com/blog/how-to-install-solr-36-on-ubuntu-1204/ Guido, i used this guide and ended up with t

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
Sorry terminal shows: $ sudo service jetty start Found JAVA=/usr/lib/jvm/java-6-openjdk/jre/bin/java in JAVA_HOME=/ Starting Jetty: FAILED 2013/10/10 Peter Schmidt > The log: > > 2013-10-10 15:01:47.071:INFO::jetty-6.1.22 > 2013-10-10 15:01:47.110:INFO::Deploy /etc/jetty/contexts/javadoc.xml -

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
The log: 2013-10-10 15:01:47.071:INFO::jetty-6.1.22 2013-10-10 15:01:47.110:INFO::Deploy /etc/jetty/contexts/javadoc.xml -> org.mortbay.jetty.handler.ContextHandler@2353f67e {/javadoc,file:/usr/share/doc/libjetty-java/api/} 2013-10-10 15:01:47.161:INFO::NO JSP Support for , did not find org.apache

Re: Find documents that are composed of % words

2013-10-10 Thread Upayavira
Right - aside from the interesting intellectual exercise, the correct question to ask is, "why?" Why would you want to do this? What's the benefit, and is there a way of doing it that is more in keeping with how Solr has been designed? Upayavira On Thu, Oct 10, 2013, at 01:17 PM, Erick Erickson

Re: Find documents that are composed of % words

2013-10-10 Thread Aloke Ghoshal
Something you could do via function queries. Performance (for 500+ words) is doubtful. 1) With a separate float field (myfieldwordcount) that holds the count of words from your query field (myfield): http://localhost:8983/solr/collection1/select?wt=xml&indent=true&defType=func&fl=id,myfield&q

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
For repo Jetty (6.1.x) this is a good guide, which is very similar to what we have on our environments: http://www.kingstonlabs.com/blog/how-to-install-solr-36-on-ubuntu-1204/ Guido. On 10/10/13 13:20, Peter Schmidt wrote: No, it's not working that easy with the jetty 6.1. When i just add -D

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
On the config I pasted here, Solr mailing list is inserting "**" in between dots so make sure you remove those. Example: JAVA_OPTIONS="-Dsolr.solr.**home=$YOUR_SOLR_COLLECTION_**HOME $JAVA_OPTIONS" is really: JAVA_OPTIONS="-Dsolr . solr . home=$YOUR _ SOLR _ COLLECTION _ HOME $JAVA
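
If a configuration line was copied from the list archive with these "**" markers still in it, they can be stripped mechanically. A small Python sketch, assuming the markers appear exactly as "**" and that the line contains no legitimate double asterisks; the helper name is hypothetical.

```python
def strip_list_artifacts(line):
    """Remove the '**' sequences the mailing list software inserted."""
    return line.replace("**", "")


mangled = 'JAVA_OPTIONS="-Dsolr.solr.**home=$YOUR_SOLR_COLLECTION_**HOME $JAVA_OPTIONS"'
clean = strip_list_artifacts(mangled)
print(clean)
```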

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
Check the jetty logs, there must be something wrong in your collection config, usually if it doesn't start it will tell you there. And of course, for every change you make be sure you restart jetty. Guido. On 10/10/13 13:20, Peter Schmidt wrote: No, it's not working that easy with the jetty 6

Re: Solr auto suggestion not working

2013-10-10 Thread Furkan KAMACI
What is your configuration for auto suggestion? 2013/10/10 ar...@skillnetinc.com > > > Hi, > > We are encountering an issue in solr search auto suggestion feature. Here > is > the problem statement with an example: > We have a product named 'Apple iphone 5s - 16 GB'. Now when in the search > bo

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Furkan KAMACI
Hi Peter; You can find out its version like this: java -jar start.jar --version 2013/10/10 Peter Schmidt > Is the integrated jetty in Solr 4.4.0 a jetty version 9 or 7 or 8 ? > > > 2013/10/10 Peter Schmidt > > > Your right. it worked although for me with Jetty 9. You mean 9.0.6 > > and i wa

Solr auto suggestion not working

2013-10-10 Thread ar...@skillnetinc.com
Hi, We are encountering an issue in solr search auto suggestion feature. Here is the problem statement with an example: We have a product named 'Apple iphone 5s - 16 GB'. Now when in the search box we type 'Apple' or 'iphone' this product name comes in the suggestion list. But when we type 'ipho

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
Is the integrated jetty in Solr 4.4.0 a jetty version 9 or 7 or 8 ? 2013/10/10 Peter Schmidt > Your right. it worked although for me with Jetty 9. You mean 9.0.6 > and i was talking about jetty 6.1 which come with ubuntus packing. > > So i think i would use the solr with the integrated jetty.

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
You're right, it also worked for me with Jetty 9. You mean 9.0.6, and I was talking about Jetty 6.1, which comes with Ubuntu's packaging. So I think I will use Solr with the integrated Jetty. 2013/10/10 maephisto > Tried it and worked as expected with latest version of Jetty (.0.6 if I > rem

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
No, it's not working that easily with Jetty 6.1. When I just add -Dsolr.solr.home to my /etc/default/jetty and put the solr.war in the jetty webapps directory, I get a 404 when I try to reach http://localhost:8080 or http://localhost:8080/solr. 2013/10/10 Guido Medina > My I forgot to mentio

Re: Find documents that are composed of % words

2013-10-10 Thread Erick Erickson
Just to add my $0.02. Often this kind of thing is a mistaken assumption on the part of the client that they know how to score documents better than the really bright people who put a lot of time and energy into scoring (note, I'm _certainly_ not one of those people!). I'll often, instead of making

Re: matching "starts with" only

2013-10-10 Thread Erick Erickson
Be aware that the string type is not analyzed in any way, so your searches are case sensitive. There's a "lowercase" type in the example schema.xml that combines KeywordTokenizer with LowercaseFilter for case-insensitive searches that you might find useful. Besides regex, this might be a good plac
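
The difference between the two field types for "starts with" matching can be pictured with a toy model. This Python sketch is a conceptual illustration only, not Solr code: it treats the string type as storing the value unchanged and the "lowercase" type as storing the whole value as one lowercased token, and it assumes the query prefix goes through the same normalization as the indexed value.

```python
def string_type(value):
    """No analysis: the value is indexed exactly as given."""
    return value


def lowercase_type(value):
    """Keyword tokenizer plus lowercase filter: one token, lowercased."""
    return value.lower()


def starts_with(indexed_value, query_prefix, analyze):
    """Prefix match after applying the same analysis to both sides."""
    return analyze(indexed_value).startswith(analyze(query_prefix))


print(starts_with("Apple iPhone", "apple", string_type))     # case-sensitive
print(starts_with("Apple iPhone", "apple", lowercase_type))  # case-insensitive
```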

Re: Help to figure out why query does not match

2013-10-10 Thread heaven
Hi Erick, I finally got back to this issue. Here is the wish I've created: https://issues.apache.org/jira/browse/SOLR-5332 Best, Alex -- View this message in context: http://lucene.472066.n3.nabble.com/Help-to-figure-out-why-query-does-not-match-tp4086967p4094652.html Sent from the Solr -

Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-10 Thread Erick Erickson
bq: so what is the point of having atomic updates if i need to update everything? _nobody_ claims this is ideal, it does solve a certain use-case. We'd all like true partial-updates that didn't require stored fields. The use-case here is that you don't have access to the system-of-record so

Re: run filter queries after post filter

2013-10-10 Thread Erick Erickson
Rohit: Ah, ok. I think Jim nailed it. The whole idea of post filters though, is to use them when the filter is expensive. Do you have evidence that the fq you're trying to use as a post-filter is "expensive"? I.e. &fq={!cache=false cost=200}field:value takes time? Or are you doing something in you
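
The filter query shown above contains characters that need URL encoding when sent over HTTP. A minimal Python sketch that builds the same fq with the cache and cost local params; the field and value are the placeholders from the message, and the helper name uncached_fq is hypothetical.

```python
from urllib.parse import parse_qs, urlencode


def uncached_fq(field, value, cost=200):
    """Build a filter query with caching disabled and an explicit cost."""
    return "{!cache=false cost=%d}%s:%s" % (cost, field, value)


fq = uncached_fq("field", "value")
query_string = urlencode({"q": "*:*", "fq": fq})
print(fq)
print(query_string)
```

Whether a filter actually runs as a post filter also depends on the query type, so this only shows how the parameter is formed.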

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread maephisto
Tried it and worked as expected with latest version of Jetty (.0.6 if I remember correctly) and Solr 4.4.0 This tutorial should help you (it's verified by me and working): http://dcvan24.wordpress.com/2013/05/16/how-to-deploy-solr-4-3-on-jetty-9/ -- View this message in context: http://lucene.

Re: Solr's Filtering approaches

2013-10-10 Thread Erick Erickson
Well, my first question is why 50K groups is necessary, and whether you can simplify that. How a user can manually choose from among that many groups is "interesting". But assuming they're all necessary, I can think of two things. If the user can only select ranges, just put in filter queries usin

Re: Find documents that are composed of % words

2013-10-10 Thread Upayavira
On Wed, Oct 9, 2013, at 02:45 PM, shahzad73 wrote: > my client has a strange requirement, he will give a list of 500 words > and > then set a percentage like 80% now he want to find those pages or > documents which consist of the only those 80% of 500 and only 20% > unknown. > like we
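
One reading of this requirement is "at least 80% of the words in a document come from the given list". Outside Solr, that ratio can be computed as in the Python sketch below. The tokenization (lowercased runs of word characters) and the function name are assumptions for illustration; the thread does not specify how words should be counted.

```python
import re


def known_word_ratio(text, known_words):
    """Fraction of the words in `text` that appear in `known_words`."""
    words = re.findall(r"\w+", text.lower())
    if not words:
        return 0.0
    known = {w.lower() for w in known_words}
    return sum(w in known for w in words) / len(words)


ratio = known_word_ratio("a b c d e", ["a", "b", "c", "d"])
print(ratio)
print(ratio >= 0.8)
```

A document would qualify when the ratio meets the client's threshold, for example 0.8.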

Re: Find documents that are composed of % words

2013-10-10 Thread Furkan KAMACI
Hi; Your question seems like an example of minimum should match feature and Aloke has answered it. However I've wanted to mention about dedup mechanism at Solr (http://wiki.apache.org/solr/Deduplication) if *mm* parameter is not you are looking for and if you want to do something more special. Ded

Re: matching "starts with" only

2013-10-10 Thread adm1n
I've changed the field name to string type, the default one presented in schema.xml, and I got what I needed. thanks for your time. -- View this message in context: http://lucene.472066.n3.nabble.com/matching-starts-with-only-tp4094430p4094637.html Sent from the Solr - User mailing list archi

Re: Update existing documents when using ExtractingRequestHandler?

2013-10-10 Thread Erick Erickson
1 - puts the work on the Solr server though. 2 - This is just a SolrJ program, could be run anywhere. See: http://searchhub.org/dev/2012/02/14/indexing-with-solrj/ It would give you the most flexibility to offload the Tika processing to N other machines. 3 - This could work, but you'd then be index

Re: Using Solr Cell to index the internal structure of a PDF

2013-10-10 Thread Furkan KAMACI
You can have a look here: http://solr.pl/en/2011/04/04/indexing-files-like-doc-pdf-solr-and-tika-integration/ 2013/10/10 Peter Bleackley > I'm trying to index a set of PDF documents with Solr 4.5.0. So far I can > get Solr to ingest the entire document as one long string, stored in the > index

Re: Find documents that are composed of % words

2013-10-10 Thread shahzad73
No, unfortunately I did not get how this will help me. Please explain in a bit more detail. -- View this message in context: http://lucene.472066.n3.nabble.com/Find-documents-that-are-composed-of-words-tp4094264p4094630.html Sent from the Solr - User mailing list archive at Nabble.com.

Using Solr Cell to index the internal structure of a PDF

2013-10-10 Thread Peter Bleackley
I'm trying to index a set of PDF documents with Solr 4.5.0. So far I can get Solr to ingest the entire document as one long string, stored in the index as "content". However, I want to index structure within the documents. I know that the ExtractingRequestHandler uses Apache Tika to convert the

Please help!, Highlighting exact phrases with solr

2013-10-10 Thread Silvia Suárez
I am using solrj as a client for indexing documents on the solr server. I am new to solr, and I am having a problem with highlighting in solr. Highlighting exact phrases with solr does not work. For example, if the search keyword is: "dulce hogar" it returns: dulce hogar And it should be: d

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
I forgot to mention where you need to copy the solr.war file: /var/lib/jetty/webapps Guido. On 10/10/13 10:48, Guido Medina wrote: Here are the pertinent changes you need to make to /etc/default/jetty: * JAVA_OPTIONS="-Djava.awt.headless=true -Dfile.encoding=UTF-8 -server -Xms8g -Xmx8

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
Here are the pertinent changes you need to make to /etc/default/jetty: * JAVA_OPTIONS="-Djava.awt.headless=true -Dfile.encoding=UTF-8 -server -Xms8g -Xmx8g -XX:+UseG1GC -XX:MaxGCPauseMillis=50 -XX:+OptimizeStringConcat -XX:+UseStringCache" * JAVA_OPTIONS="-Dsolr.solr.home=$YOUR_SOLR_COLLEC

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Guido Medina
I can confirm solr 3.x and 4.x run fine with jetty 6.x from the Ubuntu repo; we have had it in production for a few years now with no issues. You just need to be sure your solr.war has the required jars in it, and your collection must have the extra jars needed by your app. We have been running it in Ubu

Re: {soft}Commit and cache flusing

2013-10-10 Thread Dmitry Kan
Tim, my suggestion was very concise, sorry for that. But not at all "rude" or anything. Instead, tried to help you. Dmitry On Wed, Oct 9, 2013 at 9:28 PM, Tim Vaillancourt wrote: > Apologies all. I think the suggestion that I was replying "to get noticed" > is what erked me, otherwise I would

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Tobias Kabbeck
On 10.10.13 11:05, Stefan Matheis wrote: Is there a specific reason you are trying to use that jetty instead of the provided one? No, not really. I'm already using the provided one. But since I spent some tries on it, I want to know if somebody is successfully using it that way :) -Stefa

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Tobias Kabbeck
On 10.10.13 11:02, Furkan KAMACI wrote: Do you get an error? As a webapp like in http://www.kingstonlabs.com/blog/how-to-install-solr-36-on-ubuntu-1204/ I got a 404. The recipe from the cookbook produces some errors in the jetty log. I think the jetty.xml and the Java class are different. 201

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Stefan Matheis
Is there a specific reason you are trying to use that jetty instead of the provided one? -Stefan On Thursday, October 10, 2013 at 11:01 AM, Peter Schmidt wrote: > Hey folks, > for some days i tried to get Solr 4.4.0 working as a webapp with Jetty 6.1 > from the Ubuntu repository installed wit

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Furkan KAMACI
Do you get an error? 2013/10/10 Peter Schmidt > Hey folks, > for some days i tried to get Solr 4.4.0 working as a webapp with Jetty 6.1 > from the Ubuntu repository installed with apt-get. First i tried the > installation according the wiki http://wiki.apache.org/solr/SolrJetty. > Then > i foun

Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-10 Thread Peter Schmidt
Hey folks, for some days I tried to get Solr 4.4.0 working as a webapp with Jetty 6.1 from the Ubuntu repository installed with apt-get. First I tried the installation according to the wiki http://wiki.apache.org/solr/SolrJetty. Then I found this example http://www.kingstonlabs.com/blog/how-to-install

Re: Multiple schemas in the same SolrCloud ?

2013-10-10 Thread xinwu
Thank you ,Shawn. I solved that problem by using the "-Dbootstrap_conf=true" configuration instead of the "-Dbootstrap_confdir=/opt/Solr_home/collection1/conf -Dcollection.configName=solrConfig" on startup. But,another problem showed up. I had two collections (collection1,collection2) when I sta

Re: Multiple schemas in the same SolrCloud ?

2013-10-10 Thread xinwu
Thank you,kamaci. -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-same-SolrCloud-tp4094279p4094594.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: SolrCloud+Tomcat ; 3 windows VMs, 3 shards * 2 replica

2013-10-10 Thread magnum87
For further information: when I delete and recreate collections with the Collections API, my Solr home changes (instead of having directories core1, core2, core3... I have core1_shard1_replica1, core1_shard3_replica2)... This happens with or without the 'persistent' on solr.xml. If persistent is TRUE,