Re: Partition Question

2012-05-08 Thread Yuval Dotan
Hi Can someone please guide me to the right way to partition the solr index? On Mon, May 7, 2012 at 11:41 AM, Yuval Dotan wrote: > Hi All > Jan, thanks for the reply - answers for your questions are located below > Please update me if you have ideas that can solve my problems. > > First, some co

Re: Nested CachedSqlEntityProcessor running for each entity row with Solr 3.6?

2012-05-08 Thread not interesting
> In the mean time, I am hoping each of you can test these fixes with your > installation.  The best way to do this is to get a fresh SVN checkout of the > 3.6.1 branch > (http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_6/), > switch to the "solr" directory, then run "ant dist

RE: Advanced search with results matrix

2012-05-08 Thread Gnanakumar
Hi Mikhail, > have you considered to junk your subqueries into disjunction > (BooleanQuery.Occurs.SHOULD) and request > http://wiki.apache.org/solr/SimpleFacetParameters#facet.query_:_Arbitrary_Query_Faceting? Thanks for pointing/guiding me at the right time and/or direction about Solr "Facet Que

Re: Bridge between Solr and NoSQL

2012-05-08 Thread Michael Kuhlmann
On 08.05.2012 at 04:13, Jeff Schmidt wrote: Francois: Check out DataStax Enterprise 2.0, Solr integrated with Cassandra: http://www.datastax.com/docs/datastax_enterprise2.0/search/index And, Solbase, Solr integrated with HBase: https://github.com/Photobucket/Solbase I'm sure there are others,

How to change the default format for tstamp?

2012-05-08 Thread forwardswing
When I use Nutch 1.2 with Solr 1.4, I run the following command from Nutch to commit the index to Solr: bin/nutch crawl urls -dir ... But there is an error in Solr: ERROR : org.apache.solr.core.SolrCore - org.apache.solr.common.SolrException: Error while creating field 'tstamp{type=date,properties

RE: How to change the default format for tstamp?

2012-05-08 Thread Agnesrani Nadar
SELECT DATE_FORMAT( ts.insertDate,'%Y-%m-%d-%k:%i:%S') as insertDate, Try to use the above DATE_FORMAT you will get the date as 2012-03-24-14:07:59 -Original Message- From: forwardswing [mailto:wangweiz...@sohu.com] Sent: Tuesday, May 08, 2012 2:22 PM To: solr-user@lucene.apache.org S

Solr Webapps and JVM code cache

2012-05-08 Thread Sujatha Arun
Hello, I see that the code cache in the JVM is nearing its memory limit: 47 MB used of the 50 MB assigned. If I deploy more Solr webapps to the server, will I get out-of-memory exceptions, and will the JVM freeze? How should this be handled? Regards Sujatha

SOLR UIMA - posTag

2012-05-08 Thread hemantverm...@gmail.com
Hi All, I have integrated UIMA and Solr and successfully indexed the documents. My requirement is to boost nouns over verbs. (Let's take the field description as an example.) Below are the sample index contents. - - Water Curses Prospect Hummer people are different in nature some bahaves be

Solr Spellcheck index is not replicated to the slave

2012-05-08 Thread ilanh
I am using apache-solr-1.4.1. I have a master and a slave. When I am doing “Full import”, the index is replicated to the slave however the spellcheck index is not. In the master I have: solrconfig_slave.xml:solrconfig.xml,schema.xml,stopwords.txt,elevate.xml,protwords.txt,synonyms.txt And in the

RE: Nested CachedSqlEntityProcessor running for each entity row with Solr 3.6?

2012-05-08 Thread Dyer, James
Kellen, I appreciate your trying this out. Is there any way you can provide your data-config.xml file? I'd really like to get to the bottom of this. Thanks. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: not interesting [mailto:dye.kel...@

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Michael Della Bitta
Hi Sujatha, You will likely have to increase the JVM permgen space for your container when you launch it. This is normal. How you do this depends on the container you're using and how you launch it. Michael On Tue, 2012-05-08 at 16:54 +0530, Sujatha Arun wrote: > Hello , > > I see that the code

Re: Lucene FieldCache - Out of memory exception

2012-05-08 Thread Rahul R
An update on the things I tried today. Since multiValued fields do not use the fieldCache, I changed my schema to define all my fields as multiValued fields. Although these fields only need to be single-valued, I made this change, recreated the index, and tested with it. Observations: - force GC

[MAVEN] Heads up: build changes

2012-05-08 Thread Steven A Rowe
If you use the Lucene/Solr Maven POMs to drive the build, I committed a major change last night (see https://issues.apache.org/jira/browse/LUCENE-3948 for more details): * 'ant get-maven-poms' no longer places pom.xml files under the lucene/ and solr/ directories. Instead, they are placed in a

RE: How to change the default format for tstamp?

2012-05-08 Thread forwardswing
I am new to Nutch; can you please describe this in more detail? For example, where and how do I execute the SQL? -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-change-the-default-format-for-tstamp-tp3970751p3971251.html Sent from the Solr - User mailing list archive

RE: How to change the default format for tstamp?

2012-05-08 Thread Markus Jelsma
You should upgrade your Nutch version. Nutch 1.4 writes dates in the format Solr understands. On Tue, 8 May 2012 06:03:25 -0700 (PDT), forwardswing wrote: I am a learner to nutch,can you please describe in more detail? For example,from where and how to execute the SQL? -- View this message i
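
To illustrate the format issue in this thread: Solr date fields expect ISO 8601 UTC timestamps such as 2012-03-24T14:07:59Z, which is not the 2012-03-24-14:07:59 shape produced by the DATE_FORMAT suggestion earlier in the thread. A minimal Python sketch of the conversion follows; the helper name to_solr_date is hypothetical, and treating naive datetimes as UTC is an assumption, not something stated in the thread.

```python
from datetime import datetime, timezone


def to_solr_date(value: datetime) -> str:
    """Format a datetime as the UTC ISO 8601 string that Solr date fields accept."""
    # Assumption: a datetime without timezone info is already in UTC.
    if value.tzinfo is None:
        value = value.replace(tzinfo=timezone.utc)
    # Convert to UTC and emit the trailing "Z" that Solr requires.
    return value.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


stamp = to_solr_date(datetime(2012, 3, 24, 14, 7, 59))
print(stamp)
```

Upgrading Nutch, as suggested above, avoids the need for any such conversion on the indexing side.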

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Sujatha Arun
We have given the permgen space 2 GB, and 650 MB of it is currently used. However, the code cache defaults to 49 MB and more than 47 MB of it has been used. I would like to know what happens when we deploy more webapps to the container. - Is unused memory removed from the cache to free up space?

Re: Advanced search with results matrix

2012-05-08 Thread Mikhail Khludnev
On Tue, May 8, 2012 at 11:38 AM, Gnanakumar wrote: > My question is, is it possible to run > multiple combination of search queries to just get only result count "in a > single trip" without using "facet.query"? > No. AFAIK. -- Sincerely yours Mikhail Khludnev Tech Lead Grid Dynamics
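
For readers following this thread, the facet.query approach can be sketched as a single request that returns only counts: rows=0 suppresses the documents and each facet.query parameter yields one count. This is a minimal Python sketch that only builds the request URL; the base URL, the helper name count_only_url, and the field names color and size are assumptions for illustration.

```python
from urllib.parse import urlencode


def count_only_url(base_url, main_query, sub_queries):
    """Build one Solr request that returns a hit count per sub-query and no documents."""
    params = [
        ("q", main_query),
        ("rows", "0"),      # no documents, counts only
        ("facet", "true"),
        ("wt", "json"),
    ]
    # One facet.query parameter per sub-query; Solr reports a count for each.
    params += [("facet.query", sub_query) for sub_query in sub_queries]
    return base_url + "?" + urlencode(params)


url = count_only_url(
    "http://localhost:8983/solr/select",  # assumed host and handler
    "*:*",
    ["color:red", "color:blue AND size:large"],  # hypothetical fields
)
print(url)
```

The counts come back under facet_counts/facet_queries in the response, keyed by the query string.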

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Michael Della Bitta
I'm sorry, apparently I'm unfamiliar with issues that result in needing 2Gb of permgen space. I've never run into code cache issues before. I found this page via Google: http://jasig.275507.n4.nabble.com/Java-code-cache-memory-td271140.html ..but perhaps someone who has experience with this issue

CLASSPATH

2012-05-08 Thread Tolga
Hi, Probably off-topic, but what directory should I export to CLASSPATH environment variable so that I can begin using nutch? Regards,

Re: CLASSPATH

2012-05-08 Thread Otis Gospodnetic
Tolga - you should ask on the Nutch mailing list, not Solr one. :) Otis  Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.com/spm  > > From: Tolga >To: solr-user@lucene.apache.org >Sent: Tuesday, May 8, 2012 4:30 PM >Subject: CLAS

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Otis Gospodnetic
Hi, Did you really mean 47 *MB*? Yes, if any limits are reached and GC cannot reclaim enough space, you will get OOM. You can use a couple of JVM params to catch this and dump heap to a file if you want to analyze it and see what was using memory and how much. Otis  Performance Monitoring
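
The message above does not name the JVM parameters. On a HotSpot JVM, the usual pair for dumping the heap on an OutOfMemoryError is -XX:+HeapDumpOnOutOfMemoryError with -XX:HeapDumpPath, and the code cache limit is set with -XX:ReservedCodeCacheSize. Whether these are the flags the poster had in mind is an assumption. A small Python sketch that assembles them for a container's startup options, with a hypothetical helper name, dump path, and size:

```python
def jvm_diagnostic_options(dump_path, code_cache_mb=None):
    """Return HotSpot options that dump the heap on OOM and optionally resize the code cache."""
    options = [
        "-XX:+HeapDumpOnOutOfMemoryError",
        f"-XX:HeapDumpPath={dump_path}",
    ]
    if code_cache_mb is not None:
        # Raises the code cache ceiling discussed in this thread.
        options.append(f"-XX:ReservedCodeCacheSize={code_cache_mb}m")
    return options


# Hypothetical values; adjust to the container's environment.
opts = jvm_diagnostic_options("/tmp/solr-dumps", code_cache_mb=128)
print(" ".join(opts))
```

How the resulting string is passed to the JVM depends on the container and how it is launched.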

Re: SOLR UIMA - posTag

2012-05-08 Thread Lance Norskog
2 quick ideas: 1) The nouns and verbs have to be in separate fields. So, you would make two analysis stacks. One only stores terms marked as 'noun' and the other only stores terms marked as 'verb'. 2) Store all terms in one field. Create a Similarity class that scores from the payloads. (There is a

Re: Bridge between Solr and NoSQL

2012-05-08 Thread Lance Norskog
The SolrCloud project is turning Solr into a NoSQL DB by itself, without connections to other systems. On Tue, May 8, 2012 at 2:01 AM, Michael Kuhlmann wrote: > On 08.05.2012 at 04:13, Jeff Schmidt wrote: > >> Francois: >> >> Check out DataStax Enterprise 2.0, Solr integrated with Cassandra: >> http:

Re: Partition Question

2012-05-08 Thread Lance Norskog
Lucene does not support more than 2^31 unique documents, so you need to partition. In Solr this is done with Distributed Search: http://www.lucidimagination.com/search/link?url=http://wiki.apache.org/solr/DistributedSearch First, you have to decide a policy for which documents go to which 'shard'. It i
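
The message is cut off before it describes a policy, so the following is only one possible choice, not necessarily the poster's recommendation: route each document by a stable hash of its unique key, then list every shard in the shards parameter at query time. A minimal Python sketch, with hypothetical helper names and hosts:

```python
import zlib


def shard_for(doc_id: str, num_shards: int) -> int:
    """Pick a shard index from a stable hash of the document's unique key."""
    # crc32 is deterministic across runs, unlike Python's built-in hash() for strings.
    return zlib.crc32(doc_id.encode("utf-8")) % num_shards


def shards_param(hosts):
    """Build the comma-separated value for Solr's 'shards' query parameter."""
    return ",".join(hosts)


hosts = ["localhost:8983/solr", "localhost:7574/solr"]  # assumed shard locations
target = hosts[shard_for("doc-42", len(hosts))]
print(target, shards_param(hosts))
```

With hash routing, a document always lands on the same shard, so re-indexing it replaces the old copy instead of creating a duplicate on another shard.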

Re: PDF indexing

2012-05-08 Thread Lance Norskog
post.jar and curl do the same thing. Look at post.sh, which uses curl. On Mon, May 7, 2012 at 12:57 PM, Tolga wrote: > On 05/07/2012 10:35 PM, Jack Krupansky wrote: >> >> Try SolrCell (ExtractingRequestHandler). >> >> See: >> http://wiki.apache.org/solr/ExtractingRequestHandler >> >> -- Jack Krup

Re: Solritas in production

2012-05-08 Thread Bill Bell
I would not use Solritas except for very rudimentary solutions and prototypes. Sent from my Mobile device 720-256-8076 On May 6, 2012, at 6:02 AM, András Bártházi wrote: > Hi, > > We're currently evaluating Solr as a Sphinx replacement. Our site has > 1.000.000+ pageviews a day, it's a real e

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Sujatha Arun
Yes, 47 MB. Does CMS permgen sweeping take care of code cache cleanup? Thanks Michael and Otis Regards Sujatha On Wed, May 9, 2012 at 2:27 AM, Otis Gospodnetic wrote: > Hi, > > Did you really mean 47 *MB*? > Yes, if any limits are reached and GC cannot reclaim enough space, you > will get O

Re: Dynamic creation of cores for this use case.

2012-05-08 Thread pprabhcisco123
Hi, I tried to create a core by simply hitting the URL below: http://localhost:8983/solr/admin/cores?action=CREATE&name=core3&instanceDir=C://solr&config=solrconfig.xml&schema=schema.xml&dataDir=C://solr/data It made an entry in the solr.xml file, but the core directory was not created. Plea

Re: Solritas in production

2012-05-08 Thread Paul Libbrecht
On 7 May 2012, at 13:30, Marcelo Carvalho Fernandes wrote: > Anything else? If you fear DoS attacks from overly large queries (e.g. if you have millions of documents), consider writing a query component that can limit the queries. I believe that there's nothing else. paul