Re: Class name of parsing the fq clause

2013-10-23 Thread Sandeep Gupta
Yes.. it is not related to this particular mail thread. I will post separate mail. Thanks Sandeep On Wed, Oct 23, 2013 at 4:36 PM, Jack Krupansky wrote: > Not in just a few words. Do you have specific questions? I mean none of > that relates to parsing of fq, the topic of this particular email

Re: Major GC does not reduce the old gen size

2013-10-23 Thread Shawn Heisey
On 10/21/2013 2:11 PM, neoman wrote: > We are using solr 4.4 version production with 4 shards. This is our memory > settings. > -d64 -server -Xms8192m -Xmx12288m -XX:MaxPermSize=256m \ > -XX:NewRatio=1 -XX:SurvivorRatio=6 \ > -XX:+UseConcMarkSweepGC -XX:+CMSIncrementalMode > -XX:CMSIncrementalDutyC

Re: Major GC does not reduce the old gen size

2013-10-23 Thread neoman
help please -- View this message in context: http://lucene.472066.n3.nabble.com/Major-GC-does-not-reduce-the-old-gen-size-tp4096880p4097429.html Sent from the Solr - User mailing list archive at Nabble.com.

why Analyzer in solr always hang ?

2013-10-23 Thread Mingzhu Gao
Hi All , My custom analyser always hang when I click "Analysis values" button from analysis page . The thread dump is the following : "http-bio-8080-exec-7" daemon prio=5 tid=7ffc7e0a9800 nid=0x1152d6000 runnable [1152d3000] java.lang.Thread.State: RUNNABLE at gnu.trove.impl.hash.TObjectHas

Re: Global IDF vs. Routing

2013-10-23 Thread Otis Gospodnetic
Duh, right, right, sorry for the noise. Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/ On Wed, Oct 23, 2013 at 9:13 PM, Yonik Seeley wrote: > On Wed, Oct 23, 2013 at 9:03 PM, Otis Gospodnetic > wrote: >> Seeing so much work

Re: deleteByQuery does not work with SolrCloud

2013-10-23 Thread YouPeng Yang
Hi Erick It can get hits on this documents. And I try this : myhost/solr/mycore/update?stream.body=name:shardTv_20131010&commit=true the document could be deleted. Regards 2013/10/23 Erick Erickson > The first thing I'd do is go in to the browser UI and make sure you can get > hits o

Re: Solr operation problem

2013-10-23 Thread Alexandre Rafalovitch
Have you already used Solr with default setup (Jetty)? If not, I recommend you do the Jetty setup first and online tutorial. Just so you understand what the files are, where they are and so on. Then, add Tomcat into the mix. If you still have a problem, let us know which operating system you are o

Re: Global IDF vs. Routing

2013-10-23 Thread Yonik Seeley
On Wed, Oct 23, 2013 at 9:03 PM, Otis Gospodnetic wrote: > Seeing so much work being put in routing and seeing the recent > questions about the status of global IDF support made me realize, for > the first time really, that with people using routing more and more we > should be seeing more and mor

Carrot2 Clustering with Field Collapsing

2013-10-23 Thread Thanigai Vellore
When I try to use carrot2 clustering in solr with grouping based on a field, I get a null pointer exception. However, the clustering query works fine without field grouping. For eg: the below query works fine: /clustering?q=text:apple&rows=500&carrot.title=title but, this query throws an error:

Global IDF vs. Routing

2013-10-23 Thread Otis Gospodnetic
Hi, Seeing so much work being put in routing and seeing the recent questions about the status of global IDF support made me realize, for the first time really, that with people using routing more and more we should be seeing more and more issues caused by the lack of global IDF because routing by

Solr operation problem

2013-10-23 Thread masum.uia
Dear user, Cloud you please help me to solve my following problem: I have installed Java, Tomcat and arrange all the files for Solr 4.5 according the instruction from Solr Wiki.htm and different web. My tomcat is running well but I am getting problem once I try to open solr using http://localhost:

Re: single core for extracted text from pdf/other doc types and metadata fields about that doc from the database

2013-10-23 Thread Otis Gospodnetic
You can accomplish your end goal easily if you just write your own indexer, which is easy and gives you power and flexibility. Otis Solr & ElasticSearch Support http://sematext.com/ On Oct 23, 2013 6:39 PM, "Sharma, Vikas" wrote: > > Can I create a core where one subset of fields comes from the

single core for extracted text from pdf/other doc types and metadata fields about that doc from the database

2013-10-23 Thread Sharma, Vikas
Can I create a core where one subset of fields comes from the Database source using the DataImport handler for database and another subset of fields using the Apache Tika dataimport handler For example if in the indexed doc I want following fields to come from the database source 1

Re: Solr facet field counts not correct

2013-10-23 Thread dboychuck
Hoss created: https://issues.apache.org/jira/browse/SOLR-5383 -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-facet-field-counts-not-correct-tp4097305p4097346.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr facet field counts not correct

2013-10-23 Thread Chris Hostetter
: if I do group=false&group.facet=false the counts are what they should be for : the ungrouped counts... seems like group.facet isn't working correctly yeah ... thanks for digging int -- definitely seems like a problem with group.facet and Trie fields that use precisionStep. I've opened a Jira:

Re: Solr not indexing everything from MongoDB

2013-10-23 Thread gohome190
numFound is 10. numDocs is 10, maxDoc is 23. Yeah, Solr 4.x! Thanks! -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-not-indexing-everything-from-MongoDB-tp4097302p4097340.html Sent from the Solr - User mailing list archive at Nabble.com.

Terms function join with a Select function ?

2013-10-23 Thread Bruno Mannina
Dear Solr users, I use the Terms function to see the frequency data in a field but it's for the whole database. I have 2 questions: - Is it possible to increase the number of statistic ? actually I have the 10 first frequency term. - Is it possible to limit this statistic to the result of a

Re: What is the right fieldType for this kind of field?

2013-10-23 Thread Jack Krupansky
Yes, that blog post appears to use the proper technique for case insensitive string fields. The so-called "keyword" tokenizer merely treats the whole string value as a single token (AKA keyword) and does NOT do any further tokenization. -- Jack Krupansky -Original Message- From: Bru

Re: What is the right fieldType for this kind of field?

2013-10-23 Thread Bruno Mannina
Le 23/10/2013 22:49, Bruno Mannina a écrit : Le 23/10/2013 22:44, Bruno Mannina a écrit : Le 23/10/2013 20:09, Jack Krupansky a écrit : You could use the keyword tokenizer plus the lower case filter. Jack, Could you help me to write the right fieldType please? (index and query) Another thing

Re: What is the right fieldType for this kind of field?

2013-10-23 Thread Bruno Mannina
Le 23/10/2013 22:44, Bruno Mannina a écrit : Le 23/10/2013 20:09, Jack Krupansky a écrit : You could use the keyword tokenizer plus the lower case filter. Jack, Could you help me to write the right fieldType please? (index and query) Another thing, I don't know if I must use the Keyword token

Re: What is the right fieldType for this kind of field?

2013-10-23 Thread Bruno Mannina
Le 23/10/2013 20:09, Jack Krupansky a écrit : You could use the keyword tokenizer plus the lower case filter. Jack, Could you help me to write the right fieldType please? (index and query) Another thing, I don't know if I must use the Keyword tokenizer because codes contain "/" char, and Tok

Re: Solr facet field counts not correct

2013-10-23 Thread dboychuck
if I do group=false&group.facet=false the counts are what they should be for the ungrouped counts... seems like group.facet isn't working correctly -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-facet-field-counts-not-correct-tp4097305p4097314.html Sent from the Solr -

Re: Solr facet field counts not correct

2013-10-23 Thread dboychuck
Here is my query String: /solr/singleproductindex/productQuery?fq=siteid:82&q=categories_82_is:109124&facet=true&facet.query=HeatingArea_numeric:[0%20TO%20*]&facet.field=HeatingArea_numeric&debugQuery=true Here is my schema for that field: Here is my request handler definition:

Solr facet field counts not correct

2013-10-23 Thread dboychuck
I am running a simple query in a non-distributed search using grouping. I am getting incorrect facet field counts and I cannot figure out why. Here is the query you will notice that the facet field and facet query counts are not the same. The facet query counts are correct. Any help is appreciated

Re: Solr not indexing everything from MongoDB

2013-10-23 Thread Shawn Heisey
On 10/23/2013 1:14 PM, gohome190 wrote: I have a Mongo database with about 50 entries inside. I use a mongo-solr connector. When I do a Solr *:* query, I only get about 10 or 13 responses. Even if I increase the max rows. I have updated my schema.xml accordingly. I have deleted my solr index,

Solr not indexing everything from MongoDB

2013-10-23 Thread gohome190
Hi, I have a Mongo database with about 50 entries inside. I use a mongo-solr connector. When I do a Solr *:* query, I only get about 10 or 13 responses. Even if I increase the max rows. I have updated my schema.xml accordingly. I have deleted my solr index, restarted solr, restarted the connec

Re: What is the right fieldType for this kind of field?

2013-10-23 Thread Bruno Mannina
Hi Jack, Yes String works fine, I forgot to restart my solr server after changing my schema.xml...arrf.I'm so stupid sorry ! Le 23/10/2013 20:09, Jack Krupansky a écrit : Trailing wildcard should work fine for strings, but "a23*" will not match "A23*" due to case. You could use the keywor

Re: DIH - delta query and delta import query executes transformer twice

2013-10-23 Thread Arcadius Ahouansou
Hello Lee. In case you haven't solved this, would you mind posting your DIH config? Arcadius. On 27 September 2013 15:06, Lee Carroll wrote: > Hi It looks like when a DIH entity has a delta and delta import query plus > a transformer defined the execution of both query's call the transformer

Re: What is the right fieldType for this kind of field?

2013-10-23 Thread Jack Krupansky
Trailing wildcard should work fine for strings, but "a23*" will not match "A23*" due to case. You could use the keyword tokenizer plus the lower case filter. -- Jack Krupansky -Original Message- From: Bruno Mannina Sent: Wednesday, October 23, 2013 1:54 PM To: solr-user@lucene.apache

What is the right fieldType for this kind of field?

2013-10-23 Thread Bruno Mannina
Dear, Data look likes: A23L1/22066 A23L1/227 A23L1/231 A23L1/2375 I tried: - String but I can't search with troncation (i.e. A23*) - Text_General but as my code contains / then data are splitted... What kind of field must choose to use truncation and consider code with / as one term? th

Re: Query cache and group by queries

2013-10-23 Thread Erick Erickson
query cache? queryResultCache? filterCache? Some more details please, what are you seeing and what do you expect to see? Best, Erick On Wed, Oct 23, 2013 at 1:22 PM, Kalle Aaltonen wrote: > Hi, > > It seems that query cache is not used to all for group queries? Can someone > explain why this i

Re: Indexing logs files of thousands of GBs

2013-10-23 Thread Erick Erickson
As a supplement to what Chris said, if you can partition the walking amongst a number of clients you can also parallelize the indexing. If you're using SolrCloud 4.5+, there are also some nice optimizations in SolrCloud to keep intra-shard routing to a minimum. FWIW, Erick On Wed, Oct 23, 2013 a

Re: New shard leaders or existing shard replicas depends on zookeeper?

2013-10-23 Thread Erick Erickson
My first impulse would be to ask how you created the collection. It sure _sounds_ like you didn't specify 24 shards and thus have only a single shard, one leader and 23 replicas bq: ...to point to the zookeeper ensemble also used for the ukdomain collection... so my guess is that this ZK ense

RE: New query-time multi-word synonym expander

2013-10-23 Thread Markus Jelsma
Nice, but now we got three multi-word synonym parsers? Didn't the LUCENE-4499 or SOLR-4381 patches work? I know the latter has had a reasonable amount of users and committers on github, but it was never brought back to ASF it seems. -Original message- > From:Otis Gospodnetic > Sent: W

Re: New query-time multi-word synonym expander

2013-10-23 Thread Jack Krupansky
Otis, could you provide a little (well, maybe a lot!) of discussion and detailed examples that illustrate what the patch can and can't handle? I mean, I read the Jira and and is simultaneously promising and a bit vague. Does it fully solve the issue, or is it yet another partial solution? Either

Re: Issue with large html indexing

2013-10-23 Thread Erick Erickson
Attachments and images are often eaten by the mail server, your image is not visible at least to me. Can you describe what you're seeing? Or post the image somewhere and provide a link? Best, Erick On Wed, Oct 23, 2013 at 11:07 AM, Raheel Hasan wrote: > Hi, > > I have an issue here while indexi

Re: Spellcheck with Distributed Search (sharding).

2013-10-23 Thread Luis Cappa Banda
More info: When executing the Query to a single Solr server it works: http://solr1:8080/events/data/suggest?q=m&wt=json { - responseHeader: { - status: 0, - QTime: 1 }, - response: { - numFo

New query-time multi-word synonym expander

2013-10-23 Thread Otis Gospodnetic
Hi, Heads up that there is new query-time multi-word synonym expander patch in https://issues.apache.org/jira/browse/SOLR-5379 This worked for our customer and we hope it works for others. Any feedback would be greatly appreciated. Thanks, Otis -- Performance Monitoring * Log Analytics * Search

Re: Is Solr can create temporary sub-index ?

2013-10-23 Thread Bruno Mannina
I need your help to define the right fieldType, please, this field must be indexed, stored and each value must be considered as one term. The char / don't be consider like a separator. Is String could be a good fieldType ? thanks Le 23/10/2013 18:51, Bruno Mannina a écrit : A23L1/22066 A

RE: Facet performance

2013-10-23 Thread Lemke, Michael SZ/HZA-ZSW
On Tue, October 22, 2013 5:23 PM Michael Lemke wrote: >On Tue, October 22, 2013 9:23 AM Toke Eskildsen wrote: >>On Mon, 2013-10-21 at 16:57 +0200, Lemke, Michael SZ/HZA-ZSW wrote: >>> QTime fc: >>>never returns, webserver restarts itself after 30 min with 100% CPU >>> load >> >>It might be

Re: Solr Cloud Distributed IDF

2013-10-23 Thread dboychuck
I am indexing documents using the domin:id format ex id = k-690kohler!670614 This ensures that all k-690kohler documents are indexed to the same shard. This does cause numDocs that are not perfectly distributed across shards probably even worse than the default sharding algorithm. Here is the sear

Spellcheck with Distributed Search (sharding).

2013-10-23 Thread Luis Cappa Banda
Hello! I'be been trying to enable Spellchecking using sharding following the steps from the Wiki, but I failed, :-( What I do is: *Solrconfig.xml* <*searchComponent name="suggest"* class="solr.SpellCheckComponent"> suggest org.apache.solr.spelling.suggest.Suggester org.apache.solr.spelling.sug

Re: Minor bug with CloudSolrServer and collection-alias.

2013-10-23 Thread Mark Miller
I filed https://issues.apache.org/jira/browse/SOLR-5380 and just committed a fix. - Mark On Oct 23, 2013, at 11:15 AM, Shawn Heisey wrote: > On 10/23/2013 3:59 AM, Thomas Egense wrote: >> Using cloudSolrServer.setDefaultCollection(collectionId) does not work as >> intended for an alias spannin

Re: Is Solr can create temporary sub-index ?

2013-10-23 Thread Bruno Mannina
Hum I think my fieldType = "text_classification" is not appropriated for this kind of data... I don't need to use stopwords, synonym etc... IC field is a field that contains codes, and codes contains often the char "/" and if I use the Terms option, I get: ... 4563254 3763554 2263254 ... ..

Re: Is Solr can create temporary sub-index ?

2013-10-23 Thread Bruno Mannina
I have a little question concerning statistics on a request: I have a field defined like that: multiValued="true"/> positionIncrementGap="100" autoGeneratePhraseQueries="true"> words="stopwords.txt" enablePositionIncrements="true"/> words="stopwords.txt" enablePositionIncre

Re: Is Solr can create temporary sub-index ?

2013-10-23 Thread Timothy Potter
Yes, absolutely you resend the q= each time, optionally with any facets selected by the user using fq= On Wed, Oct 23, 2013 at 10:00 AM, Bruno Mannina wrote: > Hello Tim, > > Yes solr's facet could be a solution, but I need to re-send the q= each > time. > I'm asking me just if an another solut

SV: fq with { or } in Solr 4.3.1

2013-10-23 Thread Peter Kirk
Thanks. The data for the "catid" comes from another system, and is actually a string with a start { and an end }. I was confused that it works in a q parameter but not fq. I think the easiest for me, is simply to strip the start and end characters when I feed to the index. Thanks __

Re: Is Solr can create temporary sub-index ?

2013-10-23 Thread Bruno Mannina
Hello Tim, Yes solr's facet could be a solution, but I need to re-send the q= each time. I'm asking me just if an another solution exists. Facet seems to be the good solution. Bruno Le 23/10/2013 17:03, Timothy Potter a écrit : Hi Bruno, Have you looked into Solr's facet support? If I'm

Re: SOLR Cloud node link is wrong in the admin panel

2013-10-23 Thread Shawn Heisey
On 10/23/2013 7:50 AM, Branham, Jeremy [HR] wrote: > It seems the parameters in solr.xml are being ignored. > > > host="localhost" hostPort="8080" hostContext="/svc/solr"> > > collection="test" dataDir="/data/v8p/solr/test/data"/> > > Did you restart Solr (actually your container -

Re: Minor bug with CloudSolrServer and collection-alias.

2013-10-23 Thread Shawn Heisey
On 10/23/2013 3:59 AM, Thomas Egense wrote: > Using cloudSolrServer.setDefaultCollection(collectionId) does not work as > intended for an alias spanning more than 1 collection. > The virtual collection-alias collectionID is recoqnized as a existing > collection, but it does only query one of the co

Re: shards.tolerant throwing null pointer exception when spellcheck is on

2013-10-23 Thread shamik
Thanks for the information. I think its good to have this issue fixed, specially for cases where the spellcheck feature is on. I'll check out at the source code and take a look, even a quick suppressing of the null pointer exception might make a difference. -- View this message in context: http

Re: Having two document sets in one index, "separated" by filter query.

2013-10-23 Thread Timothy Potter
Sounds correct - you probably want to use an invariant parameter in solrconfig.xml, something along the lines of: docset:0 Where docset is the new field you add to the schema to determine which set a document belongs to. You might also consider adding a newSearcher warming query that includes t

Re: Changing indexed property on a field from false to true

2013-10-23 Thread michael.boom
I've made a test, based on your suggestion. Using the example in 4.5.0 i set the title field as indexed=false, indexed a couple of docs: 1 BigApple 2 SmallApple and made fq=title:BigApple. No docs were returned, of course. Then I modified the schema, setting indexed=true for the

Re: Is Solr can create temporary sub-index ?

2013-10-23 Thread Timothy Potter
Hi Bruno, Have you looked into Solr's facet support? If I'm reading your post correctly, this sounds like the classic case for facets. Each time the user selects a facet, you add a filter query (fq clause) to the original query. http://wiki.apache.org/solr/SolrFacetingOverview Tim On Wed, Oct 2

Re: Stop/Restart Solr

2013-10-23 Thread Raheel Hasan
ok got it thanks :) On Wed, Oct 23, 2013 at 7:33 PM, Walter Underwood wrote: > PPID is the "parent process ID". You want to kill the PID, not the PPID. > > wunder > > On Oct 23, 2013, at 3:09 AM, Jeevanandam M. wrote: > > > It seems process started recently. Is there any external cron/proces

Stemming and Synonyms in Apache Solr

2013-10-23 Thread venkatesham.gu...@igate.com
We have written a blog with our understanding and experiments on stemming and synonyms in Apache Solr. http://theunstructuredworld.blogspot.in/ We appreciate the users can read and post their valuable suggestions/comments. Thanks. -- View this message in context: http://lucene.472066.n3.

Re: Stop/Restart Solr

2013-10-23 Thread Walter Underwood
PPID is the "parent process ID". You want to kill the PID, not the PPID. wunder On Oct 23, 2013, at 3:09 AM, Jeevanandam M. wrote: > It seems process started recently. Is there any external cron/process > triggering a startup of Solr? > Kill again and monitor it. > > - Jeeva > > -- Or

Reclaiming disk space from (large, optimized) segments

2013-10-23 Thread Scott Lundgren
*Background:* - Our use case is to use SOLR as a massive FIFO queue. - Document additions and updates happen continuously. - Documents are being added at sustained a rate of 50 - 100 documents per second. - About 50% of these document are updates to existing docs, indexed using atomic u

SolR document with high number of fields

2013-10-23 Thread Jean-Marc Desprez
Hi, I have done some research about SolR document with a very high number of fields. In the mailing list archive there's a thread about this subject which answers my question : http://lucene.472066.n3.nabble.com/Dynamic-fields-performance-question-td476337.html . By the way, this post is a littl

Is Solr can create temporary sub-index ?

2013-10-23 Thread Bruno Mannina
Dear Solr User, We have to do a new web project which is : Connect our SOLR database to a web plateform. This Web Plateform will be used by several users at the same time. They do requests on our SOLR and they can apply filter on the result. i.e.: Our SOLR contains 87M docs An user do request

Re: Changing indexed property on a field from false to true

2013-10-23 Thread Upayavira
The content needs to be re-indexed, the question is whether you can use the info in the index to do it rather than pushing fresh copies of the documents to the index. I've often wondered whether atomic updates could be used to handle this sort of thing. If all fields are stored, push a nominal upd

RE: SOLR Cloud node link is wrong in the admin panel

2013-10-23 Thread Branham, Jeremy [HR]
It seems the parameters in solr.xml are being ignored. Jeremy D. Branham Performance Technologist II Sprint University Performance Support Fort Worth, TX | Tel: **DOTNET http://JeremyBranham.Wordpress.com http://www.linkedin.com/in/jeremybranham -Original Message- From:

Multiple facet fields in "defaults" section of a Request Handler

2013-10-23 Thread Varun Thacker
I define 2 facets - "brand" and "category". Both have been configured in a request handler inside "defaults" Now a client wants to use multi select faceting. He calls the following API: http://localhost:8983/solr/collection1/search?q=*:*&facet.field={!ex=foo}category&fq={!tag=foo}category :"cat"

Re: deleteByQuery does not work with SolrCloud

2013-10-23 Thread Erick Erickson
The first thing I'd do is go in to the browser UI and make sure you can get hits on documents, something like blah/collection/q=indexname:shardTv_20131010 Best, Erick On Wed, Oct 23, 2013 at 8:20 AM, YouPeng Yang wrote: > Hi > I am using SolrCloud withing solr 4.4 ,and I try the SolrJ API > d

Changing indexed property on a field from false to true

2013-10-23 Thread michael.boom
Being given indexed="false"* stored="true" multiValued="false" /> Changed to indexed="true"* stored="true" multiValued="false" /> Once the above is done and the collection reloaded, is there a way I can build that index on that field, without reindexing the everything? Thank you! - Thank

Re: External Zookeeper and JBOSS

2013-10-23 Thread Erick Erickson
When you create the collection, you specify the number of shards you want. >From there on, the data is stored in ZK, I don't think shows up in your solr.xml file. Best, Erick On Tue, Oct 22, 2013 at 7:08 PM, Branham, Jeremy [HR] < jeremy.d.bran...@sprint.com> wrote: > [collections] was empty un

Re: SolrCloud performance in VM environment

2013-10-23 Thread Erick Erickson
Be a bit careful here. 128G is lots of memory, you may encounter very long garbage collection pauses. Just be aware that this may be happening later. Best, Erick On Tue, Oct 22, 2013 at 5:04 PM, Tom Mortimer wrote: > Just tried it with no other changes than upping the RAM to 128GB total, and >

Re: Solr cloud weird behaviour

2013-10-23 Thread Erick Erickson
When you say "missing files", do you mean the index segments are missing or what? Are your document counts the same the night before and after? Is there any indexing going on? We need some more specifics if we're to help you. If you do have indexing going on, then you might be getting segment mer

Having two document sets in one index, "separated" by filter query.

2013-10-23 Thread Achim Domma
Hi, I have two document sets, both having the same schema. On set is the larger reference set (lets say a few hundred thousand documents) and the smaller set is some user generated content (a few hundreds or thousands). In most cases, I just want to search on the larger reference sets but some

Query cache and group by queries

2013-10-23 Thread Kalle Aaltonen
Hi, It seems that query cache is not used to all for group queries? Can someone explain why this is?

Re: Chinese language search in SOLR 3.6.1

2013-10-23 Thread Poornima Jay
Hi Rajani, The string field type is not analyzed. But that is not the case for text_chinese field type for which is  ChineseTokenizerFactory and  ChineseFilterFactory is added for index and query analysis. Below check the schema and the fields how it is defined in my above mail. Thanks, Poornim

Re: Class name of parsing the fq clause

2013-10-23 Thread Jack Krupansky
Not in just a few words. Do you have specific questions? I mean none of that relates to parsing of fq, the topic of this particular email thread, right? -- Jack Krupansky -Original Message- From: Sandeep Gupta Sent: Wednesday, October 23, 2013 3:58 AM To: solr-user@lucene.apache.org S

Re: Stop/Restart Solr

2013-10-23 Thread Furkan KAMACI
Did you check that is it running as a service or not? If it runs as a service when even you kill the process it may start again. 2013/10/23 Jeevanandam M. > It seems process started recently. Is there any external cron/process > triggering a startup of Solr? > Kill again and monitor it. > > - J

Re: Indexing logs files of thousands of GBs

2013-10-23 Thread Chris Geeringh
Prerna, The FileListEntityProcessor has a terribly inefficient recursive method, which will be using up all your heap building a list of files. I would suggest writing a client application and traverse your filesystem with NIO available in Java 7. Files.walkFileTree() and a FileVisitor. As you "

Re: fq with { or } in Solr 4.3.1

2013-10-23 Thread Jack Krupansky
Are you using the edismax query parser? It traps the syntax error and then escapes or ignores special characters. Curly braces are used for exclusive range queries (square brackets are inclusive ranges). The proper syntax is "{term1 TO term2}". So, what were your intentions with "catid:{123}"

New shard leaders or existing shard replicas depends on zookeeper?

2013-10-23 Thread Hoggarth, Gil
Hi solr-users, I'm seeing some confusing behaviour in Solr/zookeeper and hope you can shed some light on what's happening/how I can correct it. We have two physical servers running automated builds of RedHat 6.4 and Solr 4.4.0 that host two separate Solr services. The first server (called l

Re: DIH - URLDataSource import size

2013-10-23 Thread Shalin Shekhar Mangar
Following up within 15 hours is not going to do any good -- it just increases email traffic for everyone. Please understand that a lot of people here are in different time zones and almost all of them are volunteers answering questions in addition to their day jobs. Are there any exceptions in the

Re: Question about sharding and overlapping

2013-10-23 Thread Shalin Shekhar Mangar
No, shard splitting does not support collections with implicit router. On Wed, Oct 23, 2013 at 1:21 PM, Yago Riveiro wrote: > Can I split shards as with compositeId using this method? > > On Wednesday, October 23, 2013, Shalin Shekhar Mangar wrote: > > > You can't control that if using the compo

Why analyzer only output part of my string ?

2013-10-23 Thread Mingzhu Gao
Hi All , I have configured a custom analyzer (Chinese) in solr 4.5.0 , when I access http://localhost:8983/solr/#/collection1/analysis , Choose my fieldType , and input some character string , why only part of string is analyzed ? the last part of string is dismissed. Is there any length limit

RE: Stop/Restart Solr

2013-10-23 Thread Jeevanandam M.
It seems process started recently. Is there any external cron/process triggering a startup of Solr? Kill again and monitor it. - Jeeva -- Original Message -- From: Raheel Hasan [mailto:raheelhasan@gmail.com] Sent: October 23, 2013 3:29:47 PM GMT+05:30 To: solr-user@lucene.apa

Re: Stop/Restart Solr

2013-10-23 Thread Raheel Hasan
31173 1 0 16:45 ?00:00:08 java -jar start.jar On Wed, Oct 23, 2013 at 2:53 PM, Jeevanandam M. wrote: > Can you please share output of following command? > ps -ef | grep 'start.jar' > > - Jeeva > > -- Original Message -- > From: Raheel Hasan [mailto:raheelhasan@g

Minor bug with CloudSolrServer and collection-alias.

2013-10-23 Thread Thomas Egense
I found this bug in both 4.4 and 4.5 Using cloudSolrServer.setDefaultCollection(collectionId) does not work as intended for an alias spanning more than 1 collection. The virtual collection-alias collectionID is recoqnized as a existing collection, but it does only query one of the collections it i

RE: Stop/Restart Solr

2013-10-23 Thread Jeevanandam M.
Can you please share output of following command? ps -ef | grep 'start.jar' - Jeeva -- Original Message -- From: Raheel Hasan [mailto:raheelhasan@gmail.com] Sent: October 23, 2013 3:19:46 PM GMT+05:30 To: solr-user@lucene.apache.org Subject: Re: Stop/Restart Solr Kill -9 ###

Re: Stop/Restart Solr

2013-10-23 Thread Raheel Hasan
also, is this DSTOP.PORT same as on which solr is visible on a browser (i.e. like 8983 from http://localhost:8983)? On Wed, Oct 23, 2013 at 2:49 PM, Raheel Hasan wrote: > Kill -9 didnt kill it... ... the process is now again listed, but > with PPID=1 which I dont want to kill as many proces

Re: Stop/Restart Solr

2013-10-23 Thread Raheel Hasan
Kill -9 didnt kill it... ... the process is now again listed, but with PPID=1 which I dont want to kill as many processes have this same id... On Tue, Oct 22, 2013 at 11:59 PM, Utkarsh Sengar wrote: > We use this to start/stop solr: > > Start: > java -Dsolr.clustering.enabled=true -Dsolr.so

RE: fq with { or } in Solr 4.3.1

2013-10-23 Thread michael.boom
For filtering categories i'm using something like this : fq=category:(cat1 OR cat2 OR cat3) - Thanks, Michael -- View this message in context: http://lucene.472066.n3.nabble.com/fq-with-or-in-Solr-4-3-1-tp4097170p4097183.html Sent from the Solr - User mailing list archive at Nabble.com.

Issue with large html indexing

2013-10-23 Thread Raheel Hasan
Hi, I have an issue here while indexing large html. Here is the confguration for that: 1) Data is imported via URLDataSource / PlainTextEntityProcessor (DIH) 2) Schema has this for the field: type="text_en_splitting" indexed="true" stored="false" required="false" 3) text_en_splitting has the fo

RE: Facet performance

2013-10-23 Thread Toke Eskildsen
On Tue, 2013-10-22 at 17:25 +0200, Lemke, Michael SZ/HZA-ZSW wrote: > On Tue, October 22, 2013 11:54 AM Andre Bois-Crettez wrote: > >> This is with Solr 1.4. > >Really ? > >This sound really outdated to me. > >Have you tried a tried more recent version, 4.5 just went out ? > > Sorry, can't. Too m

RE: fq with { or } in Solr 4.3.1

2013-10-23 Thread Peter Kirk
Sorry, that was just a typo. / search?q=*:*&fq=catid:{123} Gives me the error. I think that { and } must be used in ranges for fq, and that's why I can't use them directly like this. /Peter -Original Message- From: Upayavira [mailto:u...@odoko.co.uk] Sent: 23. oktober 2013 10:52 T

Re: fq with { or } in Solr 4.3.1

2013-10-23 Thread Upayavira
Missing a colon before the curly bracket in the fq? On Wed, Oct 23, 2013, at 09:42 AM, Peter Kirk wrote: > Hi > > If I do a search like > /search?q=catid:{123} > > I get the results I expect. > > But if I do > /search?q=*:*&fq=catid{123} > > I get an error from Solr like: > org.apache.solr.se

fq with { or } in Solr 4.3.1

2013-10-23 Thread Peter Kirk
Hi If I do a search like /search?q=catid:{123} I get the results I expect. But if I do /search?q=*:*&fq=catid{123} I get an error from Solr like: org.apache.solr.search.SyntaxError: Cannot parse 'catid:{123}': Encountered " "}" "} "" at line 1, column 58. Was expecting one of: "TO" ... ...

Re: Adding documents in Solr plugin

2013-10-23 Thread Avner Levy
I've tried to write the plugin code. Currently I do: AddUpdateCommand addUpdateCommand = new AddUpdateCommand(solrQueryRequest); DocIterator iterator = docList.iterator(); SolrIndexSearcher indexReader = solrQueryRequest.getSearcher();

Re: Class name of parsing the fq clause

2013-10-23 Thread Sandeep Gupta
Thanks Jack for detailing out the parser logic. Would it be possible for you to say something more about filter cache code flow... sometimes we do not use fq parameter in query string and pass the raw query Regards Sandeep On Mon, Oct 21, 2013 at 7:11 PM, Jack Krupansky wrote: > Start wit

Re: Question about sharding and overlapping

2013-10-23 Thread Yago Riveiro
Can I split shards as with compositeId using this method? On Wednesday, October 23, 2013, Shalin Shekhar Mangar wrote: > You can't control that if using the compositeIdRouter because the routing > is dependent on the hash function. What you want is custom sharding i.e. > the ability to control th

Re: DIH - URLDataSource import size

2013-10-23 Thread Raheel Hasan
anyone? On Tue, Oct 22, 2013 at 9:50 PM, Raheel Hasan wrote: > Hi, > > I have an issue that is only coming on live environment. The DIH > with URLDataSource is not working when the file size imported is large > (i.e. 100kb above - which is not so large). If its large, it returns > nothing (as se

Re: Solr Cloud Distributed IDF

2013-10-23 Thread Upayavira
Can you say more about the problem? What did you see that led to that problem? How did you distribute docs between shards, and how is that different from your 3.6 setup? It might be a distributed IDF thing, or it could be something simpler. Upayavira On Wed, Oct 23, 2013, at 03:26 AM, dboychuck

Re: Solr Cloud Distributed IDF

2013-10-23 Thread Toke Eskildsen
On Wed, 2013-10-23 at 04:26 +0200, dboychuck wrote: > I recently moved an index from 3.6 non-distributed to Solr Cloud 4.4 with > three shards. My company uses a boosting function with a value assigned to > each document. This boosting function no longer works dependably and I > believe the cause i