Re: Contrib module for Document Clustering

2016-04-06 Thread davidphilip cherian
gt; Joel Bernstein > http://joelsolr.blogspot.com/ > > On Wed, Apr 6, 2016 at 1:36 PM, davidphilip cherian < > davidphilipcher...@gmail.com> wrote: > > > Any thoughts? > > > > > > On Tue, Apr 5, 2016 at 9:05 PM, davidphilip cherian < > > davidphilipcher...@gm

Re: Contrib module for Document Clustering

2016-04-06 Thread davidphilip cherian
Any thoughts? On Tue, Apr 5, 2016 at 9:05 PM, davidphilip cherian < davidphilipcher...@gmail.com> wrote: > Hi, > > Is there any contribution(open source contrib module) that routes > documents to shards based on document similarity technique? Or any > suggestions that inte

Contrib module for Document Clustering

2016-04-05 Thread davidphilip cherian
Hi, Is there any contribution(open source contrib module) that routes documents to shards based on document similarity technique? Or any suggestions that integrates mahout to solr for this use case? >From what I know, currently there are two document route strategies as explained here https://luc

Re: understanding phonetic matching

2016-03-23 Thread davidphilip cherian
The "phonetic_en" analyzer definition available in solr-schema does return documents having "Jon", "JN", "John" when search term is "John". Checkout screen shot here : http://imgur.com/0R6SvX2 This wiki page explains how phonetic matching works : https://cwiki.apache.org/confluence/display/solr/Ph

hitratio vs cumulative_hitratio

2016-02-04 Thread davidphilip cherian
Solr caching : What does it mean have lookup=0, hits=0 and hitratio=0 but cumulative_hitratio=0.75 and cumulative_lookups >100,000 with cumulative_inserts >20k and cumulative_evictions =0, maxSize of cache objects is 512

Re: Close and open vendor sorting

2016-01-22 Thread davidphilip cherian
Hi Irshad, So, assuming that each vendor information is one solr document, you will have information regarding the vendors open-close hours correct? You should be indexing this content in one of the fields, isn't it? If yes, then you should try something as explained: When the user searches, *capt

Solr node 'Gone' status

2016-01-18 Thread davidphilip cherian
Hi, Solr-admin cloud view page has got another new radio button indicating status of node : 'Gone' status. What does that mean? One of my collection is in that state and it is not serving any request. How to bring that up?

Re: Error while reloading collection

2016-01-13 Thread davidphilip cherian
You should probably ask this question here http://www.cloudera.com/content/cloudera-content/cloudera-docs/CM4Ent/4.7.0/Cloudera-Manager-Introduction/cmi_getting_help_and_support.html On Thu, Jan 14, 2016 at 12:11 PM, vidya wrote: > Hi > I am using solrcloud on cloudera cluster. I have created

Re: Configuring cores to persist in the event of Solr restart

2016-01-10 Thread davidphilip cherian
What do you mean by cores getting deleted? Files created on filesystem for these cores disappear? How are you starting and stopping solr? Is this solr cloud or standalone mode? On Sun, Jan 10, 2016 at 2:59 PM, Allan Kamau wrote: > We are able to load several cores into Solr 5.3.1. > The problem

Re: solr 5.2.0 need to build high query response

2016-01-05 Thread davidphilip cherian
You should use solr softcommit for this use case. So, by setting softcommit to 5 seconds and autoCommit to minute with openSearcher=false should do the work. 6 false 2000 Reference link- https://cwiki.apache.org/confluence/display/solr/Near+Real+Time+Searching To know more about solr

Re: Querying with action parameter included in URL

2016-01-04 Thread davidphilip cherian
Hi Vidya, I think you are confused with solr search queries/requests with solr other restful apis to perform CRUD operations on collections. Sample of search queries are list here with standard query parser : https://cwiki.apache.org/confluence/display/solr/The+Standard+Query+Parser Solr collect

Re: Using post.jr for indexing in Solr 5.4.0

2015-12-30 Thread davidphilip cherian
Did you try using the latest post jar (shipped with 5.4 version) to index same docs? What happened? Did you get the same error or no? On Thu, Dec 31, 2015 at 11:12 AM, Zheng Lin Edwin Yeo wrote: > Hi, > > Would like to find out, is there any changes to the post.jar for Solr 5.4? > I tried to u

Re: Having replica will slow down Solr?

2015-12-30 Thread davidphilip cherian
Few questions: What are your memory configs: xms and xmx values, available ram size? Were you also indexing at the time when the search response was so slow? In the sense, were you committing often with openSearcher=true, in any case? What is the QTime for consequent searches? Are the caches ena

Re: problem with solr plugin

2015-12-19 Thread davidphilip cherian
Hi Sara, The error is clear: class not found exception, which means solr couldn't locate that jar file. If you are not using solr-cloud then place that custom jar under solr_home/lib folder. You can also hard code the path of this jar file in solrconfig.xml under /lib element. If you are using s

Re: Issue in Geospatial Search

2015-12-15 Thread davidphilip cherian
What is the fieldType of the field "latlon" in older schema as well as new schema? Have you confirmed that both are same? On Tue, Dec 15, 2015 at 3:18 PM, Shenbagarajan wrote: > Hello, > > I am trying to implement geo spatial search in solr by referring the below > site. > https://cwiki.apache.

Re: Create Collection Admin Request - unable to specify collection configName

2015-12-01 Thread davidphilip cherian
If I'm carefully looking at your commands, the IP are different. For upconfig "54.XX.XX.124:2181" For downconfig, you used 54.XX.XX.84:2181 Are these zookeeper on an ensemble setup? Anyways, to avoid confusion, what I'd recommend is, just spin one only zookeeper first, start solr by passing respe

Re: unsubscribe me.

2015-11-18 Thread davidphilip cherian
You should probably send an email to solr-user-unsubscr...@lucene.apache.org Reference links http://lucene.apache.org/solr/resources.html#community https://wiki.apache.org/solr/Unsubscribing%20from%20mailing%20lists On Wed, Nov 18, 2015 at 1:04 PM, Pramod wrote: > please unsubscribe me. > >

Re: Simple web interface for queries

2015-11-10 Thread davidphilip cherian
How about velocity search UI? You can access that using /browse https://cwiki.apache.org/confluence/display/solr/Velocity+Search+UI On Tue, Nov 10, 2015 at 10:14 AM, Nigel Robbins < nigel.robb...@wallstreetsystems.com> wrote: > Hi, > > > > Just wondering if there is a simplified web user inte

Convert output response xml into input xml format using xslt

2015-11-09 Thread davidphilip cherian
Has anyone written a sample xslt (and would like to share) that converts output response xml of solr into its input format, to repost/reindex it back? Thanks

Re: SnapPuller Exception in Slave server

2015-11-08 Thread davidphilip cherian
Hi, It is in the exception that 'master node' is not available, and thus reporting IO exception. Did you check if the master instance was running when these exceptions were reported? Where are these two nodes located? Is there a network latency? Do you see any mismatch in the numFound, at master

Re: solr-8983-console.log is huge

2015-11-08 Thread davidphilip cherian
I think, the collection reload, using collection apis[1] should do the work. If you are not on solr-cloud setup, just navigate to coreadmin on solr UI, and click on reload, and see if that works. [1] https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api2 http: //loca

Re: Solr Keyword query on a specific field.

2015-11-08 Thread davidphilip cherian
ied > within the Title sections's keywords. My goal is to have the Title keywords > work the same as the general keywords but only be applied to the title > field vs the default text field. > > On Fri, Oct 30, 2015 at 6:35 PM, davidphilip cherian < > davidphilipcher...@gmail.

Re: solr-8983-console.log is huge

2015-11-06 Thread davidphilip cherian
>From mail archives https://support.lucidworks.com/hc/en-us/articles/207072137-Solr-5-X-Console-Logging-solr-8983-console-log On Fri, Nov 6, 2015 at 1:10

SSL on Solr with CA signed certificate

2015-11-02 Thread davidphilip cherian
The doc[1] on reference guide provides steps related to setting up ssl with self signed certificate. My employer wants me to set up and test with CA signed certificate. When I go to buy[2] a ssl certificate(just for testing), it asks for specific web server name and jetty is not listed on it. Is t

Re: Solr Keyword query on a specific field.

2015-10-30 Thread davidphilip cherian
>> "Is there any way to have a single field search use the same keyword search logic as the default query?" Do a phrase search, with double quotes surrounding the multiple keywords, it should work. Try q=title:("Test Keywords") You could possibly try adding this q.op as local param to query as sh

Re: Sort not working as expected

2015-10-30 Thread davidphilip cherian
You can create a copy field with string type and make it copy from this existing field, and sort on this new one. That way, you can still continue doing text search on existing one and sort on this new field. On Fri, Oct 30, 2015 at 3:04 PM, Brian Narsi wrote: > Is there no way that the exis

String index out of range exception from Spell check

2015-09-27 Thread davidphilip cherian
There are irregular exceptions from spell check component. Below is the stack trace. This is not common for all the q terms but have often seen them occurring for specific queries after enabling spellcheck.collate method. String index out of range: -3 java.lang.StringIndexOutOfBoundsException

Re: Problems upgrading to Solr 5.3.0

2015-09-07 Thread davidphilip cherian
If you have a look at solr 5.3 'changes.txt', there are some jiras (resolved) that are related to ICU Tokenizer. You could possibly try taking the latest jar of this tokenizer from solr 5.3 binaries and replace in 5.2 and try. On Mon, Sep 7, 2015 at 9:49 AM, Zheng Lin Edwin Yeo wrote: > Hi, >

Re: Issue Using Solr 5.3 Authentication and Authorization Plugins

2015-09-04 Thread davidphilip cherian
Hi Kevin/Noble, What is the download link to take the latest? What are the steps to compile it, test and use? We also have a use case to have this feature in solr too. Therefore, wanted to test and above info would help a lot to get started. Thanks. On Fri, Sep 4, 2015 at 1:45 PM, Kevin Lee wr

Logging in solr admin page

2015-08-17 Thread davidphilip cherian
Hi, Where are the logs fetched from on solr admin ui page? http://localhost:8983/solr/#/~logging. I am unable to see any logs there. Its just showing the 'loading' symbol but no logs fetched. What could be the reason? Any logging setting that has to be made? Thanks.

Re: No. of records mismatch

2015-08-16 Thread davidphilip cherian
Hi, You should check whether there were deletions by navigating to solr admin core admin page. Example url http://localhost:8983/solr/#/~cores/test_shard1_replica1, check for numDocs, maxDocs and deletedDocs. If numDocs remains equal to maxDocs, then you confirm that there were no updations (as re

Re: Make search faster in Solr

2015-08-10 Thread davidphilip cherian
Hi Nitin, 32 shards for 16 million documents is too much. 2 shards should suffice considering your document sizes are moderate. Caches are to be monitored and tuned accordingly. You should study about caches a bit here https://cwiki.apache.org/confluence/display/solr/Query+Settings+in+SolrConfig

Re: Changing solr.Date to solr.TrieDate

2015-08-10 Thread davidphilip cherian
Hi Saurabh, You could probably try command=fetchindex functionality. http://node:port /solr//replication?command=fetchindex&masterUrl=http://node:port /solr/ Master url should be the existing index solr instance url. On Mon, Aug 10, 2015 at 6:37 PM, saurabh tewari wrote: > Hi, > > I recently s

Re: Is cache enabled by default?

2015-08-10 Thread davidphilip cherian
Hi Nitin, You can just set the attributes of caches to zero. size="0" initialSize="0"autowarmCount="0" and so on. Why do you want to turn off caches btw? Any specific reasons?IMO, documents are cached in OS disc cache space which you may not able to control. It is OS specific. I don't quite

Collection APIs to create collection and custom cores naming

2015-08-02 Thread davidphilip cherian
How to use the 'property.name=value' in the api example[1] to modify core.properties value of 'name' While creating the collection with below query[2], the core names become 'aggregator_shard1_replica1' and 'aggregator_shard2_replica1'. I wanted have specific/custom name for each of these cores. I

Re: java.lang.IllegalStateException: Too many values for UnInvertedField faceting on field content

2015-07-20 Thread davidphilip cherian
I think you should just make docValues=true and reindex. But be warned that faceting is generally not performed on field that are of type text and tokenized. They should be string if they are not numeric. What is analysis chain of 'text_fa'? On Mon, Jul 20, 2015 at 8:16 PM, Ali Nazemian wrote:

Re: Protwords in solr spellchecker

2015-07-10 Thread davidphilip cherian
hat you can suggest me. > > > > > On Thu, Jul 9, 2015 at 12:09 PM, davidphilip cherian < > davidphilipcher...@gmail.com> wrote: > > > The best bet is to use solr.StopFilterFactory. > > Have all such words added to stopwords.txt and add this filter to yo

Re: Protwords in solr spellchecker

2015-07-08 Thread davidphilip cherian
The best bet is to use solr.StopFilterFactory. Have all such words added to stopwords.txt and add this filter to your analyzer. Reference links https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.StopFilterFactory https://cwiki.apache.org/confluence/display/solr/Filter+Descriptions#F

Re: Tokenizer and Filter Factory to index Chinese characters

2015-07-06 Thread davidphilip cherian
Hi Edwin, Have you tried the Paoding analyzer? It is not out of the box shipped with Solr jars. You may have to download it and add it to solr libs. https://stanbol.apache.org/docs/trunk/components/enhancer/nlp/paoding 2015-07-06 12:29 GMT+05:30 Zheng Lin Edwin Yeo : > I'm now using the solr

Re: Solr 5.2.1 setup zookeeper ensemble problem

2015-07-06 Thread davidphilip cherian
Hi Adrian, What is the error that you are getting? In order to upload configs files, you could use zkcli.sh script that will be shipped with solr and use the upconfig command. ./server/scripts/cloud-scripts/zkcli.sh -zkhost 127.0.0.1:9983 \ -cmd upconfig -confname my_new_config -confdir serve

Re: Solr Suggester not working.

2015-06-25 Thread davidphilip cherian
Did you have a quick look at this blog? http://lucidworks.com/blog/solr-suggester/ On Fri, Jun 26, 2015 at 12:05 PM, ssharma7...@gmail.com < ssharma7...@gmail.com> wrote: > Hi, > I am new to Solr & have installed Solr 5.1. I am trying to get the > Suggester > component working without any succe

Re: Contribute the Customized Phonetic Filter to Apache Solr

2015-06-18 Thread davidphilip cherian
Hi Aman, https://wiki.apache.org/solr/HowToContribute HTH On Thu, Jun 18, 2015 at 12:11 PM, Aman Tandon wrote: > Hi, > > We created the new phonetic filter, It is working great on our products, > mostly of our suppliers are Indian, it is quite helpful for us to provide > the exact result e.g.

Re: Securing Solr 5.0.0

2015-03-23 Thread davidphilip cherian
Hi Frederik Arnold, Could you please blog it? the steps to setup the same using apache as reverse proxy and share it with community? On Sun, Mar 22, 2015 at 10:16 PM, Frederik Arnold wrote: > I have and I tried all sorts of things and they didn't work. > But I figured it out now. I setup Apache

Re: IP Address assgined to solr instance during the Cloud mode start

2015-03-19 Thread davidphilip cherian
Hi Shawn, Thanks you for the detailed explanation. On Thu, Mar 19, 2015 at 7:31 PM, Shawn Heisey wrote: > On 3/19/2015 2:02 AM, davidphilip cherian wrote: > > When I started solr in cloud mode(interactive) and chose 2 nodes, it > > started and in the cloud-view screen it showed

Re: IP Address assgined to solr instance during the Cloud mode start

2015-03-19 Thread davidphilip cherian
I think this is because of change in network ip address. I got it. Thanks. On Thu, Mar 19, 2015 at 1:32 PM, davidphilip cherian < davidphilipcher...@gmail.com> wrote: > Hi, > > When I started solr in cloud mode(interactive) and chose 2 nodes, it > started and in the cloud-vie

Start stop solr started in solr cloud mode

2015-03-19 Thread davidphilip cherian
Hi, I started solr in cloud mode (interactive set up). 3 nodes, 3 shards and 1 replica and a collection. I stopped it using ./solr stop -all. How do I get the same above cloud mode setup to start? "./solr -c start" started the new solr cloud instance all together where as I was looking for the pr

IP Address assgined to solr instance during the Cloud mode start

2015-03-19 Thread davidphilip cherian
Hi, When I started solr in cloud mode(interactive) and chose 2 nodes, it started and in the cloud-view screen it showed some different ip with url 169.254.5.207:7574, when clicked on that, it says page not found. When I modified url to localhost(http://localhost:7574/solr/#/~cloud) it worked(loade

Solr 5 admin screen description

2015-03-13 Thread davidphilip cherian
Hi Group, I would like to understand the solr's dashboard screen, particularly the screen *system* properties displayed such as Physical Memory, Swap Space, File Desc. count and etc. 1. What is the importance of file descriptor count? 2. Dashboard shows that it has taken 16 gb of space out of wh