Re: better stemming engine than Porter?

2008-04-22 Thread Jay
Hi Wagner, Thanks for the intro of KStem! I quickly scanned the original paper on KStem by Robert Krovetz but could not find any timing comparison data on KStem and Porter stem. I wonder how slow/fast Kstem is compared to Porter stem based on your use in your application? Jay Wagner,Harry

facet by update date

2016-01-24 Thread Jay Potharaju
updated today will show will be in the facet "24 hrs" and when the same query runs 2 weeks from today, the document will be marked as "< 1 month". How can I set my facets to get the above values? -- Thanks Jay

Re: facet by update date

2016-01-24 Thread Jay Potharaju
cet.interval.set=[NOW-1YEAR,NOW-1MONTH) > > I do not know if this is a correct way of doing this, but I did not find > anything better. > Here is link for interval faceting in wiki: > > https://cwiki.apache.org/confluence/display/solr/Faceting#Faceting-IntervalFaceting > >

Re: facet by update date

2016-01-24 Thread Jay Potharaju
. > > Erik > > > On Jan 24, 2016, at 18:38, Jay Potharaju wrote: > > > > Thanks Pavel, > > I was trying it using the range faceting instead of facet.interval. Can > > someone comment on performance of using facet.interval with sharded > index > >

Handling fields used for both display & index

2016-01-31 Thread Jay Potharaju
Hi, I am trying to decide if I should use text_en or string as my field type. The fields have to be both indexed and stored for display. One solution is to duplicate fields, one for indexing other for display.One of the field happens to be a description field which I would like to avoid duplicating

Solr-8496

2016-01-31 Thread Jay Potharaju
ys have the option > of swapping the "real" collection that you are using without needing to > change anything else. I'll need to ask some people if they think this > is a good documentation addition, and think of a good place to mention > it in the reference guide. > > Thanks, > Shawn > > -- Thanks Jay Potharaju

Re: Handling fields used for both display & index

2016-02-03 Thread Jay Potharaju
Thanks for the response Sameer & Binoy. Jay On Sun, Jan 31, 2016 at 6:13 PM, Binoy Dalal wrote: > Adding to sameer's answer, use string types when you want exact matches, > both in terms of query and case. > In case you want to perform additional operations on the input, like

Custom field using PatternCaptureGroupFilterFactory

2016-03-06 Thread Jay Potharaju
? -- Thanks Jay

Re: Custom field using PatternCaptureGroupFilterFactory

2016-03-06 Thread Jay Potharaju
raw_bytes start end positionLength type position TEST [54 45 53 54] 0 4 1 word 1 On Sun, Mar 6, 2016 at 9:56 AM, Binoy Dalal wrote: > What do you see under the analysis screen in the solr admin UI? > > On Sun, Mar 6, 2016 at 10:55 PM Jay Potharaju > wrote: > > > Hi, > >

Re: Custom field using PatternCaptureGroupFilterFactory

2016-03-07 Thread Jay Potharaju
Thanks Jack, the problem was my regex. Following regex worked. Jay On Sun, Mar 6, 2016 at 7:43 PM, Jack Krupansky wrote: > The filter name, "Capture Group", says it all - only pattern groups are > captured and you have not specified even a single group. See the example: >

JSON FACET API - multiselect

2016-03-09 Thread Jay Potharaju
} *Multi-select facet* fq: [ "{!tag=FIRSTLETTER}facet_firstLetter_lastname:(Q J)" ], response: { numFound: 5246, start: 0, docs: [ ] } I was expecting the response count to be 540 + 4718 = 5258 but the response is 5246. Can someone comment on regarding this? -- Thanks Jay

Re: JSON FACET API - multiselect

2016-03-09 Thread Jay Potharaju
Actually there is a problem with my data..found my error. Thanks On Wed, Mar 9, 2016 at 9:24 AM, Jay Potharaju wrote: > Hi, > I am using solr 5.4 and testing the multi select JSON facet feature. > When I select 1 value the results are the same as number of counts for the > facet

solr & docker in production

2016-03-14 Thread Jay Potharaju
Hi, I was wondering is running solr inside a docker container. Are there any recommendations for this? -- Thanks Jay

Re: solr & docker in production

2016-03-14 Thread Jay Potharaju
Upayavira, Thanks for the feedback. I plan to deploy solr on its own instance rather than on instance running multiple applications. Jay On Mon, Mar 14, 2016 at 3:19 PM, Upayavira wrote: > There is a default Docker image for Solr on the Docker Registry. I've > used it to grea

Re: solr & docker in production

2016-03-15 Thread Jay Potharaju
I have not yet tried in production yet, will post my findings. Thanks Jay > On Mar 14, 2016, at 11:42 PM, Georg Sorst wrote: > > Hi, > > sounds great! > > Did you run any benchmarks? What's the IO penalty? > > Best, > Georg > > Jay Potha

Re: Making managed schema unmutable correctly?

2016-03-18 Thread Jay Potharaju
Does using schema API mean that no upconfig to zookeeper and no reloading of all the nodes in my solrcloud? In which scenario should I not use schema API, if any? Thanks Jay On Wed, Mar 16, 2016 at 6:22 PM, Shawn Heisey wrote: > On 3/16/2016 1:14 AM, Alexandre Rafalovitch wrote: > >

Re: Making managed schema unmutable correctly?

2016-03-20 Thread Jay Potharaju
Thanks appreciate the feedback. On Wed, Mar 16, 2016 at 8:23 PM, Shawn Heisey wrote: > On 3/16/2016 7:51 PM, Jay Potharaju wrote: > > Does using schema API mean that no upconfig to zookeeper and no reloading > > of all the nodes in my solrcloud? In which scenario should I n

Indexing using CSV

2016-03-20 Thread Jay Potharaju
the best way to debug this issue and secondly how do other people handle indexing data using csv data. -- Thanks Jay

understanding phonetic matching

2016-03-22 Thread Jay Potharaju
data. When I look at my solr data I dont see any similar sounding names in my solr data, even though I have set inject="true". Is that not how it is supposed to work? Can someone explain how phonetic matching works? -- Thanks Jay

Indexing multiple pdf's and partial update of pdf

2016-03-23 Thread Jay Parashar
my Solr document has existing fields populated and then I index a pdf, it seems it overwrites the document with the end result being just the contents of the pdf. I know we can do partial updates using SolrJ but is it possible to do partial updates of pdf using curl? Thanks Jay

RE: Indexing multiple pdf's and partial update of pdf

2016-03-24 Thread Jay Parashar
tp://localhost:8983/solr/techproducts/update/extract?&extractOnly=true"; --data-binary @example/exampledocs/sample.html -H 'Content-type:text/html' Thanks Jay -Original Message- From: Reth RM [mailto:reth.ik...@gmail.com] Sent: Thursday, March 24, 2016 12:24 AM To: s

number of zookeeper & aws instances

2016-04-13 Thread Jay Potharaju
Jay

Re: number of zookeeper & aws instances

2016-04-13 Thread Jay Potharaju
-in-the-abstract-why-we-dont-have-a-definitive-answer/ > > Best, > Erick > >> On Wed, Apr 13, 2016 at 6:47 AM, Jay Potharaju wrote: >> Hi, >> >> In my current setup I have about 30 million docs which will grow to 100 >> million by the end of the year. In

RE: Multiple data-config.xml in one collection?

2016-04-14 Thread Jay Parashar
You have to specify which one to run. Each DIH will run only one XML (e.g. health-topics-conf.xml) One thing, and please correct if wrong, I have noticed running DataImport for a particular config overwrites the existing data for a document...that is, there is no way to preserve the existing d

RE: Multiple data-config.xml in one collection?

2016-04-14 Thread Jay Parashar
Thanks a lot Daniel. -Original Message- From: Davis, Daniel (NIH/NLM) [C] [mailto:daniel.da...@nih.gov] Sent: Thursday, April 14, 2016 11:41 AM To: solr-user@lucene.apache.org Subject: RE: Multiple data-config.xml in one collection? Jay Parashar wrote: > One thing, and please corr

RE: Solr Support for BM25F

2016-04-14 Thread Jay Parashar
To use per-field similarity you have to add to your schema.xml file: And then in individual fields you can use the BM25 with different k1 and b. -Original Message- From: David Cawley [mailto:david.cawl...@mail.dcu.ie] Sent: Thursday, April 14, 2016 11:42 AM To: solr-user@lucene.apache.or

Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
e note my clusterstate.json is not updating correctly. Not sure if that is causing an issue. Any suggestions why the Addreplica command is not working. And is it related to the clusterstate.json? If yes, how can i fix it? -- Thanks Jay

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
it done fairly easily on the Linux command line. If > that's an option and you're interested, let me know - I have a rough but > accurate document. But perhaps others on the list will have the specific > answer you're looking for. > > On Thu, Apr 14, 2016 at 4:19 P

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
ome preparation > necessary... I'll send you the doc. > > On Thu, Apr 14, 2016 at 4:55 PM, Jay Potharaju > wrote: > > > Curious what command did you use? > > > > On Thu, Apr 14, 2016 at 3:48 PM, John Bickerstaff < > > j...@johnbickerstaff.com> > &g

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
Bickerstaff" wrote: >> >> 5.4 >> >> This problem drove me insane for about a month... >> >> I'll send you the doc. >> >> On Thu, Apr 14, 2016 at 5:02 PM, Jay Potharaju >> wrote: >> >>> Thanks John, which version of

Re: Adding replica on solr - 5.50

2016-04-14 Thread Jay Potharaju
at the > directories be created anyway. > > There is probably a way to do it through the UI, once Solr is installed on > a new machine - and IIRC, I did manage that, but my IT guy wanted > scriptable command lines. > > Also, IIRC, the stuff I did on the command line actually showed

Re: Adding replica on solr - 5.50

2016-04-15 Thread Jay Potharaju
>>> > `node_name=...`. > >>> > > >>> > > >>> > Thanks, > >>> > Jarek > >>> > > >>> > On Fri, 15 Apr 2016, at 05:48, John Bickerstaff wrote: > >>> >> Another thought - again probably not i

SOLR-3666

2016-04-15 Thread Jay Potharaju
only alternative I know is to get list of all nodes in the cloud and poll each one of them to check the DIH status. Not the most effective way but will work. -- Thanks Jay

Adding a new shard

2016-04-15 Thread Jay Potharaju
but not sure it will work. Any suggestions how are others handling this scenario in production. -- Thanks Jay

Re: Adding a new shard

2016-04-15 Thread Jay Potharaju
, Jay Potharaju wrote: > Hi, > I have an existing collection which has 2 shards, one on each node in the > cloud. Now I want to split the existing collection into 3 shards because of > increase in volume of data. And create this new shard on a new node in the > solrCloud. > > I

Re: Adding a new shard

2016-04-17 Thread Jay Potharaju
es. How many docs are on each? Why do you think > you need more shards? Query speed? OOMs? Java heaps > getting too big? > > Best, > Erick > > On Fri, Apr 15, 2016 at 10:50 PM, Jay Potharaju > wrote: > > I found ticket https://issues.apache.org/jira/browse/SOLR-5025 whic

Re: Adding a new shard

2016-04-18 Thread Jay Potharaju
nning on M2. No downtime during any of > this. > > 4> You should be able to delete S1 now from M1 just to tidy up. > > 5> Repeat for the other shards. > > Best, > Erick > > > On Sun, Apr 17, 2016 at 3:09 PM, Jay Potharaju > wrote: > > Erik tha

Re: NoSuchFileException errors common on version 5.5.0

2016-04-21 Thread Jay Potharaju
ibutes(Files.java:1737) > > at java.nio.file.Files.size(Files.java:2332) > > at > > org.apache.lucene.store.FSDirectory.fileLength(FSDirectory.java:210) > > > > Something else to note: It wasn't 5.5.0 that I had installed, it was > > 5.5.0-SNAPSHOT -- I installed it some time before 5.5.0 was released. > > Looks like I did the install of that version on January 29th. > > > > Thanks, > > Shawn > > > > > -- Thanks Jay Potharaju

measuring query performance & qps per node

2016-04-25 Thread Jay Potharaju
depends on the load and type of query running at a given time. -- Thanks Jay

Re: measuring query performance & qps per node

2016-04-25 Thread Jay Potharaju
rn is all queries like 'q=*:*&fq=clause1&fq=clause2" then it's > fine. It totally > falls down if, for instance, you have a bunch of facets. Or grouping. > Or. > > Best, > Erick > > On Mon, Apr 25, 2016 at 3:48 PM, Jay Potharaju > wrote: > &g

Re: Decide on facets from results

2016-04-28 Thread Jay Potharaju
On the same lines as Erik suggested but using facet stats instead. you can get stats on your facet fields in the first pass and then include the facets that you need in the second pass. > On Apr 27, 2016, at 1:21 PM, Mark Robinson wrote: > > Thanks Eric! > So that will mean another call will

Using updateRequest Processor with DIH

2016-05-01 Thread Jay Potharaju
using Request processor. I tried the following but got an error. Any recommendations on how to use this correctly? index_time data-config.xml update_indextime Error: Error from server at unknown UpdateRequestProcessorChain: update_indextime -- Thanks Jay

Re: query action with wrong result size zero

2016-05-05 Thread Jay Potharaju
t; i found a strange thing with solr query,when i set the value of query > field like "brand:amd",the size of query result is zero,but the real data > is not zero,can some body tell me why,thank you very much!! > my english is not very good,wish some body understand my words! > -- Thanks Jay Potharaju

Filter queries & caching

2016-05-05 Thread Jay Potharaju
:[* TO NOW/DAY+1DAY]&& tofield:[NOW/DAY-7DAY TO *] && type:"abc" The way these queries are currently written I think wont leverage the filter cache because of "*". Is there a better way to write this query so that I can leverage the cache. -- Thanks Jay

Re: Filter queries & caching

2016-05-05 Thread Jay Potharaju
tly. > > Also consider disabling caching (using cost) in expensive queries: > http://yonik.com/advanced-filter-caching-in-solr/ > > Ahmet > > > > On Thursday, May 5, 2016 8:25 PM, Jay Potharaju > wrote: > Hi, > I have a filter query that gets documents based on da

Re: Filter queries & caching

2016-05-05 Thread Jay Potharaju
AY-7DAY TO *] ) fq=filter(type:abc) Is this something that I would need to determine by running some test Thanks On Thu, May 5, 2016 at 1:44 PM, Jay Potharaju wrote: > Are you suggesting rewriting it like this ? > fq=filter(fromfield:[* TO NOW/DAY+1DAY]&& tofield:[NOW/DAY-7DAY TO

Re: Filter queries & caching

2016-05-06 Thread Jay Potharaju
Thanks Shawn,Erick & Ahmet , this was very helpful. > On May 6, 2016, at 6:19 AM, Shawn Heisey wrote: > >> On 5/5/2016 2:44 PM, Jay Potharaju wrote: >> Are you suggesting rewriting it like this ? >> fq=filter(fromfield:[* TO NOW/DAY+1DAY]&& tofield:[NOW/DA

Re: Filter queries & caching

2016-05-06 Thread Jay Potharaju
We have high query load and considering that I think the suggestions made above will help with performance. Thanks Jay On Fri, May 6, 2016 at 7:26 AM, Shawn Heisey wrote: > On 5/6/2016 7:19 AM, Shawn Heisey wrote: > > With three separate > > fq parameters, you'll get th

Re: understanding phonetic matching

2016-05-07 Thread Jay Potharaju
e text and the query text in the > > two boxes in the UI? I am not sure what "look at my solr data" means > > in this particular context. > > > > Regards, > >Alex. > > > > Newsletter and resources for Solr beginners and intermediates: &g

Re: understanding phonetic matching

2016-05-07 Thread Jay Potharaju
Thanks will check it out. On Sat, May 7, 2016 at 7:05 PM, Susheel Kumar wrote: > Jay, > > There are mainly three phonetics algorithms available in Solr i.e. > RefinedSoundex, DoubleMetaphone & BeiderMorse. We did extensive comparison > considering various tests cases and f

Re: Filter queries & caching

2016-05-08 Thread Jay Potharaju
rom the filter cache. Since scores are not cached, all documents that match the filter produce the same score (0 by default). Cached filters will be extremely fast when they are used again in another query. Thanks On Fri, May 6, 2016 at 9:46 AM, Jay Potharaju wrote: > We have high query loa

Re: Filter queries & caching

2016-05-09 Thread Jay Potharaju
gt; As I understand it useful incase you use an OR operator between two > restricting clauses. > Recall that multiple fq means implicit AND. > > ahmet > > > > On Monday, May 9, 2016 4:02 AM, Jay Potharaju > wrote: > As mentioned above adding filter() will add the fi

Error on creating new collection with existing configs

2016-05-09 Thread Jay Potharaju
issing something?? -- Thanks Jay

Re: Filter queries & caching

2016-05-09 Thread Jay Potharaju
) > q=filter(A) > q=filter(B) > > would only require two. Yet all of them would be satisfied only by > looking at the filterCache. > > Aside from the example immediately above, which one you use is largely > a matter of taste. > > Best, > Erick > > On Mon, Ma

solr multicore vs sharding vs 1 big collection

2015-08-01 Thread Jay Potharaju
oblem. What are my other alternatives to moving to a multicore collections.? Solr: 4.9 Index size:25 GB Max doc: 40 million Doc count:29 million Replication:4 4 servers in solrcloud. Thanks Jay

Re: solr multicore vs sharding vs 1 big collection

2015-08-02 Thread Jay Potharaju
ore memory to the server is not a good scaling strategy. That is why i was thinking maybe there is a issue with the way things are set up and need to be revisited. Thanks On Sat, Aug 1, 2015 at 7:06 PM, Shawn Heisey wrote: > On 8/1/2015 6:49 PM, Jay Potharaju wrote: > > I currently h

Re: solr multicore vs sharding vs 1 big collection

2015-08-02 Thread Jay Potharaju
d use it. Are there any good references for doing that? Thanks Jay On Sun, Aug 2, 2015 at 10:19 AM, Shawn Heisey wrote: > On 8/2/2015 8:29 AM, Jay Potharaju wrote: > > The document contains around 30 fields and have stored set to true for > > almost 15 of them. And these stored fi

Re: solr multicore vs sharding vs 1 big collection

2015-08-04 Thread Jay Potharaju
57K(5478912K), 0.0956342 secs] 1974116.082: [GC 5206361K->3974011K(5215744K), 0.1967284 secs] Thanks Jay On Mon, Aug 3, 2015 at 1:53 PM, Bill Bell wrote: > Yeah a separate by month or year is good and can really help in this case. > > Bill Bell > Sent from mobile > > &g

Solr packages in Apache BigTop.

2015-03-07 Thread jay vyas
aging project, we build hadoop, spark, solr, hbase and so on in rpm/deb format, and supply puppet provisioners along with vagrant recipse for testing. -- jay vyas

debugging solr query

2016-05-24 Thread Jay Potharaju
all is not being used ..i would expect the complete index to be in memory but it doesnt look like it is. Any recommendations ?? -- Thanks Jay

Re: debugging solr query

2016-05-25 Thread Jay Potharaju
uery provide performance benefits. - Would there be a performance penalty because majority of the documents(!fieldA:abc) dont have values in the new columns? Thanks Jay On Tue, May 24, 2016 at 8:06 PM, Erick Erickson wrote: > Try adding debug=timing, that'll give you an idea of wha

Re: How to perform a contains query

2016-05-25 Thread Jay Potharaju
quot;: >> "java/org/apache/catalina/core/AsyncContextImpl.java in >> : > Apache Tomcat 7.x before 7.0.40 does not properly handle the throwing >> of a >> : > RuntimeException in an AsyncListener in an application, which allows >> : > context-dependent attackers to obtain sensitive request information >> intended >> : > for other applications in opportunistic circumstances via an >> application >> : > that records the requests that it processes.", >> : > >> : > "summary": "Session fixation vulnerability in Apache Tomcat >> 7.x >> : > before 7.0.66, 8.x before 8.0.30, and 9.x before 9.0.0.M2, when >> different >> : > session settings are used for deployments of multiple versions of the >> same >> : > web application, might allow remote attackers to hijack web sessions >> by >> : > leveraging use of a requestedSessionSSL field for an unintended >> request, >> : > related to CoyoteAdapter.java and Request.java.", >> : > >> : > "summary": "The (1) Manager and (2) Host Manager applications >> in >> : > Apache Tomcat 7.x before 7.0.68, 8.x before 8.0.31, and 9.x before >> 9.0.0.M2 >> : > establish sessions and send CSRF tokens for arbitrary new requests, >> which >> : > allows remote attackers to bypass a CSRF protection mechanism by >> using a >> : > token.", >> : > >> : > "summary": "The setGlobalContext method in >> : > org/apache/naming/factory/ResourceLinkFactory.java in Apache Tomcat >> 7.x >> : > before 7.0.68, 8.x before 8.0.31, and 9.x before 9.0.0.M3 does not >> consider >> : > whether ResourceLinkFactory.setGlobalContext callers are authorized, >> which >> : > allows remote authenticated users to bypass intended SecurityManager >> : > restrictions and read or write to arbitrary application data, or >> cause a >> : > denial of service (application disruption), via a web application >> that sets >> : > a crafted global context.", >> : >> : >> >> -Hoss >> http://www.lucidworks.com/ >> > > -- Thanks Jay Potharaju

Re: How to save index data to other place? [scottchu]

2016-05-25 Thread Jay Potharaju
l/solr-5.4.1/server/solr/cugna. > Index data is also saved there. But wish to save index data on other > folder, say /var/sc_data/cugna. How can I dothis? > > scott.chu,scott@udngroup.com > 2016/5/26 (週四) > -- Thanks Jay Potharaju

Re: debugging solr query

2016-05-25 Thread Jay Potharaju
Any links that illustrate and talk about solr internals and how indexing/querying works would be a great help. Thanks Jay On Wed, May 25, 2016 at 6:30 PM, Jay Potharaju wrote: > Hi, > Thanks for the feedback. The queries I run are very basic filter queries > with some sorting. > >

Re: debugging solr query

2016-05-26 Thread Jay Potharaju
uery provide performance benefits. - Would there be a performance penalty because majority of the documents(!fieldA:abc) dont have values in the new columns? Thanks On Wed, May 25, 2016 at 8:40 PM, Jay Potharaju wrote: > Any links that illustrate and talk about solr internals and

Re: debugging solr query

2016-05-27 Thread Jay Potharaju
suggestions on how handle filtering/querying/sorting on high cardinality date fields? Index size: 30Million Solr: 4.3.1 Thanks On Thu, May 26, 2016 at 6:04 AM, Jay Potharaju wrote: > Hi, > Thanks for the feedback. The queries I run are very basic filter queries > with some sorting. > >

Re: debugging solr query

2016-05-27 Thread Jay Potharaju
wrote: > Hi Jay, > > Please separate the clauses. Feed one of them to the main q parameter with > content score operator =^ since you are sorting on a structured field(e.g. > date) > > q:fieldB:(123 OR 456)^=1.0 > &fq=dt1:[date1 TO *] > &fq=dt2:[* TO NOW/DAY+1] >

Slow date filter query

2016-05-27 Thread Jay Potharaju
alleviate the problem because of high cardinality. Can I store the date as MMDD and run range queries on them instead of date fields? Is that a good option? -- Thanks Jay

Re: Slow date filter query

2016-05-30 Thread Jay Potharaju
> That does seem long, but you haven't provided many details > > about the fields. Are there 100 docs in your index? 100M docs? 500M docs? > > > > Are you using NOW in appropriately? See: > > > > On Fri, May 27, 2016 at 1:32 PM, Jay Potharaju > wrote: > &g

result grouping in sharded index

2016-06-13 Thread Jay Potharaju
+Grouping#ResultGrouping-DistributedResultGroupingCaveats -- Thanks Jay

Re: result grouping in sharded index

2016-06-14 Thread Jay Potharaju
Any suggestions on how to handle result grouping in sharded index? On Mon, Jun 13, 2016 at 1:15 PM, Jay Potharaju wrote: > Hi, > I am working on a functionality that would require me to group documents > by a id field. I read that the ngroups feature would not work in a sharded >

Re: result grouping in sharded index

2016-06-15 Thread Jay Potharaju
ction. " On Wed, Jun 15, 2016 at 3:03 AM, Tom Evans wrote: > Do you have to group, or can you collapse instead? > > > https://cwiki.apache.org/confluence/display/solr/Collapse+and+Expand+Results > > Cheers > > Tom > > On Tue, Jun 14, 2016 at 4:57 PM, Jay Pothar

Sorting & searching on the same field

2016-06-23 Thread Jay Potharaju
Hi, I would like to have 1 field that can used for both searching and case insensitive sorting. As far as i know the only way to do is to have two fields one for searching (text_en) and one for sorting(lowercase & string). Any ideas how the two can be combined into 1 field. -- Thanks

Re: Sorting & searching on the same field

2016-06-23 Thread Jay Potharaju
yes, that is what i thought. but was checking to see if there was something I was missing. Thanks On Thu, Jun 23, 2016 at 12:55 PM, Ahmet Arslan wrote: > Hi Jay, > > I don't think it can be combined. > Mainly because: searching requires a tokenized field. > Sorting req

Slow facet range performance

2016-06-23 Thread Jay Potharaju
al.set=[NOW-7DAY,NOW]&f.date_field.facet.interval.set=[NOW-30DAY,NOW-7DAY]&f.date_field.facet.interval.set=[NOW-1MONTH,NOW-7DAY]&f.date_field.facet.interval.set=[NOW-1YEAR,NOW-1MONTH] Any suggestions on how to speed this up? -- Thanks Jay

clarification on using docvalues for sorting

2016-06-23 Thread Jay Potharaju
ting & faceting. How can i accomplish using docvalues for case-insensitive field types.? Or what I am trying to do is not possible. -- Thanks Jay

Re: Sorting & searching on the same field

2016-06-23 Thread Jay Potharaju
Alex > On 24 Jun 2016 9:01 AM, "Jay Potharaju" wrote: > > > yes, that is what i thought. but was checking to see if there was > something > > I was missing. > > Thanks > > > > On Thu, Jun 23, 2016 at 12:55 PM, Ahmet Arslan > > > wrote

Re: Sorting & searching on the same field

2016-06-24 Thread Jay Potharaju
Thanks Alex, I will check this out. Is it possible to do something at query time , using a function query to lowercase the field and then sort on it.? Jay > On Jun 24, 2016, at 12:03 AM, Alexandre Rafalovitch > wrote: > > Keep voting for SOLR-8362? > > You could do you

json facet - date range & interval

2016-06-27 Thread Jay Potharaju
", gap : "+1DAY" } } msg": "Can't add gap 1DAY to value Fri Jun 17 15:49:36 UTC 2016 for field: datefield", "code": 400 -- Thanks Jay

Re: json facet - date range & interval

2016-06-28 Thread Jay Potharaju
json.facet={daterange : {type : range, field : datefield, start : "NOW/DAY-10DAYS", end : "NOW/DAY",gap:"\+1DAY"} } Escaping the plus sign also gives the same error. Any other suggestions how can i make this work? Thanks Jay On Mon, Jun 27, 2016 at 10:23 PM, Erick E

Re: json facet - date range & interval

2016-06-28 Thread Jay Potharaju
start : "NOW/DAY%2D10DAYS", > end : "NOW/DAY", > gap : "%2B1DAY" > > } > } > > > On 06/28/2016 01:19 PM, Jay Potharaju wrote: > >> json.facet={daterange : {type : range, field : datefield, start : >> &q

Re: Integrating Stanford NLP or any other NLP for Natural Language Query

2016-07-07 Thread Jay Urbain
I use Stanford NLP and cTakes (based on OpenNLP) while indexing with a SOLRJ application. Best, Jay On Thu, Jul 7, 2016 at 12:09 PM, Puneet Pawaia wrote: > Hi > > I am currently using Solr 5.5.x to test but can upgrade to Solr 6.x if > required. > I am working on a POC for n

Re: Integrating Stanford NLP or any other NLP for Natural Language Query

2016-07-08 Thread Jay Urbain
NLP: http://stanfordnlp.github.io/CoreNLP/ Best, Jay On Thu, Jul 7, 2016 at 9:52 PM, Puneet Pawaia wrote: > Hi Jay > Any place I can learn more on this method of integration? > Thanks > Puneet > > On 8 Jul 2016 02:58, "Jay Urbain" wrote: > > > I use Stanford NLP and

RE: [Ext] Influence ranking based on document committed date

2016-08-17 Thread Jay Parashar
This is correct: " I index it and feed it the timestamp at index time". You can sort desc on that field (can be a TrieDateField) -Original Message- From: Steven White [mailto:swhite4...@gmail.com] Sent: Wednesday, August 17, 2016 9:01 AM To: solr-user@lucene.apache.org Subject: [Ext] Inf

Solr on GCE

2016-09-22 Thread Jay Parashar
ngine-agentruntime.jar:na] Thanks Jay

SolrJ App Engine Client

2016-09-22 Thread Jay Parashar
I sent a similar message earlier but do not see it. Apologize if its duplicated. I am unable to connect to Solr Cloud zkhost (using CloudSolrClient) from a SolrJ client running on Google App Engine. The error message is "java.nio.channels.SocketChannel is a restricted class. Please see the Googl

RE: [Ext] Re: SolrJ App Engine Client

2016-09-22 Thread Jay Parashar
2D&d=CwIBaQ&c=uGuXJ > 43KPkPWEl2imVFDmZQlhQUET7pVRA2PDIOxgqw&r=bRfqJEeedEKG5nkp5748YxbNMFrUY > T3YiNl0Ni2vUBQ&m=HDJS4ElFF2X939U2LWfIfRIdBJNLvm9q4mvpNmZp7kU&s=kGg4rdS > 7eJoNjVzljzxek-nIUeMnjxRhjETSDJzdaXY&e= > when-trying-to-run-my-project > > > On Thu, Sep 22

RE: [Ext] Re: SolrJ App Engine Client

2016-09-22 Thread Jay Parashar
nkp5748YxbNMFrUYT3YiNl0Ni2vUBQ&m=FjaUoU-i-tiL8deMoKceLKxX-kgXBObYvgMAjZnac8A&s=EkfJOFmbVi4fwdp1mBAnpIXC1XHnT8_eN6Jsz1PvDhw&e= On Thu, Sep 22, 2016 at 1:38 PM, Jay Parashar wrote: > I sent a similar message earlier but do not see it. Apologize if its > duplicated. > > I am unab

Re: SolrJ App Engine Client

2016-09-22 Thread Jay Parashar
2D&d=CwIBaQ&c=uGuXJ > 43KPkPWEl2imVFDmZQlhQUET7pVRA2PDIOxgqw&r=bRfqJEeedEKG5nkp5748YxbNMFrUY > T3YiNl0Ni2vUBQ&m=HDJS4ElFF2X939U2LWfIfRIdBJNLvm9q4mvpNmZp7kU&s=kGg4rdS > 7eJoNjVzljzxek-nIUeMnjxRhjETSDJzdaXY&e= > when-trying-to-run-my-project > > > On Thu, Sep 22

RE: SolrJ App Engine Client

2016-09-22 Thread Jay Parashar
nkp5748YxbNMFrUYT3YiNl0Ni2vUBQ&m=FjaUoU-i-tiL8deMoKceLKxX-kgXBObYvgMAjZnac8A&s=EkfJOFmbVi4fwdp1mBAnpIXC1XHnT8_eN6Jsz1PvDhw&e= On Thu, Sep 22, 2016 at 1:38 PM, Jay Parashar wrote: > I sent a similar message earlier but do not see it. Apologize if its > duplicated. > > I am unabl

solrcloud load balancing

2016-10-22 Thread Jay Potharaju
is minimal and can be ignored) 3) In my solr logs i display the "slow" queries, is the qtime displayed takes all of the above and shows the correct time taken. Solr version: 5.5.0 -- Thanks Jay

Re: solrcloud load balancing

2016-10-22 Thread Jay Potharaju
shard. Can you please point me to some code/documenation that can help me understand this better. Thanks Jay On Sat, Oct 22, 2016 at 6:00 PM, Erick Erickson wrote: > 1) Single shards have some short circuiting in them. And anyway it's > best to have some kind of load balancer in front o

Re: solrcloud load balancing

2016-10-22 Thread Jay Potharaju
I am not able to see any high response time in the solr logs. Is it possible that the under high load it takes a long time to retrieve and send the documents? If i run the same query in browser individually it comes back in quick time. Thanks Jay On Sat, Oct 22, 2016 at 6:14 PM, Shawn Heisey w

book on solr

2017-10-12 Thread Jay Potharaju
scaling would also be great. Thanks Jay Potharaju

Managed schema used with Cloudera MapreduceIndexerTool and morphlines?

2017-03-17 Thread Jay Hill
I've got a very difficult project to tackle. I've been tasked with using schemaless mode to index json files that we receive. The structure of the json files will always be very different as we're receiving files from different customers totally unrelated to one another. We are attempting to build

Best practices for backup & restore

2017-05-16 Thread Jay Potharaju
.org/confluence/display/solr/Making+and+Restoring+Backups Any recommendations ? -- Thanks Jay

search multiple cores

2014-05-13 Thread Jay Potharaju
Hi, I am trying to join across multiple cores using query time join. Following is my setup 3 cores - Solr 4.7 core1: 0.5 million documents core2: 4 million documents and growing. This contains the child documents for documents in core1. core3: 2 million documents and growing. Contains records from

highlighting on hl.alternateField (copyField target) doesnt highlight

2014-06-03 Thread jay list
Hello,   im trying to implement a user friendly search for phone numbers. These numbers consist out of two digit-tokens like "12345 67890".   Finally I want the highlighting for the phone number in the search result, without any concerns about was this search result hit by field  tel  or copyFi

highlighting on hl.alternateField (copyField target) doesnt highlight

2014-06-03 Thread jay list
Hello,   im trying to implement a user friendly search for phone numbers. These numbers consist out of two digit-tokens like "12345 67890".   Finally I want the highlighting for the phone number in the search result, without any concerns about was this search result hit by field  tel  or copyFi

Fw: highlighting on hl.alternateField (copyField target) doesnt highlight

2014-06-05 Thread jay list
Anybody knowing this issue? > Gesendet: Dienstag, 03. Juni 2014 um 09:11 Uhr > Von: "jay list" > An: solr-user@lucene.apache.org > Betreff: highlighting on hl.alternateField (copyField target) doesnt highlight > > > Hello, >   > im trying to implement a u

  1   2   3   4   5   >