Re: ngrams with position

2016-03-08 Thread Emir Arnautovic
Hi Elisabeth, I don't think there is such token filter, so you would have to create your own token filter that takes token and emits ngram token of specific length. It should not be too hard to create such filter - you can take a look how nagram filter is coded - yours should be simpler than th

Re: ngrams with position

2016-03-08 Thread elisabeth benoit
Thanks for your answer Emir, I'll check that out. Best regards, Elisabeth 2016-03-08 10:24 GMT+01:00 Emir Arnautovic : > Hi Elisabeth, > I don't think there is such token filter, so you would have to create your > own token filter that takes token and emits ngram token of specific length. > It

solr simple query searh

2016-03-08 Thread Mugeesh Husain
Hello, I am implementing simple search demo. I have a field abc, insert "iphone" in abc. if i will search "iphone" then it will give result,if i will search i phone, then result will not populated. in analyzer i should implement for this case. input==abc:iphone search query= i phone -- View

Re: Stopping Solr JVM on OOM

2016-03-08 Thread Binoy Dalal
Hi Shawn, I've just finished writing a batch oom killer script and it seems to work fine. I couldn't try it on the actual solr process since I'm a bit stumped on how I can make solr throw an oom at will. Although I did write another code that does throw an oom upon which this script is called and

Re: solr simple query searh

2016-03-08 Thread John Blythe
what does your current analyzer look like? -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Tue, Mar 8, 2016 at 6:42 AM, Mugeesh Husain wrote: > Hello, > > I am implementing simple search demo. > I have

Re: Stopping Solr JVM on OOM

2016-03-08 Thread Shawn Heisey
On 3/8/2016 5:13 AM, Binoy Dalal wrote: > I've just finished writing a batch oom killer script and it seems to work > fine. > > I couldn't try it on the actual solr process since I'm a bit stumped on how > I can make solr throw an oom at will. > Although I did write another code that does throw an

Re: High Cpu sys usage

2016-03-08 Thread YouPeng Yang
Hi all Thanks for your reply.I do some investigation for much time.and I will post some logs of the 'top' and IO in a few days when the crash come again. 2016-03-08 10:45 GMT+08:00 Shawn Heisey : > On 3/7/2016 2:23 AM, Toke Eskildsen wrote: > > How does this relate to YouPeng reporting that the

Different scores depending on cloud node

2016-03-08 Thread Robert Brown
Hi, I have 2 shards, each with 1 replica. When sending the same request to the cluster, I'm seeing the same results, but ordered differently, and with different scores. Does this highlight an issue with my index, or is this an accepted anomaly? Example of 8 results: 1st call: 160.2047 160.

Re: Different scores depending on cloud node

2016-03-08 Thread Shawn Heisey
On 3/8/2016 6:56 AM, Robert Brown wrote: > I have 2 shards, each with 1 replica. > > When sending the same request to the cluster, I'm seeing the same > results, but ordered differently, and with different scores. > > Does this highlight an issue with my index, or is this an accepted > anomaly? So

IllegalArgumentException: Seeking to negative position

2016-03-08 Thread Yago Riveiro
I saw this exception in my log. What can caused this? java.lang.IllegalArgumentException: Seeking to negative position: MMapIndexInput(path="/opt/solr/node/collections/2016_shard9_replica2/data/index/_0.fdx") at org.apache.lucene.store.ByteBufferIndexInput$SingleBufferImpl.seek(ByteBufferI

RE: Different scores depending on cloud node

2016-03-08 Thread Markus Jelsma
Hi - see inline. Markus -Original message- > From:Shawn Heisey > Sent: Tuesday 8th March 2016 15:11 > To: solr-user@lucene.apache.org > Subject: Re: Different scores depending on cloud node > > On 3/8/2016 6:56 AM, Robert Brown wrote: > > I have 2 shards, each with 1 replica. > > > > Whe

Re: Solrcloud Batch Indexing

2016-03-08 Thread Cassandra Targett
There is an open source Hive -> Solr SerDe available that might be worth checking out: https://github.com/lucidworks/hive-solr. I'm not sure how it would work with the source table being rebuilt every day since it uses Hive's external tables, but it might be something you could extend. On Mon, Mar

RE: Multiple custom Similarity implementations

2016-03-08 Thread Markus Jelsma
Hello, you can not change similarities per request, and this is likely never going to be supported for good reasons. You need multiple cores, or multiple fields with different similarity defined in the same core. Markus -Original message- > From:Parvesh Garg > Sent: Tuesday 8th March 2

Re: Warning and Error messages in Solr's log

2016-03-08 Thread Steven White
Re-posting. Anyone has any idea about this question? Thanks. Steve On Mon, Mar 7, 2016 at 5:15 PM, Steven White wrote: > Hi folks, > > In Solr's solr-8983-console.log I see the following (about 50 in a span of > 24 hours when index is on going): > > WARNING: Couldn't flush user prefs: > j

Re: ngrams with position

2016-03-08 Thread Alessandro Benedetti
Elizabeth, out of curiousity, could we know what you are trying to solve with that complex way of tokenisation ? Solr is really good in storing positions along with token, so I am curious to know why your are mixing the things up. Cheers On 8 March 2016 at 10:08, elisabeth benoit wrote: > Thank

Re: Indexing Twitter - Hypothetical

2016-03-08 Thread Joseph Obernberger
Thank you for the links and explanation. We are using GATE (General Architecture for Text Engineering) and parts of the Stanford NER/Parser for the data that we ingest, but we do not apply it to the queries - only the data. We've been concentrating on the back-end, and analytics, not so much what

Failed to set SSL solr 5.2.1 Windows OS

2016-03-08 Thread Ilan Schwarts
Hi all, I am trying to integrate solr with SSL on Windows 7 OS I followed the enable ssl guide at https://cwiki.apache.org/confluence/display/solr/Enabling+SSL I created the keystore and placed in on etc folder. I un-commented the lines and set: SOLR_SSL_KEY_STORE=C:\solr-5.2.1\server\etc\solr-ssl

Re: Indexing Twitter - Hypothetical

2016-03-08 Thread Jack Krupansky
You have my permission... and blessing... and... condolences! BTW, our usual recommendation is to do a subset proof of concept to see how all the pieces come together and then calculate the scaling from there. IOW, go ahead and index a day, a week, a month from the firehose and see how many nodes,

Can termfreq count stemmed forms of terms?

2016-03-08 Thread Aki Balogh
Hi All, We're using solr termfreq to count raw term frequencies (i.e. the tf in tf-idf). This works fine on a regular text field. However, we have a field where we've added snowball stemmer. Should termfreq also work on a stemmed field? Right now, we're only getting data back on terms where th

Re: Can termfreq count stemmed forms of terms?

2016-03-08 Thread Aki Balogh
Doh! I think I had answered my own question back last year: http://qnalist.com/questions/6147365/term-frequency-with-stemming *The only trick is, each term in a phrase has to be stemmed separately (i.e."end-user experience" has to be broken down into "end-user" -> "end-us" and"experience" -> "

Re: XJoin, a way to use external data sources with Solr

2016-03-08 Thread Zisis Tachtsidis
Hi Charlie, This looks like an interesting feature, but I have a couple of questions before giving it a try. I had similar needs - filtering results based on information outside of the queried Solr collection - and I went down the post-filtering path. More specifically I've implemented a *PostF

Duplicate Document IDs when updateing parent document with child document

2016-03-08 Thread Sebastian Riemer
Hi, I have created a simple Java application which illustrates this issue. I am using Solr-Version 5.5.0 and SolrJ. Here is a link to the github repository: https://github.com/sebastianriemer/SolrDuplicateTest The issue I am facing is also described by another person on stackoverflow: http://

Re: Stopping Solr JVM on OOM

2016-03-08 Thread Binoy Dalal
Hi Shawn, The JIRA issue is SOLR-8803 (https://issues.apache.org/jira/browse/SOLR-8803 ). I've used "git diff" and created a patch but it only has the changes that I made to the solr.cmd file under bin to add the -XX:OnOutOfMemoryError option. There's the entire file of the actual OOM kill script t

Re: Stopping Solr JVM on OOM

2016-03-08 Thread Binoy Dalal
I've uploaded both files. Please review and advise. On Wed, Mar 9, 2016 at 12:46 AM Binoy Dalal wrote: > Hi Shawn, > The JIRA issue is SOLR-8803 ( > https://issues.apache.org/jira/browse/SOLR-8803). > I've used "git diff" and created a patch but it only has the changes that > I made to the solr.

Re: Warning and Error messages in Solr's log

2016-03-08 Thread Shawn Heisey
On 3/7/2016 3:15 PM, Steven White wrote: > In Solr's solr-8983-console.log I see the following (about 50 in a span of > 24 hours when index is on going): > > WARNING: Couldn't flush user prefs: > java.util.prefs.BackingStoreException: Couldn't get file lock. This is not directly related to Sol

Re: Failed to set SSL solr 5.2.1 Windows OS

2016-03-08 Thread Steve Rowe
Hi Ilan, Looks like you’re modifying solr.in.sh instead of solr.in.cmd? FYI running under Cygwin is not supported. -- Steve www.lucidworks.com > On Mar 8, 2016, at 11:51 AM, Ilan Schwarts wrote: > > Hi all, I am trying to integrate solr with SSL on Windows 7 OS > I followed the enable ssl gui

Re: Failed to set SSL solr 5.2.1 Windows OS

2016-03-08 Thread Ilan Schwarts
Hi, thanks for reply. I am using solr.in.cmd I even put some pause in the cmd with echo to see the parameters are ok.. This is the original file as found in https://www.apache.org/dist/lucene/solr/5.2.1/solr-5.2.1.zip [image: Inline image 1] On Tue, Mar 8, 2016 at 10:25 PM, Steve Rowe wrote: >

Re: Failed to set SSL solr 5.2.1 Windows OS

2016-03-08 Thread Steve Rowe
Hmm, not sure what’s happening. Have you tried converting the backslashes in your paths to forward slashes? -- Steve www.lucidworks.com > On Mar 8, 2016, at 3:39 PM, Ilan Schwarts wrote: > > Hi, thanks for reply. > I am using solr.in.cmd > I even put some pause in the cmd with echo to see the

Re: DataImportHandler - Automatic scheduling of delta imports in Solr in windows 7

2016-03-08 Thread B Weber
harshrossi gmail.com> writes: > > I am using *DeltaImportHandler* for indexing data in Solr. Currently I am > manually indexing the data into Solr by selecting commands full-import or > delta-import from the Solr Admin screen. > > I am using Windows 7 and would like to automate the process by

Retrieving of Field Type

2016-03-08 Thread Zheng Lin Edwin Yeo
Hi, Is there any way that we can retrieve the field type of a field, either by using SolrJ or by using URL? The field type that we assigned in schema.xml, like int, float, tdate.. Would like to see if it is possible to retrieve it without going back to the schema.xml. I'm using Solr 5.4.0 Regards

Re: Retrieving of Field Type

2016-03-08 Thread Alexandre Rafalovitch
The Admin UI does and it uses Javascript. So you know it is possible. Admin UI uses Luke for technical-level info: http://localhost:8983/solr/techproducts/admin/luke You can use Schema API for slightly better one: http://localhost:8983/solr/techproducts/schema You can also use Schema API to get ju

Re: Retrieving of Field Type

2016-03-08 Thread Zheng Lin Edwin Yeo
Hi Alex, Thanks for the information. That was helpful. Regards, Edwin On 9 March 2016 at 13:31, Alexandre Rafalovitch wrote: > The Admin UI does and it uses Javascript. So you know it is possible. > > Admin UI uses Luke for technical-level info: > http://localhost:8983/solr/techproducts/admin/

Disable hyper-threading for better Solr performance?

2016-03-08 Thread Avner Levy
I have a machine with 16 real cores (32 with HT enabled). I'm running on it a Solr server and trying to reach maximum performance for indexing and queries (indexing 20k documents/sec by a number of threads). I've read on multiple places that in some scenarios / products disabling the hyper-thread

Query behavior.

2016-03-08 Thread Modassar Ather
Hi, Kindly help me understand the parsing of following query. I am using edismax parser and Solr-5.5.0. q.op is set to AND and there is no explicit mm value set. fl:(java OR book) => "boost(+((fl:java fl:book)~2),int(val))" When the query has explicit OR then why the ~2 is present in the parsed

Re: Disable hyper-threading for better Solr performance?

2016-03-08 Thread Ilan Schwarts
What is the solr version and shard config? Standalone? Multiple cores? Spread over RAID ? On Mar 9, 2016 9:00 AM, "Avner Levy" wrote: > I have a machine with 16 real cores (32 with HT enabled). > I'm running on it a Solr server and trying to reach maximum performance > for indexing and queries (i

Re: Failed to set SSL solr 5.2.1 Windows OS

2016-03-08 Thread Ilan Schwarts
How would one try to solve this issue? What would you suggest me to do? Debug that module? I will try only to install clean jetty with ssl first. Another question. The files jetty.xml\jetty-ssl.xml and the rest of files in /etc are being used in solr 5.2.1? On Mar 9, 2016 12:08 AM, "Steve Rowe" w