more like this generated query

2015-04-27 Thread alxsss
Hello, I am using solr-4.10.4 with mlt. I noticed that mlt constructs a query that is missing some words. For example, for a doc with title: Jennnifer Lopez keywords: Jennifer, concert, Hollywood the parsedquery generated by mlt for this doc is title:lopez keywords:jennifer keywords:concert

Re: snapinstaller does not start newSearcher

2015-03-04 Thread alxsss
I have used the snapshotter API and modified the snapinstaller script so that it successfully grabs the snapshot folder and updates the index folder on the slave. However, it fails to open a newSearcher. It simply sends a commit command to the slave, but the hasUncommittedChanges function returns false. That is the reas

Re: snapinstaller does not start newSearcher

2015-02-24 Thread alxsss
Hello, We cannot use replication with the current architecture, so decided to use snapshotter with snapinstaller. Here is the full stack trace 8937 [coreLoadExecutor-5-thread-3] INFO org.apache.solr.core.CachingDirectoryFactory – Closing directory: /home/solr/solr-4.10.1/solr/example/solr/p

snapinstaller does not start newSearcher

2015-02-23 Thread alxsss
Hello, I am using the latest solr (solr trunk). I run snapinstaller and see that it copies the snapshot to the index folder, but the changes are not picked up. The logs on the slave after running snapinstaller are 44302 [qtp1312571113-14] INFO org.apache.solr.update.UpdateHandler – start commit{,optimize=false

custom sorting of search result

2014-11-03 Thread alxsss
Hello, We need to order solr search results according to specific rules. I will explain with an example. Let's say solr returns 1000 results for the query "sport". These results must be divided into three buckets according to rules that come from a database. Then one doc must be chosen from each b

Re: Incorrect group.ngroups value

2014-08-25 Thread alxsss
Hi, From the discussion it is not clear if this is a fixable bug in the case of documents being in different shards. If this is fixable could someone please direct me to the part of the code so that I could investigate. Thanks. Alex. -Original Message- From: Andrew Shumway To

regexTransformer returns no results if there is no match

2014-08-11 Thread alxsss
Hello, I try to construct wikipedia page url from page title using regexTransformer with This does not work for titles that have no space, so title_underscore for them is empty. Any ideas what is wrong here? This is with solr-4.8.1 Thanks. Alex.
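
The intended mapping can be sketched outside Solr. This is a minimal Python illustration, not the regexTransformer configuration from the message (which did not survive in the archive). The function name wikipedia_url and the English Wikipedia base URL are assumptions. It only shows the desired result: a title with no space should pass through unchanged instead of coming out empty.

```python
import re
from urllib.parse import quote

# Assumed base URL; the original message does not say which wiki is targeted.
WIKI_BASE = "https://en.wikipedia.org/wiki/"


def wikipedia_url(title: str) -> str:
    """Build a page URL by replacing runs of whitespace with underscores."""
    # re.sub returns its input unchanged when the pattern does not match,
    # so single-word titles are kept rather than becoming empty.
    title_underscore = re.sub(r"\s+", "_", title.strip())
    # Percent-encode characters that are not safe in a URL path.
    return WIKI_BASE + quote(title_underscore)


print(wikipedia_url("Jennifer Lopez"))
print(wikipedia_url("Hollywood"))
```

If the transformer in use leaves the target field empty when the regex does not match, one possible workaround (again an assumption, not something confirmed in the thread) is to apply the substitution in a step that always emits a value, as the function above does.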

Re: group.ngroups is set to an incorrect value - specific field types

2014-06-17 Thread alxsss
Hi, I see a similar problem in our solr application. Sometimes it reports the number of groups as the number of all documents. This started to happen after the upgrade from 4.6.1 to 4.8.1. Thanks. Alex. -Original Message- From: 海老澤 志信 To: solr-user Sent: Tue, Jun 17, 2014 5:24 am Subject: RE: gr

Re: how do I get search for "fort st john" to match "ft saint john"

2014-04-01 Thread alxsss
It seems to me that, you are missing this line under Alex. -Original Message- From: solr-user To: solr-user Sent: Tue, Apr 1, 2014 5:01 pm Subject: Re: how do I get search for "fort st john" to match "ft saint john" Hi Eric. Sorry, been away. The city_index_synonyms

spellcheck in solr-4.6-1 distrib=true

2014-03-31 Thread alxsss
Hello, For queries in solrcloud and in distributed mode, solr-4.6.1 spellcheck does not return any suggestions, but it does in non-distributed mode. Is this a known bug? Thanks. Alex.

Re: change character correspondence in icu lib

2014-02-13 Thread alxsss
I found out that the generated files are the same. I think this is because of these lines inside the build file: Note that the gennorm2 and icupkg tools must be on your PATH. These tools are part of the ICU4C package. See http://site.icu-

change character correspondence in icu lib

2014-02-12 Thread alxsss
Hello, I use icu4j-49.1.jar, lucene-analyzers-icu-4.6-SNAPSHOT.jar for one of the fields in the form I need to change one of the accent char's corresponding letter. I made changes to this file lucene/analysis/icu/src/data/utr30/DiacriticFolding.txt recompiled solr and lucene and replaced t

Re: additional requests sent to solr

2013-08-11 Thread alxsss
Hi, Could someone please confirm whether this is expected or a bug in SOLR? In short, I see three log entries in SOLR for one request http://server1:8983/solr/mycollection/select?q=alex&wt=xml&defType=edismax&facet.field=school&facet.field=company&facet=true&facet.limit=10&facet.mincount=1&qf=schoo

Re: additional requests sent to solr

2013-08-05 Thread alxsss
ny__terms have all values from company facet field. > > I have added terms=false to the original query sent to solr, but it did > not help. > > Does anyone has any idea how to suppress these queries. > > Thanks. > Alex. > > > > > > > > > -Original

Re: additional requests sent to solr

2013-08-04 Thread alxsss
the original query sent to solr, but it did not help. Does anyone has any idea how to suppress these queries. Thanks. Alex. -Original Message- From: alxsss To: solr-user Sent: Fri, Jul 19, 2013 5:00 am Subject: additional requests sent to solr Hello, I send to solr( to ser

additional requests sent to solr

2013-07-18 Thread alxsss
Hello, I send to solr (to server1 in a cluster of two servers) the following request http://server1:8983/solr/mycollection/select?q=alex&wt=xml&defType=edismax&facet.field=school&facet.field=company&facet=true&facet.limit=10&facet.mincount=1&qf=school_txt+company_txt+name&shards=server1:8983/sol

Re: document id in nutch/solr

2013-06-24 Thread alxsss
Another way of overriding nutch fields is to modify solrindex-mapping.xml file. hth Alex. -Original Message- From: Jack Krupansky To: solr-user Sent: Sun, Jun 23, 2013 12:04 pm Subject: Re: document id in nutch/solr Add the "passthrough" dynamic field to your Solr schema, and

whole index in memory

2013-05-31 Thread alxsss
Hello, I have a solr index of size 5GB. I am thinking of increasing cache size to 5 GB, expecting Solr will put whole index into memory. 1. Will Solr indeed put whole index into memory? 2. What are drawbacks of this approach? Thanks in advance. Alex.

Re: EdgeGram filter

2013-04-23 Thread alxsss
Hi, I did not find any descriptions, except constructor and method names. Thanks. Alex. -Original Message- From: Markus Jelsma To: solr-user Sent: Tue, Apr 23, 2013 12:08 pm Subject: RE: EdgeGram filter Always check the javadocs. There's a lot of info to be found there: http

Re: EdgeGram filter

2013-04-23 Thread alxsss
Hi, I was unable to find more info about LimitTokenCountFilterFactory in solr wiki. Is there any other place to get thorough description of what it does? Thanks. Alex. -Original Message- From: Jack Krupansky To: solr-user Sent: Tue, Apr 23, 2013 11:36 am Subject: Re: EdgeGr

Re: solr-cloud performance decrease day by day

2013-04-19 Thread alxsss
How many segments each shard has and what is the reason of running multiple shards in one machine? Alex. -Original Message- From: qibaoyuan To: solr-user Sent: Fri, Apr 19, 2013 12:26 am Subject: Re: solr-cloud performance decrease day by day there are 6 shards and they are

Re: Spellchecker not working for Solr 4.1

2013-04-11 Thread alxsss
inside your request handler try to put spellcheck true and name of the spellcheck dictionary hth Alex. -Original Message- From: davers To: solr-user Sent: Thu, Apr 11, 2013 6:24 pm Subject: Spellchecker not working for Solr 4.1 This is almost the same exact setup I was usin

Re: Query slow with termVectors termPositions termOffsets

2013-03-25 Thread alxsss
Did index size increase after turning on termPositions and termOffsets? Thanks. Alex. -Original Message- From: Ravi Solr To: solr-user Sent: Mon, Mar 25, 2013 8:27 am Subject: Query slow with termVectors termPositions termOffsets Hello, We re-indexed our entire core o

Re: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-22 Thread alxsss
1. group=true, no spellcheck results 2. group=false , there are spellcheck results case when matches>0 1. group=true, there are spellcheck results 2. group =false, there are spellcheck results Do these constitute a failing test case? Thanks. Alex. -Original Message- From:

Re: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-22 Thread alxsss
. group=false , there are spellcheck results case when matches>0 1. group=true, there are spellcheck results 2. group =false, there are spellcheck results Do these constitute a failing test case? Thanks. Alex. -Original Message- From: alxsss To: solr-user Sent: Thu, Mar

Re: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-21 Thread alxsss
Thanks in advance. Alex. -Original Message- From: alxsss To: solr-user Sent: Tue, Mar 19, 2013 11:34 am Subject: Re: strange behaviour of wordbreak spellchecker in solr cloud -- distributed environment. But to nail it down, we probably need to see both -- the app

Re: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-21 Thread alxsss
Hello, We need this feature be fixed ASAP. So, please let me know which class is responsible for combining spellcheck results from all shards. I will try to debug the code. Thanks in advance. Alex. -Original Message- From: alxsss To: solr-user Sent: Tue, Mar 19, 2013 11:34

Re: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-19 Thread alxsss
-- distributed environment. But to nail it down, we probably need to see both -- the applicable Not sure what this is? I have spell direct spell solr.DirectSolrSpellChecker internal 0.5 2 1

Re: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-19 Thread alxsss
Hello, I was testing my custom testhandler. The direct spellchecker was also not working in cloud. After I added spellcheck to the /select requestHandler it worked, except for the wordbreak spellchecker. I have added shards.qt=testhanlder to the curl request but it did not solve the issue. Thanks. A

strange behaviour of wordbreak spellchecker in solr cloud

2013-03-18 Thread alxsss
Hello, I am trying to use the wordbreak spellchecker in solr-4.2 with the cloud feature. We have two servers with one shard on each of them. curl 'server1:8983/solr/test/testhandler?q=paulusoles&indent=true&rows=10' curl 'server2:8983/solr/test/testhandler?q=paulusoles&indent=true&rows=10' does not return any

Re: structure of solr index

2013-03-18 Thread alxsss
---So,"search" time is in no way impacting by the existence or non-existence of stored values, What about memory? Would it be necessary to increase memory in order to have the same QTime as in the case of indexed-only fields? For example, in the case of indexed-only fields the index size is 5GB, a

Re: structure of solr index

2013-03-16 Thread alxsss
Hi, So, will search time be the same for the case when fields are indexed only vs the case when they are indexed and stored? Thanks. Alex. -Original Message- From: Otis Gospodnetic To: solr-user Sent: Fri, Mar 15, 2013 8:09 pm Subject: Re: structure of solr index Hi, I thin

structure of solr index

2013-03-15 Thread alxsss
Hi, I wondered if solr searches on indexed fields only or on entire index? In more detail, let say I have fields id, title and content, all indexed, stored. Will a search send all these fields to memory or only indexed part of these fields? Thanks. Alex.

spellchecker does not have suggestion for keywords typed through a non-whitespace delimiter

2013-03-12 Thread alxsss
Hello, Recently we noticed that solr and its spellchecker do not return results for keywords typed with non-whitespace delimiter. A user accidentally typed u instead of white space. For example, paulusoles instead of paul soles. Solr does not return any results or spellcheck suggestion for key

Re: solr cloud index size is too big

2013-03-04 Thread alxsss
Hi, It is the index folder. tlog is only a few MB. I have analysed all changes and found out that only one field in the schema was changed. This field in non cloud was changed to in cloud to use fastVectorHighlighting. Is it possible that this change could double the index size? Thanks. Alex.

how to override pre and post tags when usefastVectorHighlighter is set to true

2013-02-22 Thread alxsss
Hello, I was unable to change pre and post tags for highlighting when usefastVectorHighlighter is set to true. Changing default tags in solrconfig.xml works for standard highlighter though. I searched mailing list and the net with no success. I use solr-4.1.0. Thanks. Alex.

Re: How do I create two collections on the same cluster?

2013-02-22 Thread alxsss
Hi, What if you add new collection to solr.xml file? Alex. -Original Message- From: Shankar Sundararaju To: solr-user Sent: Thu, Feb 21, 2013 8:51 pm Subject: How do I create two collections on the same cluster? I am using Solr 4.1. I created collection1 consisting of 2 lea

Re: long QTime for big index

2013-02-14 Thread alxsss
Hi, I am curious to know how many linux boxes you have and how many cores are in each of them. It was my understanding that solr puts into memory all documents found for a keyword, not the whole index. So why must it be faster with more cores, when the number of selected documents from many sepa

Re: Pause and resume indexing on SolR 4 for backups

2012-12-20 Thread alxsss
Depending on your architecture, why not index the same data into two machines? One will be your prod another your backup? Thanks. Alex. -Original Message- From: Upayavira To: solr-user Sent: Thu, Dec 20, 2012 11:51 am Subject: Re: Pause and resume indexing on SolR 4 for backup

Re: Grouping performance problem

2012-07-16 Thread alxsss
This is strange. We have a data folder of 24GB and 2GB of RAM for java. We query with grouping, ngroups and highlighting, do not query all fields, and query time is mostly less than 1 sec; it rarely goes up to 2 sec. We use solr 3.6 and turned off all kinds of caching. Maybe your problem is with caching

Re: Grouping performance problem

2012-07-16 Thread alxsss
What are the RAM of your server and size of the data folder? -Original Message- From: Agnieszka Kukałowicz To: solr-user Sent: Mon, Jul 16, 2012 6:16 am Subject: Re: Grouping performance problem Hi Pavel, I tried with group.ngroups=false but didn't notice a big improvement. The time

Re: Grouping performance problem

2012-07-16 Thread alxsss

Re: Broken pipe error

2012-07-03 Thread alxsss
I had the same problem with jetty. It turned out that broken pipe happens when application disconnects from jetty. In my case I was using php client and it had 10 sec restriction in curl request. When solr takes more than 10 sec to respond, curl automatically disconnected from jetty. Hope this

Re: Removing old documents

2012-05-02 Thread alxsss
I use jetty that comes with solr. I use solr's dedupe true id true url solr.processor.Lookup3Signature and because of this id is not url itself but its encoded signature. I see solrclean uses url to delete

Re: Removing old documents

2012-05-01 Thread alxsss
all caching is disabled and I restarted jetty. The same results. Thanks. Alex. -Original Message- From: Lance Norskog To: solr-user Sent: Tue, May 1, 2012 2:57 pm Subject: Re: Removing old documents Maybe this is the HTTP caching feature? Solr comes with HTTP caching turned on

Re: Removing old documents

2012-05-01 Thread alxsss
Hello, I did bin/nutch solrclean crawl/crawldb http://127.0.0.1:8983/solr/ without and with -noCommit and restarted solr server Log shows that 5 documents were removed but they are still in the search results. Is this a bug or something is missing? I use nutch-1.4 and solr 3.5 Thanks. Alex.

Re: term frequency outweighs exact phrase match

2012-04-13 Thread alxsss
Hello Hoss, Here are the explain tags for two doc 0.021646015 = (MATCH) sum of: 0.021646015 = (MATCH) sum of: 0.02141003 = (MATCH) max plus 0.01 times others of: 2.84194E-4 = (MATCH) weight(content:apache^0.5 in 3578), product of: 0.0029881175 = queryWeight(content:apache^0.5

Re: term frequency outweighs exact phrase match

2012-04-12 Thread alxsss
In that case documents 1 and 2 will not be in the results. We need them also be shown in the results but be ranked after those docs with exact match. I think omitting term frequency in calculating ranking in phrase queries will solve this issue, but I do not see that such a parameter in configs.

term frequency outweighs exact phrase match

2012-04-10 Thread alxsss
Hello, I use solr 3.5 with edismax. I have the following issue with phrase search. For example if I have three documents with content like 1.apache apache 2. solr solr 3.apache solr then search for apache solr displays documents in the order 1,.2,3 instead of 3, 2, 1 because term frequency in

data/index/segments_u (No such file or directory)

2012-03-19 Thread alxsss
Hello, I have copied solr's data folder from a dev linux box to the prod one. When starting solr I get this error on the prod server. On dev, solr starts successfully. Caused by: java.io.FileNotFoundException: /home/apache-solr-3.5.0/example/solr/data/index/segments_u (No such file or directory)

Re: Help with duplicate unique IDs

2012-03-02 Thread alxsss
take a look to I think you must use dedup to solve this issue -Original Message- From: Thomas Dowling To: solr-user Cc: Mikhail Khludnev Sent: Fri, Mar 2, 2012 1:10 pm Subject: Re: Help with duplicate unique IDs Thanks. In fact, the behavior I want is overwrite=true. I wan

Re: spellcheck configuration not providing suggestions or corrections

2012-02-13 Thread alxsss
you have put this true Maybe you need to put true Alex. -Original Message- From: Dyer, James To: solr-user Sent: Mon, Feb 13, 2012 12:43 pm Subject: RE: spellcheck configuration not providing suggestions or corrections That would be it, I think. Your request is to "/sele

Re: can solr automatically search for different punctuation of a word

2012-01-30 Thread alxsss
Hi Chantal, In the readme file at solr/contrib/analysis-extras/README.txt it says to add the ICU library (in lib/) Do I need also add ... and where? Thanks. Alex. -Original Message- From: Chantal Ackermann To: solr-user Sent: Fri, Jan 13, 2012 1:52 am Subject: Re: can solr a

can solr automatically search for different punctuation of a word

2012-01-12 Thread alxsss
Hello, I would like to know if solr has functionality to automatically search for a different punctuation of a word. For example, if a user searches for the word Uber, and the stemmer is for the German language, then solr looks for both Uber and Über, like in synonyms. Is it possible to give a file with

Re: How to apply relevant Stemmer to each document

2011-12-22 Thread alxsss
Hi Erick, Why querying would be wrong? It is my understanding that if I have let say 3 docs and each of them has been indexed with its own language stemmer, then sending a query will search all docs and return matching results? Let say if a query is "driving" and one of the docs has drive a

Re: two word phrase search using dismax

2011-12-05 Thread alxsss
Hi Eric, After reading more about pf param I increased them a few times and this solved options 2, 3, 4 but 1. As an example, for phrase "newspaper latimes" latimes.com is not even in the results to boost it to the first place and changing mm param to 1<-1 5<-2 6<90% solves only 1,4 but 2

Re: two word phrase search using dismax

2011-12-03 Thread alxsss
Hello, Here is my request handler edismax explicit 0.01 site^1.5 content^0.5 title^1.2 site^1.5 content^0.5 title^1.2 id,title, site 2<-1 5<-2 6<90% 300 true *:* content 0 165 title 0 url regex I have made a few tests with debugQuery and realised that for two word phrases, solr takes the fi

less search results in prod

2011-12-03 Thread alxsss
Hello, I have built a solr-3.4.0 data folder on the dev server and copied it to the prod server. I searched for a keyword, then modified the qf and pf params in solrconfig.xml. I searched for the same keyword again, then restored the qf and pf params to their original values. Now, solr returns a much smaller number of

Re: spellcheck in dismax

2011-11-22 Thread alxsss
It seems you forgot this true -Original Message- From: Ruixiang Zhang To: solr-user Sent: Tue, Nov 22, 2011 11:54 am Subject: spellcheck in dismax I put the following into dismax requestHandler, but no suggestion field is returned. true false 1

Re: jetty error, broken pipe

2011-11-19 Thread alxsss
I found out that curl timeout was set to 10 and for queries taking longer than 10 sec it was closing connection to jetty. I noticed that when number of docs found is large solr returns results for about 20 sec. This is too long. I set caching to off but it did not help. I think solr spends too mu

jetty error, broken pipe

2011-11-19 Thread alxsss
Hello, I use solr 3.4 with jetty that is included in it. Periodically, I see this error in the jetty output SEVERE: org.mortbay.jetty.EofException at org.mortbay.jetty.HttpGenerator.flush(HttpGenerator.java:791) at org.mortbay.jetty.AbstractGenerator$Output.flush(AbstractGenerat

Re: two word phrase search using dismax

2011-11-15 Thread alxsss
Hello, Thanks for your letter. I investigated further and found out that we have title scored more than content in qf field and those docs in the first places have one of the words in title but not both of them. The doc in the first place has only one of the words in the content. Docs with both

Re: how to achieve google.com like results for phrase queries

2011-11-07 Thread alxsss
Solr also can query link(url) text and rank them higher if we specify url in qf field. Only problem is that why it does not rank pages with both words higher when mm is set as 1<-1. It seems to me that this is a bug. Thanks. Alex. -Original Message- From: Ted Dunning To: solr-

Re: how to achieve google.com like results for phrase queries

2011-11-05 Thread alxsss
Hi Erick, The term "newspaper latimes" is not found in latimes.com. However, google places it in the first place. My guess is that mm parameter must not be set as 2<-1 in order to achieve google.com like ranking for two word phrase queries. My goal is to set mm parameter in such a way that la

how to achieve google.com like results for phrase queries

2011-11-03 Thread alxsss
Hello, I use nutch-1.3 crawled results in solr-3.4. I noticed that for two word phrases like newspaper latimes, latimes.com is not in results at all. This may be due to the dismax def type that I use in request handler dismax url^1.5 id^1.5 content^ title^1.2 url^1.5 id^1.5 content^0.5 title^1

apply filter to spell field

2011-09-27 Thread alxsss
Hello, I have implemented the spellchecker in two ways. 1. Adding a textspell type to schema.xml and making a copy field from the original content field, which is type text. 2. Without adding a new type and copy field, simply adding the name of the spell field, content, to solrconfig.xml I have an issue in

Re: pagination with grouping

2011-09-12 Thread alxsss
Is case #2 planned to be coded in the future releases? Thanks. Alex. -Original Message- From: Bill Bell To: solr-user Sent: Thu, Sep 8, 2011 10:17 pm Subject: Re: pagination with grouping There are 2 use cases: 1. rows=10 means 10 groups. 2. rows=10 means to results (irr

pagination with grouping

2011-09-08 Thread alxsss
Hello, When trying to implement pagination as in the case without grouping I see two issues. 1. with rows=10 solr feed displays 10 groups not 10 results 2. there is no total number of results with grouping to show the last page. In detail: 1. I need to display only 10 results in one page.
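
The two issues can be sketched as request parameters. This is a minimal sketch, assuming Solr's standard result-grouping parameters (group, group.field, group.ngroups), where start and rows count groups rather than documents. The helper names and the field name site are hypothetical and only for illustration.

```python
import math
from urllib.parse import urlencode


def grouped_page_params(query, group_field, page, groups_per_page=10):
    """Return Solr query parameters for one page of grouped results."""
    return {
        "q": query,
        "group": "true",
        "group.field": group_field,
        # Ask Solr to report the number of distinct groups (ngroups).
        "group.ngroups": "true",
        # With grouping on, start and rows count groups, not documents.
        "start": (page - 1) * groups_per_page,
        "rows": groups_per_page,
    }


def last_page(ngroups, groups_per_page=10):
    """Number of pages needed to show ngroups groups (at least one page)."""
    return max(1, math.ceil(ngroups / groups_per_page))


# "site" is a hypothetical grouping field.
params = grouped_page_params("sport", "site", page=3)
print(urlencode(params))
print(last_page(95))
```

With one document shown per group, ten groups per page amounts to ten results per page, and the ngroups value in the response gives the total needed to compute the last page.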

grouping by alpha-numeric field

2011-09-07 Thread alxsss
Hello, I try to group by a field with type string. In the results I see groupValues as parts of the group field. Any ideas how to fix this. Thanks. Alex.

spellchecking in nutch solr

2011-09-01 Thread alxsss
Hello, I have tried to implement a spellchecker based on the index in nutch-solr by adding a spell field to schema.xml and making it a copy of the content field. However, this doubled the data folder size, and the spell field, as a copy of the content field, appears in the xml feed, which is not necessary. Is it po

Re: how to manually add data to indexes generated by nutch-1.0 using solr

2009-05-13 Thread alxsss
I forgot to say that when I do curl http://localhost:8983/solr/update -H "Content-Type: text/xml" --data-binary '' I get 0453, and a search for the added keywords gives 0 results. Does status 0 mean that the addition was successful? Thanks. Alex. -Original Message- From: Erik Hatcher T
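
A status of 0 in Solr's response header means the request was processed without error, but added documents only become searchable after a commit. That is one common cause of this symptom, though not necessarily the one in this thread. The sketch below builds the two XML messages that would be posted to the update URL: the add, then the commit. The field names id, title and content are hypothetical, since the original XML was lost in the archive.

```python
import xml.etree.ElementTree as ET


def build_add_message(fields):
    """Serialize one document as a Solr XML <add> message."""
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in fields.items():
        field = ET.SubElement(doc, "field", name=name)
        field.text = str(value)
    # encoding="unicode" returns a str without an XML declaration.
    return ET.tostring(add, encoding="unicode")


# Hypothetical field names; the values echo the test record in the thread.
add_xml = build_add_message({"id": "test.com", "title": "test", "content": "test"})
# A separate commit message makes the added document visible to searches.
commit_xml = "<commit/>"

print(add_xml)
print(commit_xml)
```

Each string would be sent the same way as in the message above, with curl's --data-binary option and the text/xml content type.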

Re: how to manually add data to indexes generated by nutch-1.0 using solr

2009-05-12 Thread alxsss
Tried to add a new record using curl http://localhost:8983/solr/update -H "Content-Type: text/xml" --data-binary ' 20090512170318 86937aaee8e748ac3007ed8b66477624 0.21189615 test.com test test 20090513003210909 ' I get 071 and added records are not found in the search. Any ideas w

how to manually add data to indexes generated by nutch-1.0 using solr

2009-05-11 Thread alxsss
Hello, I had Nutch-1.0 crawl, fetch and index a lot of files. Then I needed to index a few more files. I know the keywords for those files and their locations, so I need to add them manually. I took a look at two tutorials on the wiki, but did not find any info about this issue. Is there a tu