Suggester not suggesting anything using DictionaryCompoundWordTokenFilterFactory

2014-11-10 Thread Thomas Michael Engelke
I'm toying around with the suggester component, like described here: http://www.andornot.com/blog/post/Advanced-autocomplete-with-Solr-Ngrams-and-Twitters-typeaheadjs.aspx So I made 4 fields: stored="true" multiValued="true" /> stored="true" multiValued="true" /> indexed="true" stored=

Re: Best practice: Autosuggest/autocomplete vs. "real search"

2014-11-10 Thread Thomas Michael Engelke
The dedicated "autosuggest" field is not used by a suggester component, instead we just directly query it ("/select"). I'm trying to read my way into how the suggesters work, and toying around with some configurations (For instance from here: http://www.andornot.com/blog/post/Advanced-autocomplete

Fwd: Best Practices for open source pipeline/connectors

2014-11-10 Thread Dan Davis
The volume and influx rate in my scenario are very modest. Our largest collections with existing indexing software is about 20 million objects, second up is about 5 million, and more typical collections are in the tens of thousands. Aside from the 20 million object corpus, we re-index and replic

Re: Lucene to Solrcloud migration

2014-11-10 Thread Erick Erickson
bq: - the documents are organized in "shards" according to date (integer) and language (a possibly extensible discrete set) bq: - the indexes are disjunct OK, I'm having a hard time getting my head around these two statements. If the indexes are disjunct in the sense that you only search one at

Re: Ask Question

2014-11-10 Thread Erick Erickson
Whoa! One of the main points of SolrCloud is that you shouldn't have (in most cases) to worry about the shard. If you must control the shard, you can include a prefix with the ! character. So I really wonder if this is an XY problem. Please tell us the use-case you are trying to solve, perhaps the

Re: Help with SolrCloud exceptions while recovering

2014-11-10 Thread Erick Erickson
Glad to hear that! Thanks for closing this out.
Best, Erick
On Sun, Nov 9, 2014 at 4:55 PM, Bruno Osiek wrote:
> Erick,
>
> Once again thank you very much for your attention.
>
> Now my pseudo-distributed SolrCloud is configured with no inconsistency. An
> additional problem was starting Jboss w

Re: Best practice: Autosuggest/autocomplete vs. "real search"

2014-11-10 Thread Jorge Luis Betancourt Gonzalez
It wouldn’t be easy if in the site you’ll ensure that only terms are submitted to the actual search? In app I worked some time ago the default behavior of the Javascript component used for autocompletion was to first autocomplete the term in the input and then submit the query against the backen

Re: How to Facet external fields

2014-11-10 Thread Ahmet Arslan
Hi, Can't you use Function Range Query Parser with facet.query? Ahmet On Tuesday, November 11, 2014 1:03 AM, bbarani wrote: I am using external field for price field since it changes every 10 minutes. I am able to display the price / use range queries to display the documents based on a price
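
The suggestion above (one `facet.query` per price bucket, each using the function range parser) could be sketched roughly as follows. This is only an illustration: the field name `price_ext` and the bucket boundaries are assumptions, and the `field(...)` wrapper is one way to read a field's value as a function.

```python
from urllib.parse import urlencode


def price_facet_params(field, ranges):
    """Build Solr request parameters with one facet.query per (low, high) range."""
    params = [("q", "*:*"), ("rows", "0"), ("facet", "true")]
    for low, high in ranges:
        # {!frange l=.. u=..} selects documents whose function value
        # falls between the lower and upper bound.
        params.append(
            ("facet.query", "{!frange l=%s u=%s}field(%s)" % (low, high, field))
        )
    return params


# "price_ext" is a hypothetical external file field name.
params = price_facet_params("price_ext", [(0, 50), (51, 100)])
print(urlencode(params))
```

Each `facet.query` comes back as its own count in the facet section of the response, which approximates a range facet without the field being indexed.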

Re: Highlighting simple.pre and simple.post values getting ignored

2014-11-10 Thread shamik
Found the issue: to use FastVectorHighlighter, the pre and post tag syntax is different. This worked out as expected.

Re: Highlighting simple.pre and simple.post values getting ignored

2014-11-10 Thread shamik
Looks like this has to do with the selection of fast vector and breakIterator as boundary scanner. I'm using them to make sure that the highlighted snippet starts from the beginning of a sentence and not from the middle. false true breakIterator Now, if I don't use them, I'm getting the right p

Highlighting simple.pre and simple.post values getting ignored

2014-11-10 Thread Shamik Bandopadhyay
Hi, I'm facing a weird issue where the specified "hl.simple.pre" and "hl.simple.post" values for highlighting is getting ignored. In my test handler, I've the following entry: true name subject html 200 false true breakIterator 100

How to Facet external fields

2014-11-10 Thread bbarani
I am using external field for price field since it changes every 10 minutes. I am able to display the price / use range queries to display the documents based on a price range. I am trying to see if its possible to generate facets using external field. I understand that faceting requires indexing

Does ReRankQuery support reranking the result of a FuzzyQuery?

2014-11-10 Thread Brian Sawyer
Hello, We are trying to make use of the new ReRankQuery to rescore results according to a custom function but run into problems when our main query includes a FuzzyQuery. Using the example setup in Solr 4.10.2 querying: q=name:Dell~1 &rq={!rerank reRankQuery=id:whatever} results in: java.lang.U

Re: how do I stop queries from being logged in two different log files in Tomcat

2014-11-10 Thread solr-user
awesome Mike. that does exactly what I want. many thanks

clarification regarding shard splitting and composite IDs

2014-11-10 Thread Ian Rose
Howdy - We are using composite IDs of the form !. This ensures that all events for a user are stored in the same shard. I'm assuming from the description of how composite ID routing works, that if you split a shard the "split point" of the hash range for that shard is chosen to maintain the inva

Re: how do I stop queries from being logged in two different log files in Tomcat

2014-11-10 Thread Michael Della Bitta
I generally turn off the console logging when I install Tomcat. It flushes after every line, unlike the other handlers, and that's sort of a performance problem (although if you need that, you need that). Basically, find logging.properties in Tomcat's conf directory, and change these two lines

how do I stop queries from being logged in two different log files in Tomcat

2014-11-10 Thread solr-user
hi all. We have a number of solr 1.4x and solr 4.x installations running on tomcat. We are trying to standardize the content of our log files so that we can automate log analysis; we don't want to use log4j at this time. In our solr 1.4x installations, the following conf\logging.properties file

Re: Lucene to Solrcloud migration

2014-11-10 Thread Michael Della Bitta
Hi Michal, Is there a particular reason to shard your collections like that? If it was mainly for ease of operations, I'd consider just using CompositeId to prevent specific types of queries hotspotting particular nodes. If your ingest rate is fast, you might also consider making each "colle

RE: Synonymn for Numbers

2014-11-10 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Hi Jack, I am using it at query time and index time, as I have other multiword synonyms. It seems the unidirectional mapping has some issues, or I may have misconfigured it. I did something like below: 10 => 2010 14 => 2014 I am thinking of using a separate file for this synonym and using it at query time. Let me know if yo

Re: Proper way to backup solr.

2014-11-10 Thread Shawn Heisey
On 11/10/2014 12:00 AM, elmerfudd wrote:
> What is the proper way to backup Solr while running, in the index level and
> in the node level.
> How do you restore it afterwards ?
One way to back up is to use the backup feature of the replication handler. Note that this actually copies the files, so
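
As a rough illustration of the replication handler's backup feature mentioned above, the request is an HTTP call with `command=backup`. The sketch below only builds the URL; the host, port, and core name `collection1` are assumptions, and the optional `location` parameter names the directory the snapshot is written to.

```python
from urllib.parse import urlencode


def backup_url(base_url, core, location=None):
    """Return the replication-handler URL that triggers a backup of one core."""
    params = {"command": "backup"}
    if location:
        # Directory on the Solr host where the snapshot should be written.
        params["location"] = location
    return "%s/%s/replication?%s" % (base_url.rstrip("/"), core, urlencode(params))


# Host and core name are placeholders for illustration only.
print(backup_url("http://localhost:8983/solr", "collection1"))
```

Requesting that URL (for example with `curl`) starts the snapshot on the running core.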

Re: Unsupported major.minor version 51.0 when deploying solr 4.7.2

2014-11-10 Thread Shawn Heisey
On 11/10/2014 5:43 AM, Danesh Kuruppu wrote:
> I am getting following error when I try to deploy solr.war.
>
> java.lang.UnsupportedClassVersionError:
> org/apache/solr/servlet/SolrDispatchFilter : Unsupported major.minor
> version 51.0 (unable to load class
> org.apache.solr.servlet.SolrDispatchF

Solr webapp deployment is very slow with in solrconfig.xml

2014-11-10 Thread 汤林
Have opened JIRA for that: https://issues.apache.org/jira/browse/SOLR-6675 We have a SolrCloud with Solr version 4.7 with Tomcat 7. And our solr index(cores) are big(50~100G) each core. When we start up tomcat, the solr webapp deployment is very slow. From tomcat's catalina log, every time it t

Is there a configurable parameter to control the doc number between peersync and the snapshot pull recovery?

2014-11-10 Thread 汤林
If there are >100 docs gap between the recovering node and the good node, the solr will do snap pull recovery instead of peersync. Can the 100 docs be configurable? For example, there can be 1, 1000, or 10 docs gap between the good node and the node to recover. For 100 doc, a regular restart

Re: Best practice: Autosuggest/autocomplete vs. "real search"

2014-11-10 Thread Michael Sokolov
The goal is to ensure that suggestions from autocomplete are actually terms in the main index, so that the suggestions will actually result in matches. You've considered expanding the main index by adding the suggestion n-grams to it, but it would probably be better to alter your suggester so

Re: Solr query with multiple double quotes

2014-11-10 Thread xan
Hi, Still it shows me 0 results. I'm doing the following: Do a customized search and find out some "exact matches". To those exact matches, I then apply a boost. It's working amazingly. It's just these types of names (that I recently encountered) that weren't returning any search results.

Re: Solr query with multiple double quotes

2014-11-10 Thread Ahmet Arslan
Hi, How about using the caret operator after the parenthesis? q=name:("how "r" you")^100 By the way what is your default search field? Ahmet On Monday, November 10, 2014 2:19 PM, xan wrote: When doing a query: "how "r" you" as: name:("how "r" you"^100) : I get no results. When doing it as->

Lucene to Solrcloud migration

2014-11-10 Thread Michal Krajňanský
Hi All, I have been working on a project that has long employed Lucene indexer. Currently, the system implements a proprietary document routing and index plugging/unplugging on top of the Lucene and of course contains a great body of indexes. Recently an idea came up to migrate from Lucene to Sol

Unsupported major.minor version 51.0 when deploying solr 4.7.2

2014-11-10 Thread Danesh Kuruppu
Hi all, I am getting following error when I try to deploy solr.war. java.lang.UnsupportedClassVersionError: org/apache/solr/servlet/SolrDispatchFilter : Unsupported major.minor version 51.0 (unable to load class org.apache.solr.servlet.SolrDispatchFilter). Server Java Version: 1.6.0_45 Solr Vers
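
For context on the error above: class-file major version 51 corresponds to Java 7, while a Java 6 runtime such as the reported 1.6.0_45 loads class files only up to major version 50. The sketch below shows how the version can be read from the first eight bytes of a `.class` file; the sample header bytes are constructed for illustration, not taken from the Solr distribution.

```python
import struct

# Class-file major version numbers and the Java release that introduced them.
JAVA_RELEASE = {49: "Java 5", 50: "Java 6", 51: "Java 7", 52: "Java 8"}


def class_file_version(header):
    """Return (major, minor) from the first 8 bytes of a Java class file."""
    # Layout: 4-byte magic, 2-byte minor version, 2-byte major version (big-endian).
    magic, minor, major = struct.unpack(">IHH", header[:8])
    if magic != 0xCAFEBABE:
        raise ValueError("not a Java class file")
    return major, minor


# Illustrative header: magic number, minor version 0, major version 0x33 (51).
header = bytes.fromhex("cafebabe00000033")
major, minor = class_file_version(header)
print(major, minor, JAVA_RELEASE.get(major, "unknown"))  # 51 0 Java 7
```

To check a real class, pass the first eight bytes read from the `.class` file in binary mode.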

AW: is group.query supported in solrcloud (4.8) ?

2014-11-10 Thread Timo Schmidt
Hi Giovanni, i think there is a bug in solr (cloud only) https://issues.apache.org/jira/browse/SOLR-5046 i've prepared a patch, that needs some review and improvements (my solr core knowledge is limited here :)) Maybe you can try the patch and vote for the issue in jira. What i know by now: w

Solr query with multiple double quotes

2014-11-10 Thread xan
When doing a query: "how "r" you" as: name:("how "r" you"^100) : I get no results. When doing it as-> name:"how "r" you"^100 : I get the correct results. Solr parses the 1st query as -> "q": "name:\"how \"r\" you\"100" and the 2nd one as -> q": "name:\"how \"r\" you\"100" I want to get corr

ignoreTikaException flag not working

2014-11-10 Thread 5ton3
Hi! I'm importing BLOBs from an Oracle DB, and want to retrieve the textual body/plaintext content for analyzing/indexing purposes. I'm using TikaEntityProcessor to do the parsing of the documents, which works fine for most of the documents. But in some cases , e.g. when a document is password pro

Aw: is group.query supported in solrcloud (4.8) ?

2014-11-10 Thread Timo Schmidt
Hi Giovanni, i think there is a bug in solr (cloud only) https://issues.apache.org/jira/browse/SOLR-5046 i've prepared a patch, that needs some review and improvements (my solr core knowledge is limited here :)) Maybe you can try the patch and vote for the issue in jira. What i know by now: w

Re: How to dynamically create Solr cores with schema

2014-11-10 Thread Andreas Hubold
Hi Alex, Solr 4.10 has support to create fields (of already defined types) with its Schema API: https://cwiki.apache.org/confluence/display/solr/Schema+API#SchemaAPI-Modifytheschema But one has to specify an initial schema.xml which is shared forever between all cores created from the same co

is group.query supported in solrcloud (4.8) ?

2014-11-10 Thread Giovanni Bricconi
hello I have a collection 0_2014_10_11 made of three shards When I try a group.query, even specifying a single shard, i get this error "shard 0 did not set sort field values (FieldDoc.fields is null); you must pass fillFields=true to IndexSearcher.search on each shard" This is the request, ask t

Re: Filter Query or Query

2014-11-10 Thread Erik Hatcher
Filter queries are the right thing to use here. It allows the filters to cache separately from the query and will provide, possibly dramatic, performance improvements. Don't worry about the caching concern - you'll have far fewer caches when the filter constraints are separate from the user qu
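
A minimal sketch of the separation described above: the user's text goes in `q` and the access restriction goes in `fq`, so the filter can be cached independently of the query. The field name `department` and the sample values are assumptions, and the helper assumes department names contain no double quotes.

```python
from urllib.parse import urlencode


def build_search_params(user_query, departments):
    """Keep the user's query in q and the visibility restriction in fq."""
    # Quote each allowed value and OR them together into one filter clause.
    clause = " OR ".join('"%s"' % d for d in departments)
    return [("q", user_query), ("fq", "department:(%s)" % clause)]


# "department" is a hypothetical field holding each document's owner.
params = build_search_params("annual report", ["A"])
print(urlencode(params))
```

Because the `fq` string is identical for every query issued by users of the same department, repeated searches can reuse the same cached filter.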

Re: help about guide of solr 3.6.2

2014-11-10 Thread Alexandre Rafalovitch
Solr reference guide was contributed by Lucid fairly recently. Until then, the Wiki was the source of all the information: https://wiki.apache.org/solr/ I would also download an original Solr 3.6.2 distribution and check the examples and documentation that came with that. And, of course, the arch

Re: Parsing Faceted Search output in Solr

2014-11-10 Thread Erik Hatcher
Make that fq not facet.query. Does that do what you want?
Erik
> On Nov 10, 2014, at 00:40, Mayuri Laxman Gadsing wrote:
>
> Hi all,
>
> I'm using the faceted search in Solr. I wanted to know if there's a way to
> display selective records in the sublist.
>
> I've tried this query -
>

help about guide of solr 3.6.2

2014-11-10 Thread Nie Fei
Hi all, I'm the new guy for Solr, as my company uses Solr 3.6.2. So I need some documents about that release. I checked the Solr site, but there is only guides for releases above 4.4. Do you know where can i get the guides or docs for 3.6.2? Or if you have one, can you send it to me? many than

Filter Query or Query

2014-11-10 Thread Bram Van Dam
Hi folks, I have an index with hundreds of millions of documents, which users can query in various ways. Two index fields are used by the system to hide certain documents from certain users (for instance: Department A can only view documents belonging to Department A, but not Department B).

Re: on regards to Solr and NoSQL storages integration

2014-11-10 Thread andrey prokopenko
Thanks for the useful information on DataStax internals, very interesting. My solution stores both primary key reference links and data itself in NoSQL storage, i.e. index parts for the stored fields are not created at all. I guess by the time DataStax has created their solution, there was no Solr