Re: How can I convert xml message for updating a Solr index to a javabin file

2014-04-27 Thread Jack Krupansky
Look at the SolrJ source code and doc. JavaBin is more of a protocol than a file format. -- Jack Krupansky -Original Message- From: Elran Dvir Sent: Sunday, April 27, 2014 2:16 AM To: solr-user@lucene.apache.org Subject: RE: How can I convert xml message for updating a Solr index to

Re: Wildcard search not working with search term having special characters and digits

2014-04-28 Thread Jack Krupansky
analyzed at index time for wildcard to work. Ditto is your filed type uses the word delimiter filter with the split digits option enabled - the alpha and numeric portions will generate separate terms - and cause a wildcard to fail. -- Jack Krupansky -Original Message- From: Geepalem

Re: Stemming not working with wildcard search

2014-04-28 Thread Jack Krupansky
Wildcards and stemming are incompatible at query time - you need to manually stem the term before applying your wildcard. Wildcards are not supported in quoted phrases. They will be treated as punctuation, and ignored by the standard tokenizer or the word delimiter filter. -- Jack Krupansky

Re: Explain Solr Query Execution

2014-05-04 Thread Jack Krupansky
ntity in your request. Maybe you should start by telling us what you are trying to achieve, in plain English. -- Jack Krupansky -Original Message- From: nativecoder Sent: Sunday, May 4, 2014 2:20 PM To: solr-user@lucene.apache.org Subject: Explain Solr Query Execution How will a

Re: Wildcard malfunctioning

2014-05-05 Thread Jack Krupansky
MappingCharFilterFactory. -- Jack Krupansky -Original Message- From: Román González Sent: Monday, May 5, 2014 7:00 AM To: solr-user@lucene.apache.org Subject: Wildcard malfunctioning Hi all! Sorry in advance if this question was posted but I were unable to find it with search engines

Re: Help to Understand a Solr Query

2014-05-05 Thread Jack Krupansky
Read up on the edismax query parser first: http://wiki.apache.org/solr/ExtendedDisMax The "^" operator is known as boosting or field boosting and is used to influence document scores for relevancy. It has no analog in SQL. -- Jack Krupansky -Original Message- From: n

Re: Help to Understand a Solr Query

2014-05-05 Thread Jack Krupansky
"dismax" means Disjunction Maximum, which means Lucene takes the highest scoring clause (field), for each search term. This is effectively an OR of the clauses. -- Jack Krupansky -Original Message- From: nativecoder Sent: Monday, May 5, 2014 11:21 AM To: solr-user@lucene.

Re: Turning on KeywordRepeat and RemoveDups on an existing fieldType.

2014-05-05 Thread Jack Krupansky
an do the reindex incrementally (as you replace existing documents) as well if you don't mind if the difference in relevancy takes an extended time to become apparent. -- Jack Krupansky -Original Message- From: Michael Tracey Sent: Monday, May 5, 2014 4:52 PM To: solr-user@lu

Re: Relevancy help

2014-05-05 Thread Jack Krupansky
what the document scores look like. -- Jack Krupansky -Original Message- From: Ravi Solr Sent: Monday, May 5, 2014 5:41 PM To: solr-user@lucene.apache.org Subject: Relevancy help Hello, I have a weird relevancy requirement. We search news content hence chronology is very importan

Re: Strict Search in Apache Solr

2014-05-05 Thread Jack Krupansky
The term "strict search" is not in the Lucene/Solr nomenclature - it could mean any number of things. It sounds as if maybe you want to do a phrase search, looking for an exact phrase - yes, you can do that by enclosing the phrase in quotes. -- Jack Krupansky -Origin

Re: Indexing scanned PDFs

2014-05-06 Thread Jack Krupansky
tly from a word processor source file will retain smooth curves on characters while a PDF generated from scanned page images will show heavy pixelation. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Tuesday, May 6, 2014 1:30 AM To: solr-user@lucene.apache.org S

Re: Help to Understand a Solr Query

2014-05-06 Thread Jack Krupansky
alternatives - at least one of the alternatives must match. -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 6, 2014 4:44 AM To: solr-user@lucene.apache.org Subject: Re: Help to Understand a Solr Query When I go through the debug results I f

Re: Help to Understand a Solr Query

2014-05-06 Thread Jack Krupansky
Think of debugQuery as your "Solr BFF"! -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 6, 2014 7:36 AM To: solr-user@lucene.apache.org Subject: Re: Help to Understand a Solr Query Thank you this is what I was looking for all this time I

Re: Too many documents Exception

2014-05-07 Thread Jack Krupansky
ad such an overstuffed index. -- Jack Krupansky -Original Message- From: [Tech Fun]山崎 Sent: Tuesday, May 6, 2014 8:54 PM To: solr-user@lucene.apache.org Subject: Too many documents Exception Hello everybody, Solr 4.3.1(and 4.7.1), Num Docs + Deleted Docs > 2147483647(Integer.MAX_VALUE

Re: Use XSD or DTD to make Solr schema?

2014-05-07 Thread Jack Krupansky
Sorry, but you have to create the schema manually, but... you could possibly get by with Solr schemaless mode to dynamically create the schema based on the actual data values. See: https://cwiki.apache.org/confluence/display/solr/Schemaless+Mode -- Jack Krupansky -Original Message

Re: LetterTokenizerFactory doesn't work as expected

2014-05-11 Thread Jack Krupansky
Please post your full field type analyzer. The letter tokenizer should in fact return "I", "can", and "t" - if it is used properly. -- Jack Krupansky -Original Message- From: ienjreny Sent: Saturday, May 10, 2014 8:28 AM To: solr-user@lucene.apache.org S

Re: is it possible for solr to calculate and give back the price of a product based on its sub-products

2014-05-11 Thread Jack Krupansky
n any case, please clarify your use case. -- Jack Krupansky -Original Message- From: Gharbi Mohamed Sent: Friday, May 9, 2014 3:29 PM To: solr-user@lucene.apache.org Subject: is it possible for solr to calculate and give back the price of a product based on its sub-products Hi, I am usin

Re: Easises way to insatll solr cloud with tomcat

2014-05-13 Thread Jack Krupansky
Check out HDS from Heliosearch - it comes packaged with Tomcat, ready to go: http://heliosearch.com/download.html -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Monday, May 12, 2014 8:23 AM To: solr-user@lucene.apache.org Subject: Re: Easises way to insatll solr cloud

Re: Too many documents Exception

2014-05-13 Thread Jack Krupansky
One of the hard-core Lucene guys is going to have to help you out. Or you may have to write some custom code to fix the index for any such shard. If you have deleted any documents, it may be sufficient to simply optimize the index. -- Jack Krupansky -Original Message- From: yamazaki

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-13 Thread Jack Krupansky
I do have basic coverage for that filter (and all other filters) and the parameter values in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html That said, are you sure you want to be using the payload feature of Lucene

Re: distrib=false is not honoring

2014-05-14 Thread Jack Krupansky
at is your rationale? What is this odd "mcat.intent" query response writer type that you are specifying with the qt parameter? -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Wednesday, May 14, 2014 1:35 AM To: solr-user@lucene.apache.org Subject: distrib=false i

Re: KeywordTokenizerFactory splits the string for the exclamation mark

2014-05-14 Thread Jack Krupansky
ry parser: http://lucene.apache.org/core/4_8_0/queryparser/org/apache/lucene/queryparser/classic/package-summary.html#Escaping_Special_Characters -- Jack Krupansky -Original Message- From: Romani Rupasinghe Sent: Tuesday, May 13, 2014 11:14 AM To: solr-user@lucene.apache.org Subject: KeywordT

Re: Indexing DateField timezone problem

2014-05-14 Thread Jack Krupansky
nually add that to dates. -- Jack Krupansky -Original Message- From: hakanbillur Sent: Friday, May 9, 2014 4:38 AM To: solr-user@lucene.apache.org Subject: Indexing DateField timezone problem <http://lucene.472066.n3.nabble.com/file/n4135079/Capture2.png> <http://lucene.472066

Re: Difference between search strings

2014-05-15 Thread Jack Krupansky
Inside of quotes you only have to escape quote and backslash. Add the debugQuery=true parameter to see exactly how Solr processes characters and generates queries. But... in a URL you have to URL-encode URL query parameters: http://en.wikipedia.org/wiki/Query_string -- Jack Krupansky

Re: Help to Understand a Solr Query

2014-05-15 Thread Jack Krupansky
Please don't re-use an existing message thread for a new, completely independent question! Also, try to make the subject line indicate something about the actual issue. -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 13, 2014 10:56 AM To: solr

Status of mail?

2014-05-16 Thread Jack Krupansky
Is the mail list working again yet?? -- Jack Krupansky

Re: Difference between search strings

2014-05-16 Thread Jack Krupansky
For these specific examples, the results should be the same, but mostly that's because the term is a simple sequence of letters. I have an extended discussion of characters in terms in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/

Re: date range queries efficiency

2014-05-16 Thread Jack Krupansky
exact query matches such as year without expanding the date to a range for the full interval. -- Jack Krupansky -Original Message- From: Dmitry Kan Sent: Friday, May 9, 2014 6:41 AM To: solr-user@lucene.apache.org Subject: date range queries efficiency Hi, There was a mention eith

Re: slow performance on simple filter

2014-05-16 Thread Jack Krupansky
Add the debugQuery=true parameter and look at the "timing" section to see which search component is consuming the time. Are you using faceting or highlighting? 7 million documents is actually a fairly small index. -- Jack Krupansky -Original Message- From: mizayah Sent:

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Jack Krupansky
sure would be nice to see more substantial and easier to use support for Payload in Solr. -- Jack Krupansky -Original Message- From: ienjreny Sent: Saturday, May 17, 2014 1:24 AM To: solr-user@lucene.apache.org Subject: Re: What is the usage of solr.NumericPayloadTokenFilterFactory

Re: Solr 4.8: Does eDisMax parser calls analyzer chain to tokenize?

2014-05-17 Thread Jack Krupansky
01 k CD-ROM. => CD-ROM. vs. CD ROM Finally, the default for the splitOnNumerics attribute is "1" (true), which is why "abc123xyz" is split into three terms. If you don't want that split, set splitOnNumerics="0". There are more details on WDF in my e-book:

Re: Index / Query IP Address as number.

2014-05-18 Thread Jack Krupansky
What are you using for your default query operator, and do you have autoGeneratePhraseQueries set to "true" for your field type? I mean, a query for 192.168.1.4 shouldn't match 192.168.1.3 - unless you have autoGeneratePhraseQueries set to "false" (the defa

Re: How to delete docs only in a spec shard within a collection

2014-05-19 Thread Jack Krupansky
Normally a delete-by-query needs to be distributed to all shards since it is not known which document keys it will affect. It sounds like SolrCloud does not support the feature you would really like: support for distrb=false. You can file a Jira request for a feature "improvement."

Re: Subscribe for Solr Mails

2014-05-19 Thread Jack Krupansky
Read this web page for information about subscribing: https://lucene.apache.org/solr/discussion.html -- Jack Krupansky -Original Message- From: Kamal Kishore Aggarwal Sent: Monday, May 19, 2014 6:12 AM To: solr-user@lucene.apache.org Subject: Subscribe for Solr Mails Dear Team

Re: Index / Query IP Address as number.

2014-05-19 Thread Jack Krupansky
addresses. And then you could output to multiple fields - one for the raw string for wildcard matches, say, and one as an integer for proximity or range checks. -- Jack Krupansky -Original Message- From: SolrUser1543 Sent: Monday, May 19, 2014 3:04 PM To: solr-user@lucene.apache.org Subject

Re: Howto Search word which contains the character "

2014-05-20 Thread Jack Krupansky
there are some examples in my e-book that shows how to map @ and _ to ALPHA. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Tuesday, May 20, 2014 4:55 AM To: solr-user@lucene.apache.org Subject: Re: Howto Search word which contains the character " Hi, It is speci

Re: trigger delete on nested documents

2014-05-20 Thread Jack Krupansky
document would make it discontiguous with the rest of the block of documents. Just update the block by resending the entire block of documents. For e previous discussion of this limitation: http://lucene.472066.n3.nabble.com/block-join-and-atomic-updates-td4117178.html -- Jack Krupansky

Re: solr-user Digest of: get.100322

2014-05-21 Thread Jack Krupansky
il to follow it. Or, maybe we should enhance Solr to check available memory and log a stern warning if the index size exceeds system memory when Solr is started. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Tuesday, May 20, 2014 1:49 PM To: solr-user@lucene.apache.

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
g of documents that is different? For the latter, the fq is purely a "filter" that removes documents, but the terms in fq do not participate in relevance or scoring calculation of documents, so documents will not get boosted based on the terms present in fq. -- Jack Krupansk

Re: Distributed Search in Solr with different queries per shard

2014-05-21 Thread Jack Krupansky
Unfortunately the same query will be sent to all cores if you use the shards parameter to query multiple cores. Is there some characteristic of the first core that is distinct from the second core so that you could OR the differences between the two? -- Jack Krupansky -Original Message

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
filtering terms to participate in the document scoring. In other words, what exactly were you trying to achieve by using fq? -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 12:19 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
, or add a boosting query using the bq parameter. The latter approach works for the dismax and edismax query parsers only. -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 5:51 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR Hi

Re: multiple queries in single request

2014-05-21 Thread Jack Krupansky
Nothing special for this use case. This seems to be a use case that I would call "bulk data retrieval - based on ID". I would suggest "batching" your requests - limit each request query to, say, 50 or 100 IDs. -- Jack Krupansky -Original Message- From: Pav

Re: boosting multivalued fields

2014-05-21 Thread Jack Krupansky
Yes. -- Jack Krupansky -Original Message- From: vit Sent: Wednesday, May 21, 2014 11:20 AM To: solr-user@lucene.apache.org Subject: boosting multivalued fields is it posiible to boost values of the same field. For example in a query like that: category_id:(2271578^0.5 22718986^0.4

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
The results will be scored, but only based on terms in q, not terms in fq. -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 6:41 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR Interesting!! I did not know that using &qu

Re: Applying boosting for keyword search

2014-05-21 Thread Jack Krupansky
Just add the boost to the keyword: q=toyota^100. Or, use the dismax or edismax query parsers and then the boost can be specified for the field: qf=keyword^100. -- Jack Krupansky -Original Message- From: manju16832003 Sent: Thursday, May 22, 2014 12:04 AM To: solr-user

Re: pdfs

2014-05-21 Thread Jack Krupansky
cific symptom? -- Jack Krupansky -Original Message- From: Brian McDowell Sent: Thursday, May 22, 2014 12:24 AM To: solr-user@lucene.apache.org Subject: pdfs Has anyone had issues with indexing pdf files? Some pdfs are bringing down Solr completely so that it actually needs to be man

Re: Applying boosting for keyword search

2014-05-21 Thread Jack Krupansky
Your original message had "q=toyota featured:true^100" and also using bq - both are valid. If either is not working for you, please be specific about what exactly is not behaving as you expected - what the symptom is. Sometimes you have to experiment with the boost factor. -- Jack

Re: pdfs

2014-05-22 Thread Jack Krupansky
Yeah, I recall running into infinite loop issues with PDFBox in Solr years ago. They keep fixing these issues, but they keep popping up again. Sigh. -- Jack Krupansky -Original Message- From: Siegfried Goeschl Sent: Thursday, May 22, 2014 4:35 AM To: solr-user@lucene.apache.org

Re: multiple queries in single request

2014-05-22 Thread Jack Krupansky
No, I was rejecting BOTH methods 1 and 2. I was suggesting a different method. I'll leave it to somebody else to describe the method so that it is easier to understand. -- Jack Krupansky -Original Message- From: Pavel Belenkovich Sent: Thursday, May 22, 2014 4:00 AM To: solr

Re: Query translation of User Fields

2014-05-22 Thread Jack Krupansky
Hmmm... that doesn't sound like what I would have expected - I would have thought that Solr would throw an exception on the "user" field, rather than simply treat it as a text keyword. File a Jira. Either it's a bug or the doc is not complete. -- Jack Krupansky -

Re: Atomic update by query instead of ID

2014-05-22 Thread Jack Krupansky
No, but it sure would be nice to have the Elasticsearch feature of supplying a script for update. -- Jack Krupansky -Original Message- From: Saumitra Srivastav Sent: Thursday, May 22, 2014 11:13 AM To: solr-user@lucene.apache.org Subject: Atomic update by query instead of ID Is is

Re: How to Disable Commit Option and Just Manage it via SolrConfig?

2014-05-23 Thread Jack Krupansky
class, or maybe you would have to copy and edit it. Also, note that the delete command also has a commitWithin setting. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Thursday, May 22, 2014 6:36 AM To: solr-user@lucene.apache.org Subject: How to Disable Commit Option and J

Re: index a repository of documents(.doc) without using post.jar

2014-05-23 Thread Jack Krupansky
Is there a particular reason you are adverse to using post.jar? I mean, if there is some bug or inconvenience, let us know so we can fix it! The Solr server itself does not provide any ability to "crawl" file systems (LucidWorks Search does.) post.jar does provide that convenience.

Re: index a repository of documents(.doc) without using post.jar

2014-05-23 Thread Jack Krupansky
Feel free to look at the source code for post.jar. I mean, all it is really doing is scanning the directory (optionally recursively) and then streaming each file to Solr. -- Jack Krupansky -Original Message- From: benjelloun Sent: Friday, May 23, 2014 8:15 AM To: solr-user

Re: Solr 4.7.2 ValueSourceParser classCast exception

2014-05-23 Thread Jack Krupansky
Are you sure that you compiled your code with the proper Solr jars so that the class signature (extends, implements, and constructors) matches the Solr 4.7.2 jars? I mean, Java is simply complaining that your class is not a valid value source class of the specified type. -- Jack Krupansky

Re: Query translation of User Fields

2014-05-25 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Yonik Seeley Sent: Sunday, May 25, 2014 10:58 AM To: solr-user@lucene.apache.org Subject: Re: Query translation of User Fields On Thu, May 22, 2014 at 10:56 AM, Jack Krupansky wrote: Hmmm... that doesn't sound like what I would

Re: about analyzer and tokenizer

2014-05-26 Thread Jack Krupansky
book". Your SYNONYMS.TXT file would contain: macbook,mac book Only use the synonyms filter at index time. The standard query parsers don't support phrases for synonyms. -- Jack Krupansky -Original Message- From: rachun Sent: Monday, May 26, 2014 5:26 AM To: solr-user@lucene.apa

Re: Solr - Cores not initialised

2014-05-26 Thread Jack Krupansky
f the XML elements. It appears that you mangled that message as well! Feel free to post the complete message here as well. -- Jack Krupansky -Original Message- From: Manikandan Saravanan Sent: Monday, May 26, 2014 1:52 AM To: solr-user@lucene.apache.org Cc: Varuna Venkatesh Subject

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Jack Krupansky
You could also consider DataStax Enterprise, which integrates Apache Cassandra as the primary database and Solr for indexing and query. See: http://www.datastax.com/what-we-offer/products-services/datastax-enterprise -- Jack Krupansky -Original Message- From: Ali Nazemian Sent

Re: Any Solrj API to obtain field list?

2014-05-27 Thread Jack Krupansky
You might consider an update request processor as an alternative. It runs on the server and might be simpler. You can even use the stateless script update processor to avoid having to write any custom Java code. -- Jack Krupansky -Original Message- From: T. Kuro Kurosaka Sent

Re: Regex with local params is not working

2014-05-28 Thread Jack Krupansky
Post the parsed query itself. Yes, edismax should always generate a disjunctionmaxquery - in addition to the regexquery. -- Jack Krupansky -Original Message- From: Lokn Sent: Wednesday, May 28, 2014 1:53 AM To: solr-user@lucene.apache.org Subject: Re: Regex with local params is not

Re: Solr Cell Tika - date.formats

2014-05-28 Thread Jack Krupansky
HH:mm:ss'Z' -MM-dd'T'HH:mm:ss -MM-dd -MM-dd hh:mm:ss -MM-dd HH:mm:ss EEE MMM d hh:mm:ss z EEE, dd MMM HH:mm:ss zzz , dd-MMM-yy HH:mm:ss zzz EEE MMM d HH:mm:ss See: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+

Re: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches?

2014-05-29 Thread Jack Krupansky
https://cwiki.apache.org/confluence/display/solr/Working+with+Enum+Fields -- Jack Krupansky -Original Message- From: Elran Dvir Sent: Thursday, May 29, 2014 2:50 AM To: solr-user@lucene.apache.org Subject: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches? H

Re: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches?

2014-05-29 Thread Jack Krupansky
cases, but my personal imagination is not at issue for this particular thread. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Thursday, May 29, 2014 9:46 AM To: solr-user@lucene.apache.org Subject: Re: wildcard matches in EnumField - what do I need to change in code to

Re: Block document for certain search words

2014-05-30 Thread Jack Krupansky
Explain your use case a little more, but you can define terms as stop words with a stop filter, which means they won't appear in the index. -- Jack Krupansky -Original Message- From: Christian Loock Sent: Friday, May 30, 2014 5:38 AM To: solr-user@lucene.apache.org Subject:

Re: How to check and handle empty Solr query

2014-05-30 Thread Jack Krupansky
Consider using the edismax query parser, which allows you to specify q.alt which is an alternative query such as "*:". Or use an application layer to handle the situation before it gets to Solr. -- Jack Krupansky -Original Message- From: vit Sent: Friday, May 30, 2014

Re: search component needs access to results of previous component

2014-05-30 Thread Jack Krupansky
rks Search Solr-based product. It re-executes enhanced queries in a query component. I don't think the source is available though. And I don't recall if it was cloud-enabled. See: http://docs.lucidworks.com/display/lweug/Queries+and+Relevance -- Jack Krupansky -Original Message

Re: How to build Solr4.0 Final?

2014-05-30 Thread Jack Krupansky
Make sure you checkout from the "tagged" branch, not trunk or the actual release branch which has updates since the release. See: http://svn.apache.org/viewvc/lucene/dev/tags/ -- Jack Krupansky -Original Message- From: eShard Sent: Friday, May 30, 2014 11:55 AM To:

Re: Uneven shard heap usage

2014-05-31 Thread Jack Krupansky
When you restart, how long does it take it hit the problem? And how much query or update activity is happening in that time? Is there any other activity showing up in the log? If you bring up only a single node in that problematic shard, do you still see the problem? -- Jack Krupansky

Re: Solr 4.8 synonyms expansion for each primary term

2014-06-02 Thread Jack Krupansky
x27;t dug into the new REST API to know the answer. -- Jack Krupansky -Original Message- From: Archana R Sent: Monday, June 2, 2014 10:44 AM To: solr-user@lucene.apache.org Subject: Solr 4.8 synonyms expansion for each primary term we recently upgraded to Solr 4.8 and we are using REST A

Re: change in EnumField configuration - what do you think?

2014-06-02 Thread Jack Krupansky
docvalues, right? -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Monday, June 2, 2014 10:46 AM To: solr-user@lucene.apache.org Subject: Re: change in EnumField configuration - what do you think? Would both then be supported? I see where it would be easily detectable. And I

Re: solr multi-tenant: anyone use per-tenant synonyms file?

2014-06-02 Thread Jack Krupansky
Try to stay with a separate collection/core for each tenant - otherwise relevancy for document scores gets "polluted" by other tenants, even if you do use filter queries to isolate what documents get returned for a tenant in a multi-tenant core. -- Jack Krupansky -Origin

Re: suspect SOLR query from D029 (SOLR master)

2014-06-03 Thread Jack Krupansky
11 * 11 or 121 query terms, which shouldn't be so bad. But... maybe the Lucene FST for your synonym list is huge. Someone with deeper Lucene knowledge would have to address that. -- Jack Krupansky -Original Message- From: Branham, Jeremy [HR] Sent: Tuesday, June 3, 2014 3:57

Re: Nested boosting in map function in solr?

2014-06-03 Thread Jack Krupansky
I think you need to use parameter substitution for those nested queries since the "boost" parameter takes a white-space delimited sequence of function queries. -- Jack Krupansky -Original Message- From: Kamal Kishore Aggarwal Sent: Tuesday, June 3, 2014 2:22 AM To:

Re: Solr maximum Optimal Index Size per Shard

2014-06-03 Thread Jack Krupansky
er se. -- Jack Krupansky -Original Message- From: Vineet Mishra Sent: Tuesday, June 3, 2014 8:43 AM To: solr-user@lucene.apache.org Subject: Solr maximum Optimal Index Size per Shard Hi All, Has anyone came across the maximum threshold document or size wise for each core of solr to hold.

Re: Solr maximum Optimal Index Size per Shard

2014-06-03 Thread Jack Krupansky
Anybody care to forecast when hardware will catch up with Solr and we can routinely look forward to newbies complaining that they indexed "some" data and after only 10 minutes they hit this weird 2G document count limit? -- Jack Krupansky -Original Message- From: Shawn He

Re: Use a field with space in qf

2014-06-04 Thread Jack Krupansky
Unfortunately, field aliasing works above the level of the qf parameter values. Maybe the Lucene guys could suggest a way to forcibly rename a field on disk. -- Jack Krupansky -Original Message- From: devraj.jaiman Sent: Wednesday, June 4, 2014 6:27 AM To: solr-user

Re: Solr maximum Optimal Index Size per Shard

2014-06-04 Thread Jack Krupansky
ation will tell you everything you need to know. -- Jack Krupansky -Original Message- From: Vineet Mishra Sent: Wednesday, June 4, 2014 2:45 AM To: solr-user@lucene.apache.org Subject: Re: Solr maximum Optimal Index Size per Shard Thanks all for your response. I presume this conver

Re: Multivalue wild card search

2014-06-04 Thread Jack Krupansky
Wildcard, fuzzy, and regex query operate on a single term of a single tokenized field value or a single string field value. -- Jack Krupansky -Original Message- From: Ethan Sent: Wednesday, June 4, 2014 6:59 PM To: solr-user Subject: Multivalue wild card search I can't seem to

Re: change in EnumField configuration - what do you think?

2014-06-05 Thread Jack Krupansky
Yeah, leaving holes as reservations for future fill-in seems like a reasonable use case. Why not just have explicit="n" as an XML attribute for . If no explicit attribute is given, then simply use the previous value plus one. No need for a separate then. -- Jack Krupansky ---

Re: Easises way to insatll solr cloud with tomcat

2014-06-05 Thread Jack Krupansky
As has been suggested on other threads here, the HDS distribution of Solr from Heliosearch comes preconfigured with Tomcat, so it really is the "easiest way to install Solr with Tomcat." -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Thursday, June 5, 2014

Re: Does lucene uses tries?

2014-06-05 Thread Jack Krupansky
Lucene uses trie for numeric and date fields: http://lucene.apache.org/core/4_8_0/core/org/apache/lucene/search/NumericRangeQuery.html -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Thursday, June 5, 2014 5:32 AM To: solr-user@lucene.apache.org Subject: Does lucene uses

Re: Date number to text

2014-06-06 Thread Jack Krupansky
ther field to keep it as text. You can also use your own update processor to convert to some other text format. See the stateless script update processor to write a simple JavaScript script. -- Jack Krupansky -Original Message- From: O. Klein Sent: Friday, June 6, 2014 4:32 AM To:

Re: Multivalue wild card search

2014-06-06 Thread Jack Krupansky
Wildcard, fuzzy, and regex query do work on multivalued fields, but on only a single value at a time. Your match cannot occur across values. -- Jack Krupansky -Original Message- From: Ethan Sent: Thursday, June 5, 2014 12:52 PM To: solr-user Subject: Re: Multivalue wild card search

Re: Mapping a field name before queryParser

2014-06-06 Thread Jack Krupansky
Edismax has field aliasing: http://wiki.apache.org/solr/ExtendedDisMax#Field_aliasing_.2F_renaming f.my_alias.qf=actual_field f.brand.qf=brand_name -- Jack Krupansky -Original Message- From: Antoine LE FLOC'H Sent: Friday, June 6, 2014 5:56 PM To: solr-user@lucene.apach

Re: Mapping a field name before queryParser

2014-06-06 Thread Jack Krupansky
Oops... for your example it would be: f.brand_name.qf=brand -- Jack Krupansky -Original Message- From: Antoine LE FLOC'H Sent: Friday, June 6, 2014 6:54 PM To: solr-user@lucene.apache.org Subject: Re: Mapping a field name before queryParser OK thank you. I wanted to see w

Re: Deepy nested structure

2014-06-07 Thread Jack Krupansky
you believe are the most complex queries your users are likely to need. Start by expressing them clearly in simple, plain English, unless the structured query is quite obvious. -- Jack Krupansky -Original Message- From: harikrishna Sent: Friday, June 6, 2014 9:35 AM To: solr-user

Re: Large disjunction query practices

2014-06-09 Thread Jack Krupansky
down. Small is better. -- Jack Krupansky -Original Message- From: Joe Gresock Sent: Monday, June 9, 2014 8:50 AM To: solr-user@lucene.apache.org Subject: Large disjunction query practices I'm wondering what the best practice for large disjunct queries in Solr is. A user wants t

Re: accessing individual elements of a multivalued field

2014-06-09 Thread Jack Krupansky
Not currently. You could have separate explicit fields for the categories such as "cat_1", "cat_2", etc. The data would need to be replicated (possibly using a ), but redundancy to facilitate access is a reasonable approach. -- Jack Krupansky -Original Message-

Re: split field on json update

2014-06-12 Thread Jack Krupansky
You can easily write a JavaScript snippet using the stateless script update processor and do whatever string manipulation you want on an input value, and then write extracted strings to whatever field(s) you want. My e-book has plenty of script examples. -- Jack Krupansky -Original

Re: How to query for content with ACLs?

2014-06-12 Thread Jack Krupansky
Take a look at this: http://www.slideshare.net/lucenerevolution/wright-nokia-manifoldcfeurocon-2011 Karl has an old Jira patch somewhere for doing the ACLs processing in Solr. -- Jack Krupansky -Original Message- From: lalitjangra Sent: Thursday, June 12, 2014 9:28 AM To: solr-user

Re: How to update one field without losing the others?

2014-06-14 Thread Jack Krupansky
Is this a schema in when the text field is actually populated via from other fields? Or maybe not, but the text field may not have been a stored field. It needs to be one of the two. -- Jack Krupansky -Original Message- From: librarymark Sent: Saturday, June 14, 2014 10:40 AM To

Re: docFreq coming to be more than 1 for unique id field

2014-06-17 Thread Jack Krupansky
by some to be a "NoSQL database" and with (near) real-time updates, that model is clearly obsolete. -- Jack Krupansky -Original Message- From: Apoorva Gaurav Sent: Tuesday, June 17, 2014 11:15 AM To: solr-user ; Ahmet Arslan Subject: Re: docFreq coming to be more than 1 for uni

Re: Multivalue wild card search

2014-06-19 Thread Jack Krupansky
1. Wildcards do not work within quoted terms. 2. Spaces in terms need to be escaped. 3. The quotes embedded in a term do not need to be escaped. So, try: q=*["Steve",\ "Wonder",\ ""]] or q=*["Steve",\ "Wonder",\ ""]* -- Jack Krupan

Re: Question about sending solrconfig and schema files with java

2014-06-20 Thread Jack Krupansky
probably not Solr-related. Sorry for the inconvenience! -- Jack Krupansky -Original Message- From: Frederic Esnault Sent: Friday, June 20, 2014 11:50 AM To: solr-user@lucene.apache.org Subject: Re: Question about sending solrconfig and schema files with java Hi Shawn, Actually i should

Re: Question about sending solrconfig and schema files with java

2014-06-20 Thread Jack Krupansky
Oops! Sorry I missed it. Please post of the rest of the info on SO as well. We'll get to it! -- Jack Krupansky -Original Message- From: Frederic Esnault Sent: Friday, June 20, 2014 7:03 PM To: solr-user@lucene.apache.org Subject: Re: Question about sending solrconfig and schema

Re: No results for a wildcard query for text_general field in solr 4.1

2014-06-24 Thread Jack Krupansky
o for this case??) -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Tuesday, June 24, 2014 11:38 AM To: solr-user@lucene.apache.org ; Ahmet Arslan Subject: Re: No results for a wildcard query for text_general field in solr 4.1 Wildcards are a tough thing to get your head a

Re: Does one need to perform an optimize soon after doing a batch indexing using SolrJ ?

2014-06-24 Thread Jack Krupansky
guide? -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Tuesday, June 24, 2014 11:46 AM To: solr-user@lucene.apache.org Subject: Re: Does one need to perform an optimize soon after doing a batch indexing using SolrJ ? Your indexing process looks fine, there's no r

Re: Does updating a child document destroy the parent - child relationship

2014-06-24 Thread Jack Krupansky
limitation does not appear to be documented in the Solr ref guide. -- Jack Krupansky -Original Message- From: Vinay B, Sent: Tuesday, June 24, 2014 10:40 PM To: solr-user Subject: Does updating a child document destroy the parent - child relationship When I edit a child document, a

<    1   2   3   4   5   6   7   8   9   10   >