Re: Indexing PDF files with Solr 6.6 while allowing highlighting matched text with context

2017-06-19 Thread ZiYuan
Hi Erick, thanks very much for the explanations! Clarification for question 2: more specifically I cannot see the field content in the returned JSON, with the the same definitions as in the post

Re: Indexing PDF files with Solr 6.6 while allowing highlighting matched text with context

2017-06-19 Thread Erik Hatcher
Ziyuan - You may be interested in the example/files that ships with Solr too. It’s got schema and config and even UI for file indexing and searching. Check it out README.txt under example/files in your Solr install. Erik > On Jun 19, 2017, at 6:52 AM, ZiYuan wrote: > > Hi Erick, >

RE: Indexing PDF files with Solr 6.6 while allowing highlighting matched text with context

2017-06-19 Thread Allison, Timothy B.
> There is no standard across different types of docs as to what meta-data > field is >> included. PDF might have a "last_edited" field. Word might have a >> "last_modified" field where the two mean the same thing. On Tika, we _try_ to normalize fields according to various standards, the most

Give boost only if entire value is present in Query

2017-06-19 Thread Aman Deep Singh
Hi, I have a problem ,I need to give the boost to a particular field if and only if the query contains entire field value (String contains like feature). e.g. if Field value is ABC DEF It should match if user query is like ABC DEF XYZ or XYZ ABC DEF, But it should not match when user query is like

Re: Facet is not working while querying with group

2017-06-19 Thread Aman Deep Singh
I tried to recreate the collection and its working fine, But if i try to change the any field level value this error again comes Is their any roadmap to avoid the remanent data issue, since every time you change the field definition you need to delete the data directory or recreate the collection.

Re: Indexing PDF files with Solr 6.6 while allowing highlighting matched text with context

2017-06-19 Thread ZiYuan
Hi Erick, Now it is clear. I have to update the request handler of /update/extract/ from "defaults":{"fmap.content":"_text_"} to "defaults":{"fmap.content":"content"} to fill the field. Hope that there is no side effect of not mapping the PDF content to _text_. Thank you for the hint. Best regar

Re: Give boost only if entire value is present in Query

2017-06-19 Thread Susheel Kumar
In general, the documents which has more or all terms matched against query terms will be boosted higher per lucene tf/idf scoring. So for document having ABC DEF queries like ABC DEF XYZ or XYZ ABC DEF will find a match(assuming q.op=AND) and will be boosted higher compare to documents with ABC

how to leave the mailing list? eof

2017-06-19 Thread david fernandes

Re: how to leave the mailing list? eof

2017-06-19 Thread ZiYuan
You can check this page: http://lucene.apache.org/solr/community.html On Mon, Jun 19, 2017 at 5:22 PM, david fernandes wrote: > >

Re: Multiple hashJoin or innerJoin

2017-06-19 Thread Joel Bernstein
These are MapReduce joins so you have to stream all the records. You definitely will not be able to stream 100 fields. So you'll need to come up with a strategy that streams the minimum number fields needed to perform the join. You can use the fetch expression to fetch additional fields following t

Re: Give boost only if entire value is present in Query

2017-06-19 Thread alessandro.benedetti
Isn't this a case where you don't want the query parser to split by space before the analyser ? Take a look to the "sow" param for the edismax query parser. In your case you should be ok but Be aware that is not a silver bullet for everything and that other problems could arise in similar scenarios

Re: Give boost only if entire value is present in Query

2017-06-19 Thread Aman Deep Singh
Yes Susheel , I know that more token matched gives more weight but in my case if entire match I want around x times boost but in case of partial match I want to give nominal boost or normal boost, Now in case of keyword token ziet or phrase query they work if and only if the user query is exactly m

Re: Give boost only if entire value is present in Query

2017-06-19 Thread Aman Deep Singh
Yes alessandro, I know that their us some downsight of using sow =false but if don't use it then neither shingle nor bhram will work ,and these are required in my case/setup On 19-Jun-2017 8:18 PM, "alessandro.benedetti" wrote: Isn't this a case where you don't want the query parser to split by

Re: Indexing PDF files with Solr 6.6 while allowing highlighting matched text with context

2017-06-19 Thread Erick Erickson
bq: Hope that there is no side effect of not mapping the PDF Well, yes it will have that side effect. You can cure that with a copyField directive from content to _text_. But do really consider running this as a SolrJ program on the client. Tim knows in far more painful detail than I do what kind

Re: Facet is not working while querying with group

2017-06-19 Thread Erick Erickson
bq: Is their any roadmap to avoid the remanent data issue, Not that I've ever heard of. Well, Uwe did show a process for adding docValues to an existing index here: http://lucene.472066.n3.nabble.com/Adding-Docvalues-to-a-Field-td4333503.html but you can see what kinds of deep-level Lucene knowled

Re: Indexing PDF files with Solr 6.6 while allowing highlighting matched text with context

2017-06-19 Thread ZiYuan
Dear Erick and Timothy, yes I will parse from the client for all the benefits. I am just trying to figure out what is going on by indexing one or two PDF files first. Thank you both. Best regards, Ziyuan On Mon, Jun 19, 2017 at 6:17 PM, Erick Erickson wrote: > bq: Hope that there is no side ef

Re: Give boost only if entire value is present in Query

2017-06-19 Thread Rick Leir
Aman, Use a copyfield so you can have a second field that uses a different analysis chain. In the new field you just created for the copyfield, use the lowercase type, or create a type using KeywordTokenizer in the analysis chain. Then match on the original field, and boost based on the new fiel

Re: shards are going down frequently

2017-06-19 Thread ramesh.pallapo...@fedex.com
Can any one please suggest on this - Thanks, Ramesh. -- View this message in context: http://lucene.472066.n3.nabble.com/shards-are-going-down-frequently-tp4340652p4341802.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Give boost only if entire value is present in Query

2017-06-19 Thread Aman Deep Singh
Sorry Rick, I didn't get it. How will you give boost on other field when querying on some other field. Or if you query on copy keyword tokenized field then it only matched when entire query is matched. On 20-Jun-2017 1:52 AM, "Rick Leir" wrote: Aman, Use a copyfield so you can have a second fie

Re: Multiple hashJoin or innerJoin

2017-06-19 Thread Zheng Lin Edwin Yeo
Hi Joel, Take this example query with more details: innerJoin(innerJoin( search(people, q=field1a:A AND field1b:B, fl= "personId,personNme,field1a,field1b", sort="personId asc", qt="/export"), search(pets, q=field2a:A AND field2b:B, fl= "petsId,petName,field2a,field2b", sort="personId asc", q

SolrException: Error trying to proxy request for url: solr/sync-status/admin/system

2017-06-19 Thread S G
Hi, We are stuck in a strange problem. Whole cluster is red. All nodes are being shown as down. Restart of the nodes is not helping either. All our nodes seem to have gone into a distributed lock. Here is the grep command I ran on all the solr.log files: grep "Error trying to proxy request"

Re: Will Solr support google like organic search ?

2017-06-19 Thread Geepalem
Thanks Toke! We are collecting inputs from clients on this. It seems client wants to show advertisement kind of results along with relevance based results. So, is there any way to do this in Solr? Can any one respond on this please? Thanks, G. Naresh Kumar -- View this message in context: ht

Re: Facet is not working while querying with group

2017-06-19 Thread Aman Deep Singh
Again the same problem started to occur and I haven't change any schema It's only coming to the Numeric data types only (tint,tdouble) and that too in group query only If I search with string field type it works fine. Steps which i have followed 1. drop the old collection 2. create the new

Re: Will Solr support google like organic search ?

2017-06-19 Thread Alexandre Rafalovitch
I think you are still several steps away from having an actual Solr question. Yes, you could use Solr to search a different data set (Ads). The devil is in the details. Where do those ads come from, what do they match (same keywords as search?), how are they ranked (for Google by Auction, I think)