Re: Import from Solr 4.10.4 into Solr 5.2.1 using DIH and SolrEntityProcessor

2015-07-10 Thread Shawn Heisey
On 7/10/2015 5:47 PM, Jens Ivar Jørdre wrote: > I wonder if someone may shed some light on an issue I have using DIH and > SolrEntityProcessor. In short I am unable to import data from Solr 4.10.4 to > 5.2.1 where collection configurations in each have the very same schema.xml > and solrconfig.x

Re: Auto-suggest in Solr

2015-07-10 Thread Alessandro Benedetti
Hi guys, I just wrote a blog post to complement Erick's post and to explain in detail, with practical examples, all the main Lookup implementations: http://alexbenedetti.blogspot.co.uk/2015/07/solr-you-complete-me.html I think this can be useful for Edwin to finally fix the config for the FreeTextSuggest

Trouble getting a solr join query done

2015-07-10 Thread Yusnel Rojas García
I have 2 indexes, products { id, name, type_id .. } and categories { id, name .. }, and I want to get all categories that match a name and have products in them. My best guess would be: http://localhost:8983/solr/categories/select?q=clothes&fl=*,score&fq={!join from=type_id to=

Import from Solr 4.10.4 into Solr 5.2.1 using DIH and SolrEntityProcessor

2015-07-10 Thread Jens Ivar Jørdre
Hi, I wonder if someone may shed some light on an issue I have using DIH and SolrEntityProcessor. In short I am unable to import data from Solr 4.10.4 to 5.2.1 where collection configurations in each have the very same schema.xml and solrconfig.xml. I have two Solr Cloud instances running. On

Re: Planning Solr migration to production: clean and autoSoftCommit

2015-07-10 Thread Wenbin Wang
Hi Erick, Scheduling the indexing job is not an issue. The question is how to push the index to the other two slave instances, given that polling from those two slave instances needs to be controlled. In the first option you proposed, I need to detect if the indexing job has completed, and force replic

how to index filesystem using solr

2015-07-10 Thread Mugeesh Husain
Hi, I am planning to index 18 lakh (1.8 million) document files in a file system using Solr. Could you give me a sample or documentation? Please advise whether DataImportHandler is suitable for this case or whether I have to use SolrJ. Thank you for your help. Thanks and regards, Mugeesh

Re: Planning Solr migration to production: clean and autoSoftCommit

2015-07-10 Thread Erick Erickson
bq: The re-indexing is going to be every 4 hours or even every 2 hours a day, so it is not rare. Manually managing replication is not an option Why not? Couldn't this all be done from a shell script run via a cron job? On Fri, Jul 10, 2015 at 11:03 AM, wwang525 wrote: > Hi Erick, > > It is Solr
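The cron-driven approach suggested above can be sketched roughly as follows. This is a minimal sketch, not the poster's actual script: it assumes old-style master/slave replication with the standard ReplicationHandler mounted at `/replication`, and the host names and the core name `products` are hypothetical placeholders. The `fetchindex` command asks a slave to pull the latest index from its configured master.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def replication_url(base_url, core, command):
    """Build a ReplicationHandler URL such as .../solr/<core>/replication?command=..."""
    query = urlencode({"command": command, "wt": "json"})
    return "%s/%s/replication?%s" % (base_url.rstrip("/"), core, query)


def fetch_index_on_slaves(slave_base_urls, core, opener=urlopen):
    """Ask each slave to pull the index from its master; returns HTTP status per slave."""
    results = {}
    for base in slave_base_urls:
        url = replication_url(base, core, "fetchindex")
        with opener(url) as resp:
            results[base] = resp.status
    return results


if __name__ == "__main__":
    # Hypothetical slave hosts; replace with real ones before use.
    slaves = ["http://slave1:8983/solr", "http://slave2:8983/solr"]
    for base in slaves:
        # Only print the URLs here; call fetch_index_on_slaves() once the
        # indexing job on the master has finished.
        print(replication_url(base, "products", "fetchindex"))
```

A script like this could be run from cron right after the indexing job completes (for example every 4 hours), which keeps replication timing under the control of the job rather than the slaves' polling interval.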

Re: Lingo3g-Solr integration - ClassNotFoundException: com.google.common.base.MoreObjects

2015-07-10 Thread Shawn Heisey
On 7/10/2015 10:09 AM, Mandris, Collin wrote: > Hello, > > I am trying to integrate Lingo3g with Solr. I have arrived at a > ClassNotFoundException error using Lingo3g (version 1.12.0) with Solr 4.8.0. > I located the guava-18.0.jar, which contains the > com.google.common.base.MoreObject

Re: Custom Solr caches in a FunctionQuery that emulates the ExternalFileField

2015-07-10 Thread Upayavira
Hi Erick, You are right that I could actually be asking for a stored field. That's an exceptionally good point, and yes, would suck. Better would be to retrieve a docValue from document. I'll look into that. Upayavira On Fri, Jul 10, 2015, at 06:28 PM, Erick Erickson wrote: > Upayavira: > > bq:

RE: Lingo3g-Solr integration - ClassNotFoundException: com.google.common.base.MoreObjects

2015-07-10 Thread Mandris, Collin
Adding the Guava jar to {instanceDir}/lib seems to have fixed Solr not loading it, but now I am getting another error that I cannot pin down. The trace for this one is: java.lang.RuntimeException: java.lang.IllegalAccessError: tried to access class com.carrotsearch.hppc.BitUtil from class com.ca

Re: Planning Solr migration to production: clean and autoSoftCommit

2015-07-10 Thread wwang525
Hi Erick, It is Solr 4.7. For the time being, we are considering the old-style master/slave configuration. The re-indexing is going to run every 4 hours or even every 2 hours, so it is not rare. Manually managing replication is not an option. Is there any other easy-to-manage option? Thank

Re: Planning Solr migration to production: clean and autoSoftCommit

2015-07-10 Thread Erick Erickson
You're confusing a couple of things here. First, I'm assuming that you are NOT using SolrCloud, but older-style master/slave. If that's not true, disregard the rest of this. autoSoftCommit is _local_ and has nothing to do with changing the Lucene segments. And since you're not searching on the ma

Re: Lingo3g-Solr integration - ClassNotFoundException: com.google.common.base.MoreObjects

2015-07-10 Thread Erick Erickson
Have you tried adding another directive to solrconfig.xml? Best, Erick On Fri, Jul 10, 2015 at 9:09 AM, Mandris, Collin wrote: > Hello, > > I am trying to integrate Lingo3g with Solr. I have arrived at a > ClassNotFoundException error using Lingo3g (version 1.12.0) with Solr 4.8.0. >

Re: Custom Solr caches in a FunctionQuery that emulates the ExternalFileField

2015-07-10 Thread Erick Erickson
Upayavira: bq: retrieve the value of field Y for that doc If this is fetching the stored field it's going to be horrible as it'll probably read/decompress a 16K block each time. Yccck. If you can read the value from a DocValues field (or, indeed, any indexed field which would only really work

Planning Solr migration to production: clean and autoSoftCommit

2015-07-10 Thread wwang525
Hi, The following questions are about the basic configuration options in production. We will have three machines: one indexing instance (master) and two Solr instances (in different machines) for searching purpose. This way, we will always have two Solr instances dedicated for executing search r

Re: SolrJ/Tika custom indexer not indexing CERTAIN .doc text? | SIDENOTE

2015-07-10 Thread Erick Erickson
Tim: Thanks! I've prompted the folks at LW to see what's up with blog comments and I'll add your suggestion to the blog (with attribution of course) Best, Erick On Fri, Jul 10, 2015 at 5:41 AM, Allison, Timothy B. wrote: >>>Wow, that code looks familiar ;)... > > Erick and Paden, > The follo

Re: Get content in response from ExtractingRequestHandler

2015-07-10 Thread Erick Erickson
In a word, no. If you don't store the data it is completely gone with no chance of retrieval. There are a couple of things to think about though 1> The original doc must exist somewhere. Store some kind of URI in Solr that you can use to retrieve the original doc on demand. 2> Go ahead and store

Lingo3g-Solr integration - ClassNotFoundException: com.google.common.base.MoreObjects

2015-07-10 Thread Mandris, Collin
Hello, I am trying to integrate Lingo3g with Solr. I have arrived at a ClassNotFoundException error using Lingo3g (version 1.12.0) with Solr 4.8.0. I located the guava-18.0.jar, which contains the com.google.common.base.MoreObjects class, and have tried putting it in multiple locations

Re: Custom Solr caches in a FunctionQuery that emulates the ExternalFileField

2015-07-10 Thread Upayavira
Mikhail, Thanks for pointing this out. I'd say that ticket is in distinct need of some examples or use-cases. It is extremely hard to work out what "scoring" actually means. What is used to score what? It'd be great to see some examples and some explanations as to what effect those examples have

Re: Best way to facets with value preprocessing (w/ docValues)

2015-07-10 Thread Konstantin Gribov
Hello again. My phrase about TokenStream reuse was incorrect since it's reused even in TextField case, so Analyzer#createComponents() is called infrequently. But analysis itself takes some time (in case of StrField it's trivial, just calling `toInternal`). One suspicious moment is that solr.StrFi

Re: Custom Solr caches in a FunctionQuery that emulates the ExternalFileField

2015-07-10 Thread Mikhail Khludnev
I've heard that people use https://issues.apache.org/jira/browse/SOLR-6234 for such purpose - adding scores from fast moving core to the bigger slow moving one On Fri, Jul 10, 2015 at 4:54 PM, Upayavira wrote: > All, > > I have knocked up what I think could be a really cool function query - > it

RE: Protwords in solr spellchecker

2015-07-10 Thread Dyer, James
Kamal, Given the constraint that you cannot re-index the data, your best bet might be to simply filter out the suggestions at the application level, or maybe even have a proxy do it. Possibly another option, you might be able to extend DirectSolrSpellChecker and override #getSuggestions(), cal
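Filtering at the application level, as suggested above, could look roughly like this. It is only a sketch under stated assumptions: the suggestions are assumed to have already been extracted from the spellcheck response into a plain list of strings, and `filter_suggestions` and the sample word list are hypothetical names used for illustration.

```python
def filter_suggestions(suggestions, blocked_words):
    """Drop any suggestion that contains a blocked word (case-insensitive, whole tokens)."""
    blocked = {word.lower() for word in blocked_words}
    kept = []
    for suggestion in suggestions:
        tokens = suggestion.lower().split()
        # Keep the suggestion only if none of its tokens is blocked.
        if not any(token in blocked for token in tokens):
            kept.append(suggestion)
    return kept


if __name__ == "__main__":
    # Hypothetical data standing in for a parsed spellcheck response.
    print(filter_suggestions(["red shoes", "badword shoes", "Badword"], ["badword"]))
```

Because the filter runs after Solr has responded, it needs no re-indexing and changes to the blocked list take effect immediately; the trade-off is that Solr may return fewer usable suggestions than requested.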

Custom Solr caches in a FunctionQuery that emulates the ExternalFileField

2015-07-10 Thread Upayavira
All, I have knocked up what I think could be a really cool function query - it allows you to retrieve a value from another core (much like a pseudo join) and use that value during scoring (much like an ExternalFileField). Examples: * Selective boosting of documents based upon a category based va

Re: Spell checking the synonym list?

2015-07-10 Thread Ryan Yacyshyn
Thanks both! James, I like that approach. I'll give it a try. I forgot to mention I was only using query-time synonyms but shouldn't be a problem in my case to add synonyms during index-time. Ryan On Thu, 9 Jul 2015 at 22:07 Dyer, James wrote: > Ryan, > > If you use index-time synonyms on th

Re: solr 5 question

2015-07-10 Thread Shawn Heisey
On 7/10/2015 6:37 AM, seunghun@leaseplan.com wrote: > How to put security on Solr 5 on Windows 7? > > > I am quite new to Solr 5 and am running it on Windows 7. > I was able to index some files and search through /browse on other > computers as well. > What I am trying to do is putting secur

Re: Performance of q.alt vs. fq

2015-07-10 Thread Shawn Heisey
On 7/10/2015 5:55 AM, Thomas Seidl wrote: > I am working a lot with Drupal and Apache Solr. There, we implemented a > performance improvement that would, for filter-only queries (i.e., no > "q" parameter, just "fq"s) instead move the filters to the "q.alt" > parameter (idea based on this blog post

Re: Solr 5 Logging with log4j2 possible?

2015-07-10 Thread Shawn Heisey
On 7/10/2015 3:34 AM, Oliver Schrenk wrote: > I see (1) that Solr 5 uses log4j1.2 for logging. As we have some bad > experiences especially with the syslog implementation supplied by log4j we > wanted to use log4j2 . Does anybody have experiences with using the log4j > bridge (2)? Is it possible

RE: SolrJ/Tika custom indexer not indexing CERTAIN .doc text? | SIDENOTE

2015-07-10 Thread Allison, Timothy B.
>>Wow, that code looks familiar ;)... Erick and Paden, The following is not the source of your problem, but I thought I'd mention it while you reference Erick's fantastic blog post on solrj (http://lucidworks.com/blog/indexing-with-solrj/). I tried to comment on Erick's blog post, but someth

solr 5 question

2015-07-10 Thread Seunghun . Han
How to put security on Solr 5 on Windows 7? I am quite new to Solr 5 and am running it on Windows 7. I was able to index some files and search through /browse on other computers as well. What I am trying to do is put security on Solr, because other computers could access the admin page. 1

Re: Using edismax in a filter query

2015-07-10 Thread Thomas Seidl
Hi Ahmet, Brilliant, thanks a lot! I thought it might be possible with local parameters, but couldn't find any information anywhere on how (especially setting the multi-valued "qf" parameter). Thanks again, Thomas On 2015-07-10 14:09, Ahmet Arslan wrote: > Hi Tomasi > > Yes it is possible, plea

Re: Using edismax in a filter query

2015-07-10 Thread Ahmet Arslan
Hi Tomasi Yes it is possible, please see local params : https://cwiki.apache.org/confluence/display/solr/Local+Parameters+in+Queries fq={!edismax qf='field1 field2 field'}search key Ahmet On Friday, July 10, 2015 2:20 PM, Thomas Seidl wrote: Hi all, I was wondering if there's any way to u
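The local-params form shown above can be assembled programmatically. The following is a minimal sketch, assuming the filter is sent as an ordinary `fq` request parameter; the helper names and the example field names are hypothetical, and it does not escape single quotes inside field names or special characters in the keywords.

```python
from urllib.parse import urlencode


def edismax_filter(keywords, fields):
    """Build an fq value that runs `keywords` through edismax over `fields`."""
    qf = " ".join(fields)
    return "{!edismax qf='%s'}%s" % (qf, keywords)


def build_query_string(main_query, keywords, fields):
    """URL-encode q plus the edismax filter so it can be appended to /select?"""
    params = [("q", main_query), ("fq", edismax_filter(keywords, fields))]
    return urlencode(params)


if __name__ == "__main__":
    print(edismax_filter("search key", ["field1", "field2"]))
    print(build_query_string("*:*", "search key", ["field1", "field2"]))
```

The same string can be used as a `facet.query` value, which is the use case raised in the original question; URL-encoding matters because the braces, quotes, and spaces in the local params are not safe in a raw URL.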

Re: Protwords in solr spellchecker

2015-07-10 Thread Alessandro Benedetti
So let's try to analyse the situation from the spellchecking point of view. First of all we follow David's suggestion and add the StopWordsFilter, with our configured "bad" words, to the query-time analysis. *Starting scenario* - we have the protected words in our index, we still want them to be

Performance of q.alt vs. fq

2015-07-10 Thread Thomas Seidl
Hi all, I am working a lot with Drupal and Apache Solr. There, we implemented a performance improvement that would, for filter-only queries (i.e., no "q" parameter, just "fq"s) instead move the filters to the "q.alt" parameter (idea based on this blog post [1]). [1] https://web.archive.org/web/20

Using edismax in a filter query

2015-07-10 Thread Thomas Seidl
Hi all, I was wondering if there's any way to use the Extended DisMax query parser in an "fq" filter query? The problem is that I have a "facet.query" with which I want to check whether a certain set of keywords would have any results. But since the normal query goes across multiple fields, I end

Re: Solr Grouping - sorting groups based on the sum of the scores of the documents within each group

2015-07-10 Thread Alessandro Benedetti
Hi Emilio, looking at the Stack Overflow question, there were a couple of solutions using faceting. So my first question: is faceting good for you? (Faceting is different from grouping, as you will only see the counts.) Do you need only to sort by that value, or to return it in each group? Cheers

Solr 5 Logging with log4j2 possible?

2015-07-10 Thread Oliver Schrenk
Hi, I see (1) that Solr 5 uses log4j1.2 for logging. As we have some bad experiences especially with the syslog implementation supplied by log4j we wanted to use log4j2 . Does anybody have experiences with using the log4j bridge (2)? Is it possible to switch? Cheers. Oliver (1) https://cwiki

Re: Protwords in solr spellchecker

2015-07-10 Thread davidphilip cherian
Hi Kamal, Not necessarily. You can have different filters applied at index time and query time. (Note that the order in which filters are defined matters.) You could just add the stop filter at query time. Have your own custom data type defined (similar to 'text_en' that will be in schema.xml) and

Re: Protwords in solr spellchecker

2015-07-10 Thread Kamal Kishore Aggarwal
Hi David, This one is a good suggestion. But if I add these *adult* keywords to the stopwords.txt file, it will require re-indexing the data related to these keywords. How can I see the change instantly? Is there any other approach that you can suggest? On Thu, Jul 9, 2015 at 1