Re: Using Customized sorting in Solr

2012-04-26 Thread solr user
Jan, Thanks for the response, I though of using it, but it will be suboptimal to do this in the scenario I have. I guess I have to explain the scenario better, let me try it again:- 1. I have importance based buckets in the system, this is implemented using a variable named bucket_count having i

Re: HTTP Auth and Distributed Search?

2012-04-26 Thread Lance Norskog
I believe you can set up certificates. You then store the certificates in a Java keyring file, and tell Java about the keyring at startup. Now, when you make an HTTP connection, the HTTP library automatically uses the certificates. You don't need any custom code in the http client. (I think this i

Benchmark Solr vs Elastic Search vs Sensei

2012-04-26 Thread Volodymyr Zhabiuk
Hi Solr users I've implemented the project to compare the performance between Solr, Elastic Search and SenseiDB https://github.com/vzhabiuk/search-perf the Solr version 3.5.0 was used. I've used the default configuration, just enabled json updates and used the following schema https://github.com/

Re: Question on Facet counts by grouped results

2012-04-26 Thread Erick Erickson
Yes, stopwords_en.txt. Or go into your schema file and find the usages of stopwords_en.txt and change it to a stopwords file in your setup. Best Erick On Thu, Apr 26, 2012 at 5:15 PM, Sohail Aboobaker wrote: > Hi, > > I am trying nightly build for solr 4.0. I downloaded the build and am > able

Re: impact of EdgeNGramFilterFactory on indexing process?

2012-04-26 Thread Erick Erickson
1> yes. EdgeNGram will inevitably increase the number of tokens in your index, lengthening your index time. How much? "some", but that means you'll have to try it to see if it's unacceptable. Some people can't take an increase of 10%. Some can take a 100% increase. 2> No. It will increase the

Re: Per-User Sorting on an ExternalFileField

2012-04-26 Thread Phill Tornroth
So, I did just issue: &sort=sub(my_user_score_field,0)+desc It got me past the error, but still doesn't appear to be actually using the values to sort. Any ideas as to why? Phill On Thu, Apr 26, 2012 at 4:35 PM, Stephane Bailliez wrote: > On Fri, Apr 27, 2012 at 12:07 AM, Phill Tornroth >w

Re: Per-User Sorting on an ExternalFileField

2012-04-26 Thread Stephane Bailliez
On Fri, Apr 27, 2012 at 12:07 AM, Phill Tornroth wrote: > I'm using Solr 3.5. Does anyone have a suggestion as to how to end up > adding this extra dimension so that I can do per-user relevance? It seems > like an oft-asked, rarely-answered question. > Use a function that make use of your externa

Per-User Sorting on an ExternalFileField

2012-04-26 Thread Phill Tornroth
I'm trying pretty hard to come up with a solution that lets me sort by per-user scores that I calculate based on my data. Today, I'm trying to use a combination of ExternalFileField and dynamic fields, where the presumption is that each user might have their own file full of scores. I think the fie

Re: Using Customized sorting in Solr

2012-04-26 Thread Jan Høydahl
Hi, How about trying grouping with paging? First you do group=true&group.field=advertiserId&group.limit=1&group.offset=0&group.main=true&sort=something&group.sort=how-much-paid desc That gives you one listing per advertiser, sorted the way you like. Then to grab the next batch of ads, you go gr

Re: HTTP Auth and Distributed Search?

2012-04-26 Thread Michael Della Bitta
Really? Is that in a .properties file somewhere, or would I have to do it in code? I was sort of hoping I'd be able to add the credentials to the URL in the shards field, but looking at the source, that won't fly. While we're on the topic, it might be nice to be able to specify the connection sche

Re: HTTP Auth and Distributed Search?

2012-04-26 Thread Mark Miller
On Apr 26, 2012, at 5:25 PM, Michael Della Bitta wrote: > Hi, > > I'm wondering if there's any way to use container-based HTTP auth and > Distributed Search configured in the SearchHandler that I haven't > discovered aside from writing my own shard handler implementation. > > Thanks, > > Micha

HTTP Auth and Distributed Search?

2012-04-26 Thread Michael Della Bitta
Hi, I'm wondering if there's any way to use container-based HTTP auth and Distributed Search configured in the SearchHandler that I haven't discovered aside from writing my own shard handler implementation. Thanks, Michael

Re: Solr4 CoreContainer failed to load with older version of Slf4j 1.5.2

2012-04-26 Thread Mark Miller
I also ran into a problem using 1.6.1 - thats the breaks of progress I guess ;) On Thu, Apr 26, 2012 at 4:07 PM, Gopal Patwa wrote: > I am using Solr4 nightly build apache-solr-4.0-2012-04-26_08-10-58 and I > saw Slf4j version was upgraded to 1.6.4 and it is failing now to start > Solr, if I wan

Re: Question on Facet counts by grouped results

2012-04-26 Thread Sohail Aboobaker
Hi, I am trying nightly build for solr 4.0. I downloaded the build and am able to start it. In 3.x, I copied the example directory and updated the schema.xml. It worked fine but in 4.0, I did the same thing (make a copy of example) but when I change the schema, I get following: Apr 26, 2012 5:04:

Re: QueryElevationComponent and distributed search

2012-04-26 Thread Mark Miller
On Apr 26, 2012, at 2:56 PM, srinir wrote: > Can anyone help me out in understand the fix to QueryElevationComponent (in > Solr 4.0) to make it work for distributed search. > > https://github.com/apache/lucene-solr/commit/229ed68c31b346611c505ca9766871cec713a850 - Mark Miller lucidimaginatio

Re: solr error after relacing schema.xml

2012-04-26 Thread BillB1951
I'm using haystack 2.0.0Beta, and Apache-Solr-3.6.0. I'm not sure how to determine the schema.xml version, but I do notice that the solr example's schema.xml is - and the schema.xml generated by haystack is . Can I specify another schema.xml generator for haystack? If so , where?

Re: Solr for routing a webapp

2012-04-26 Thread Paul Libbrecht
Or write your own query component mapping /solr/* in the web.xml, exposing the request by a thread-local through a filter, and reading this setting the appropriate query parameters... Performance-wise, this seems quite reasonable I think. paul Le 26 avr. 2012 à 16:58, Paul Libbrecht a écrit :

Re: QueryElevationComponent and distributed search

2012-04-26 Thread srinir
Can anyone help me out in understand the fix to QueryElevationComponent (in Solr 4.0) to make it work for distributed search. -- View this message in context: http://lucene.472066.n3.nabble.com/QueryElevationComponent-and-distributed-search-tp3936998p3942221.html Sent from the Solr - User mailing

Re: searchable solr user mail archive

2012-04-26 Thread Chris Hostetter
: Is there a searchable archive for solr user emails available somewhere : to avoid questions already asked on list? https://wiki.apache.org/solr/SolrResources#Mailing_List_Archives Or just use the search box in the top right corner of the main solr website... http://lucene.apache.org/solr/

Re: solr error after relacing schema.xml

2012-04-26 Thread Lance Norskog
Which version of Solr does Haystack expect? The schema builder might be targeting an older version of Solr. On Thu, Apr 26, 2012 at 10:47 AM, Mark Miller wrote: > By default logging goes to std out. You probably want to configure real > logging though: http://wiki.apache.org/solr/SolrJetty#Loggi

searchable solr user mail archive

2012-04-26 Thread Sohail Aboobaker
Hi, Is there a searchable archive for solr user emails available somewhere to avoid questions already asked on list? Sohail

Setting FuzzyConfig's prefixLength ?

2012-04-26 Thread Phill Tornroth
I'd like to change Lucene's FuzzyConfig prefixLength from it's default value of 0. Is there a way to configure that via Solr somehow? I've noticed references on the list to people recompiling lucene from source in order to change this value, and I'm hoping not to need to resort to the same. Thanks

Re: Question on Facet counts by grouped results

2012-04-26 Thread Mark Miller
On Apr 26, 2012, at 1:24 PM, Sohail Aboobaker wrote: > Any > ideas on when Solr 4.0 will be out? We are hoping this year. There will be a series of alphas and betas that should start within a month or few. - Mark Miller lucidimagination.com

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-26 Thread Mark Miller
On Apr 23, 2012, at 12:10 PM, geeky2 wrote: > http://someip:someport/somepath/somecore/admin/replication/ is not > available. Index fetch failed. Exception: Invalid version (expected 2, but > 10) or the data in not in 'javabin' format This is kind of a bug. When Solr tries to talk in javabin and

Re: solr error after relacing schema.xml

2012-04-26 Thread Mark Miller
By default logging goes to std out. You probably want to configure real logging though: http://wiki.apache.org/solr/SolrJetty#Logging On Apr 26, 2012, at 1:33 PM, BillB1951 wrote: > It does not appear that any logfiles were created. > > > - > BillB1951 > -- > View this message in context:

Re: solr error after relacing schema.xml

2012-04-26 Thread BillB1951
It does not appear that any logfiles were created. - BillB1951 -- View this message in context: http://lucene.472066.n3.nabble.com/solr-error-after-relacing-schema-xml-tp3940133p3941997.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Question on Facet counts by grouped results

2012-04-26 Thread Sohail Aboobaker
Never mind, I did not notice that this is coming in Solr 4.0. Any ideas on when Solr 4.0 will be out? Sohail

Re: Recovery - too many updates received since start

2012-04-26 Thread Mark Miller
On Apr 24, 2012, at 9:31 AM, Trym R. Møller wrote: > Hi > > I experience that a Solr looses its connection with Zookeeper and > re-establish it. After Solr is reconnection to Zookeeper it begins to recover. > It has been missing the connection approximately 10 seconds and meanwhile the > leade

Re: solr error after relacing schema.xml

2012-04-26 Thread Mark Miller
Try looking at the logs. On Apr 25, 2012, at 10:53 PM, BillB1951 wrote: > Trouble getting solr and Haystack working together. I have solar > working (can get admin screen and query test data). I then create my > search_indexes.py (per getting started example, I also have run syndb > and added d

impact of EdgeNGramFilterFactory on indexing process?

2012-04-26 Thread geeky2
Hello all, i am experimenting with EdgeNGramFilterFactory - on two of the fieldTypes in my schema. i believe i understand this - but want to verify: 1) will this increase my index time? 2) will increase the number of documents in my index? thank you -- View this message in context:

Re: Solr for routing a webapp

2012-04-26 Thread Paul Libbrecht
Have you tried using mod_rewrite for this? paul Le 26 avr. 2012 à 15:16, Björn Zapadlo a écrit : > Hello, > > I'm thinking about using a Solr index for routing a webapp. > > I have pregenerated base urls in my index. E.g. > /foo/bar1 > /foo/bar2 > /foo/bar3 > /foo/bar4 > /bar/foo1 > /bar/foo2

Re: Dynamic creation of cores for this use case.

2012-04-26 Thread Michael Kuhlmann
Am 26.04.2012 16:17, schrieb pprabhcisco123: The use case is to create a core for each customer as well as partner . Since its very difficult to create cores statically in solr.xml file for all 4500 customers , is there any way to create the cores dynamically or on the fly. Yes there is. Have

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-26 Thread geeky2
hello, sorry - i overlooked this message - thanks for checking back and thanks for the info. yes - replication seems to be working now: tailed from logs just now: 2012-04-26 09:21:33,284 INFO [org.apache.solr.handler.SnapPuller] (pool-12-thread-1) Slave in sync with master. 2012-04-26 09:21:53

Re: Dynamic creation of cores for this use case.

2012-04-26 Thread pprabhcisco123
Hi, Thanks Erick for your response . Actually , the total no of customers is 4500 and every group of customers say 30 is a considered to be a partner or agent. The use case is to create a core for each customer as well as partner . Since its very difficult to create cores statically in solr.x

Format "content" field

2012-04-26 Thread webdev1977
Greetings all! I have created a enterprise search architecture that includes both nutch for crawling as well as solr for indexing. I was so focused on the nutch part that I didn't realized that my user interface (Jquery based) was lacking in appeal. One of my issues is the format of the text in

Solr for routing a webapp

2012-04-26 Thread Björn Zapadlo
Hello, I'm thinking about using a Solr index for routing a webapp. I have pregenerated base urls in my index. E.g. /foo/bar1 /foo/bar2 /foo/bar3 /foo/bar4 /bar/foo1 /bar/foo2 /bar/foo3 I try to find a way to match /foo/bar3/parameter1/value1/parameter2/value2 without knowing that parameter and

Re: auto warm up cache and new data

2012-04-26 Thread Tomás Fernández Löbbe
The warmup process reloads the data from the new index. Cache in Solr expires with a new searcher, correct. You could have evictions too if it gets filled. On Thu, Apr 26, 2012 at 8:33 AM, mizayah wrote: > Please help me understand that. > What wil happen if if have cached data and thay change

auto warm up cache and new data

2012-04-26 Thread mizayah
Please help me understand that. What wil happen if if have cached data and thay change after comit and i have autowarm set up. Old cached data will be still accesible in cache so i will get old data? That means if autowarm copy all needed data to new cache probably i will never see new data? Cache

Re: Stats.facet on date returns error

2012-04-26 Thread Erick Erickson
Works on my macine (tm). I tried both trunk and 3.6, so I guess that means we need more details. What version are you running on? What is your exact URL? Did you do anything like change the definition without blowing away your index and re-indexing? Have you tried using Luke or the schema browser

Re: Dynamic creation of cores for this use case.

2012-04-26 Thread Erick Erickson
Take a look here: http://wiki.apache.org/solr/CoreAdmin?highlight=%28create%29%7C%28core%29#CREATE I'm not sure about your "partners" view. As an alternative to creating individual cores, you could simply use a single index and use filter queries (fq) to restrict the selection to the relevant cust

Using Customized sorting in Solr

2012-04-26 Thread solr user
Hi, We are planning to move the search of one of our listing based portal to solr/lucene search server from sphinx search server. But we are facing a challenge is porting customized sorting being used in our portal. We only have last 60 days of data live.The algorithm is as follows:- 1. Put a

solr error after relacing schema.xml

2012-04-26 Thread BillB1951
Trouble getting solr and Haystack working together. I have solar working (can get admin screen and query test data). I then create my search_indexes.py (per getting started example, I also have run syndb and added data to the Notes table). I run manage.py build_solar_schema, it generates XML for

Re: EmbeddedSolrServer and StreamingUpdateSolrServer

2012-04-26 Thread pcrao
Hi Ryan, I see. Yes, for incremental indexing(Hourly) we use StreamingUpdateSolrServer and it is faster than EmbeddedSolrServer. We are also using, Embedded server for full indexing on a daily basis and it is efficient for full indexing as it can handle large number of documents in a better way.

Re: Boosting fields in SOLR using Solrj

2012-04-26 Thread Michael Kuhlmann
Am 26.04.2012 00:57, schrieb Joe: Hi, I'm using the solrj API to query my SOLR 3.6 index. I have multiple text fields, which I would like to weight differently. From what I've read, I should be able to do this using the dismax or edismax query types. I've tried the following: SolrQuery query =

Re: Boosting fields in SOLR using Solrj

2012-04-26 Thread Ryan McKinley
I would suggest debugging with browser requests -- then switching to Solrj after you are at 1st base. In particular, try adding the &debugQuery=true parameter to the request and see what solr thinks is happening. The value that will "work" for the 'qt' parameter depends on what is configured in s

Re: EmbeddedSolrServer and StreamingUpdateSolrServer

2012-04-26 Thread Ryan McKinley
In general -- i would not suggest mixing EmbeddedSolrServer with a different style (unless the other instances are read only). If you have multiple instances writing to the same files on disk you are asking for problems. Have you tried just using StreamingUpdateSolrServer for daily update? I woul