Re: How to Use PageRank like Document Boosting at Solr?

2013-06-12 Thread Michael Della Bitta
Seems like your boost field needs to be indexed. On Jun 12, 2013 3:49 AM, "Furkan KAMACI" wrote: > I use Nutch to index my documents. I have a Nutch-aware schema in my Solr > and there is a field like that: > > boost holds the OPIC score of my documents (similar

How to Use PageRank like Document Boosting at Solr?

2013-06-12 Thread Furkan KAMACI
I use Nutch to index my documents. I have a Nutch-aware schema in my Solr and there is a field like that: boost holds the OPIC score of my documents (similar to Google's PageRank). How can I boost my queries on the Solr side? I followed the wiki and tried this: q={!boost b=boost}text:supervillians
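
A minimal sketch of how such a request could be assembled, assuming the boost query parser form that appears elsewhere in this listing ({!boost b=<field> v=$qq} with the user query passed separately as qq). The host, port, core name and the helper build_boost_url are hypothetical, chosen only for illustration.

```python
from urllib.parse import urlencode, parse_qs, urlsplit


def build_boost_url(base_url, boost_field, user_query, rows=10):
    """Build a select URL whose score is the text score multiplied by boost_field."""
    params = {
        # Local params: b names the boost field, v dereferences the qq parameter.
        "q": "{!boost b=%s v=$qq}" % boost_field,
        "qq": user_query,
        "rows": rows,
    }
    # urlencode percent-escapes the braces, "!", "$" and spaces for us.
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint; replace with the real host and core.
url = build_boost_url(
    "http://localhost:8983/solr/collection1/select",
    "boost",
    "text:supervillians",
)
print(url)
# Decode again to check that the parameters survive the round trip.
print(parse_qs(urlsplit(url).query))
```

Keeping the user query in qq means the local-params prefix never has to be spliced into user input by hand.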

Re: PageRank

2012-04-04 Thread Markus Jelsma
Hi, Please subscribe to the Nutch mailing list. Scoring is straightforward and calculated scores can be written to the CrawlDB or as external file field for Solr. Cheers On Wed, 04 Apr 2012 10:22:46 -0500 (COT), Manuel Antonio Novoa Proenza wrote: hi Rav Thank you for your answer. In my

Re: PageRank

2012-04-04 Thread Manuel Antonio Novoa Proenza
Hi Rav, thank you for your answer. In my case I use Nutch for crawling the web, and I am a true rookie with Nutch. How do I configure Nutch to return that information? And how do I make Solr index that information, or is that information already built into the score of the indexed documents? thank

Re: pagerank??

2012-04-04 Thread Ravish Bhagdev
his. > > In my case, I get data by keyword-matching from Solr and then rank the data > by PageRank after that. > > Thanks, > Bing > > On Wed, Apr 4, 2012 at 6:37 AM, Manuel Antonio Novoa Proenza < > mano...@estudiantes.uci.cu> wrote: > > > Hello, > > &

Re: pagerank??

2012-04-04 Thread Bing Li
As far as I know, Solr cannot support this. In my case, I get data by keyword-matching from Solr and then rank the data by PageRank after that. Thanks, Bing On Wed, Apr 4, 2012 at 6:37 AM, Manuel Antonio Novoa Proenza < mano...@estudiantes.uci.cu> wrote: > Hello, > > I

pagerank??

2012-04-03 Thread Manuel Antonio Novoa Proenza
Hello, I have many indexed documents in my Solr index. Please let me know of any way or efficient function to calculate the PageRank of the indexed websites.

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-29 Thread Ahmet Arslan
> As I learned, big data, such as Lucene index, was not > suitable to be > updated frequently. Some people use ExternalFileField for PageRank-like fields. http://lucidworks.lucidimagination.com/display/solr/Solr+Field+Types#SolrFieldTypes-WorkingwithExternalFiles Lucene supports par

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-28 Thread Bing Li
e: > You can update the document in the index quite frequently. I don't know what > your requirement is; another option would be to boost at query time. > > On Sun, Jan 22, 2012 at 5:51 AM, Bing Li wrote: > > Dear Shashi, > > > > Thanks so much for your reply! > > > &

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-23 Thread Shashi Kant
You can update the document in the index quite frequently. I don't know what your requirement is; another option would be to boost at query time. On Sun, Jan 22, 2012 at 5:51 AM, Bing Li wrote: > Dear Shashi, > > Thanks so much for your reply! > > However, I think the value of PageRank is n

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-22 Thread Bing Li
Dear Shashi, Thanks so much for your reply! However, I think the value of PageRank is not a static one. It must be updated on the fly. As far as I know, a Lucene index is not suited to very frequent updates. If so, how should I deal with that? Best regards, Bing On Sun, Jan 22, 2012 at 12:43 PM, Shashi

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-21 Thread Shashi Kant
Lucene has a mechanism to "boost" up/down documents using your custom ranking algorithm. So if you come up with something like Pagerank you might do something like doc.SetBoost(myboost), before writing to index. On Sat, Jan 21, 2012 at 5:07 PM, Bing Li wrote: > Hi, Kai, > >

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-21 Thread Bing Li
a? > > > > If I want to add new sorting methods, Solr's function query seems to support > > this feature. > > > > However, for a complicated ranking strategy, such as PageRank, can Solr > > provide an interface for me to do that? > > > > My ranking methods are mo

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-21 Thread Kai Lu
> > However, for a complicated ranking strategy, such as PageRank, can Solr > provide an interface for me to do that? > > My ranking methods are more complicated than PageRank. Now I have to load all > of the matched data from Solr first by keyword and rank it again in my own way > before showing it to users. Is that correct? > > Thanks so much! > Bing

How to Sort By a PageRank-Like Complicated Strategy?

2012-01-21 Thread Bing Li
string field (complete matching), how does Lucene sort the retrieved data? If I want to add new sorting methods, Solr's function query seems to support this feature. However, for a complicated ranking strategy, such as PageRank, can Solr provide an interface for me to do that? My ranking methods are

Re: PageRank sort

2009-04-24 Thread Marcus Herou
elect?indent=on&start=0&rows=100&q={!boost b=blogRank v=$qq}&qq=title:solr&debugQuery=on' > > On Sat, Apr 25, 2009 at 12:43 AM, Marcus Herou > wrote: > >&g

Re: PageRank sort

2009-04-24 Thread Marcus Herou
Cool! GET 'http://127.0.0.1:8110/solr/test/select?indent=on&start=0&rows=100&q={!boost b=blogRank v=$qq}&qq=title:solr&debugQuery=on' On Sat, Apr 25, 2009 at 12:43 AM, Marcus Herou wrote: > That seems wise... PageRank * Text-based Scoring. > > So you mean

Re: PageRank sort

2009-04-24 Thread Marcus Herou
That seems wise... PageRank * Text-based Scoring. So you mean in my stupid case that: GET 'http://127.0.0.1:8110/solr/test/select?indent=on&start=0&rows=100&q={!boost b=blogRank v=$qq}&qq=*:*' would yield the same results as: GET "http://127.0.0.1:8110/solr/test/se

Re: PageRank sort

2009-04-24 Thread Yonik Seeley
You probably want to mix the custom score with the normal relevancy score... to add, use a normal boolean query. To multiply, check out boosted query: http://lucene.apache.org/solr/api/org/apache/solr/search/BoostQParserPlugin.html For other options, use a more complex function query with the new
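
To illustrate why the choice between adding and multiplying matters, here is a small sketch in plain Python. It does not call Solr; the two documents and their scores are made-up numbers, and combine_add, combine_multiply and rank_by are hypothetical helpers that only mimic the arithmetic of the two approaches.

```python
def combine_add(text_score, rank):
    """Additive mix: roughly what adding the rank as an extra query clause does."""
    return text_score + rank


def combine_multiply(text_score, rank):
    """Multiplicative mix: roughly what a boosted query does."""
    return text_score * rank


def rank_by(docs, combine):
    """Return document ids ordered by the combined score, best first."""
    return sorted(docs, key=lambda doc_id: combine(*docs[doc_id]), reverse=True)


# Made-up (text_score, rank) pairs: "a" matches the text well but has a low
# rank, "b" barely matches the text but has a high rank.
docs = {"a": (2.0, 1.0), "b": (0.1, 3.0)}

print(rank_by(docs, combine_add))
print(rank_by(docs, combine_multiply))
```

With these numbers the additive mix puts the weak text match first because its rank dominates the sum, while the multiplicative mix keeps text relevance as a factor and puts the strong text match first.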

Re: PageRank sort

2009-04-24 Thread Marcus Herou
That is fantastic, I am creating a really small index right now trying to figure out how to implement the FunctionQuery for this. //Marcus On Fri, Apr 24, 2009 at 10:55 PM, Yonik Seeley wrote: > On Fri, Apr 24, 2009 at 1:39 PM, Marcus Herou > wrote: > > Great! That seems like something that coul

Re: PageRank sort

2009-04-24 Thread Marcus Herou
Works like a charm! Thank you sir. //Marcus On Fri, Apr 24, 2009 at 11:01 PM, Marcus Herou wrote: > That is fantastic, I am creating a really small index right now trying to > figure out howto implement the FunctionQuery for this. > > //Marcus > > > On Fri, Apr 24, 2009 at 10:55 PM, Yonik Seele

Re: PageRank sort

2009-04-24 Thread Marcus Herou
And I published the setup here: http://dev.tailsweep.com/solr-external-scoring/en/ /M On Sat, Apr 25, 2009 at 12:01 AM, Marcus Herou wrote: > Works like a charm! > > Thank you sir. > > //Marcus > > > On Fri, Apr 24, 2009 at 11:01 PM, Marcus Herou > wrote: > >> That is fantastic, I am creating a

Re: PageRank sort

2009-04-24 Thread Yonik Seeley
On Fri, Apr 24, 2009 at 1:39 PM, Marcus Herou wrote: > Great! That seems like something that could work. Depends on how that field > gets re-read/indexed I guess. http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html It's a separate *text* file that just contains id/va
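
A rough sketch of how such a file might be produced from externally computed ranks. It assumes one key=value pair per line and a file named external_<fieldname>, which is my understanding of the ExternalFileField convention rather than something the truncated snippet above spells out. The field name blogRank is borrowed from the queries in this thread; write_external_scores, the directory and the document ids are hypothetical.

```python
import os
import tempfile


def write_external_scores(data_dir, field_name, scores):
    """Write scores as key=value lines to external_<field_name> in data_dir."""
    path = os.path.join(data_dir, "external_" + field_name)
    tmp_path = path + ".tmp"
    with open(tmp_path, "w", encoding="utf-8") as handle:
        # Sorted keys keep the output stable between runs.
        for doc_id in sorted(scores):
            handle.write("%s=%s\n" % (doc_id, scores[doc_id]))
    # Swap the finished file into place so a reader never sees a partial file.
    os.replace(tmp_path, path)
    return path


# Hypothetical usage with a throwaway directory standing in for the data dir.
demo_dir = tempfile.mkdtemp()
demo_path = write_external_scores(demo_dir, "blogRank", {"doc2": 0.5, "doc1": 1.5})
with open(demo_path, encoding="utf-8") as handle:
    print(handle.read())
```

Because the ranks live outside the index, they can be regenerated on their own schedule without reindexing the static document fields.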

Re: PageRank sort

2009-04-24 Thread Marcus Herou
Hi. Comments inline. On Fri, Apr 24, 2009 at 1:00 PM, Grant Ingersoll wrote: > How often are you updating the rank? The goal is to optimize the PageRank calculation algorithm so we can have continuous updates (one blog at a time, 24/7) but more likely we'll end up refreshing the inde

Re: PageRank sort

2009-04-24 Thread Grant Ingersoll
On Apr 24, 2009, at 1:52 AM, Marcus Herou wrote: Hi. I've posted before but here it goes again: I have BlogData data which is more or less 100% static but one field is not - the PageRank. I would like to sort on that field and on the Lucene list I got these answers. 1. Use two ind

PageRank sort

2009-04-23 Thread Marcus Herou
Hi. I've posted before but here it goes again: I have BlogData data which is more or less 100% static but one field is not - the PageRank. I would like to sort on that field and on the Lucene list I got these answers. 1. Use two indexes and a ParallelReader 2. Use a FieldScoreQuery conta