Re: Migrating to Solr

2010-02-25 Thread Bernd Fehling
Hi list, is this true, no downloaded copy of the documentprocessor anywhere available? Regards, Bernd Bernd Fehling schrieb: > Was anyone able to get a copy of: > http://sesat.no/svn/sesat-documentprocessor/ > > Unfortunately it is offline. > > Would be pleased to get a copy. > > Regards, >

Changing term frequency according to value of one of the fields

2010-02-25 Thread Pooja Verlani
Hi, I want to modify Similarity class for my app like the following- Right now tf is Math.sqrt(termFrequency) I would like to modify it to Math.sqrt(termFrequncy/solrDoc.getFieldValue("count")) where count is one of the fields in the particular solr document. Is it possible to do so? Can I import s

Re: If you could have one feature in Solr...

2010-02-25 Thread Lance Norskog
Error messages that make sense. I have to read the source far too often when a simple change to errror-handling would make some feature easy to use. If I want to read Java I'll use Lucene! Passive-aggressive error handling is a related problem: when I do something nonsensical I too often get "0 re

SOLR Multivalued field and length norm

2010-02-25 Thread Pooja Verlani
Hi, I understand if I query on a multivalued field, length norm takes the total length of the multivalued field. Is it possible to use the length of only the particular value in the array of multivalued field? It would be easier and more efficient in searching then. Regards, Pooja

Re: Solr Cell RTF Woes

2010-02-25 Thread Lance Norskog
Ha! http://issues.apache.org/jira/browse/TIKA-282 You're running this on a headless machine and the RTF parser demands an X window. On Thu, Feb 25, 2010 at 11:08 AM, Bill Engle wrote: > Any RTF file I tried to index in Solr 1.4 throws these errors out.  I have > no issues with doc, pdf.  Any th

Re: Delta Query - DIH

2010-02-25 Thread Lance Norskog
It may be easier to understand the problem if you create views for the full- and delta-import queries. On Thu, Feb 25, 2010 at 9:09 AM, JavaGuy84 wrote: > > Hi,My data config looks like below, >         query="select * from z where id=x.id">I am able to successfully run the > Full-Import query

Re: Using XSLT with DIH for a URLDataSource

2010-02-25 Thread Lance Norskog
There could be a common 'open an url' utility method. This would help make the DIH components consistent. 2010/2/24 Noble Paul നോബിള്‍ नोब्ळ् : > you are right. The StreamSource class is not throwing the proper exception > > Do we really have to handle this.? > > On Thu, Feb 25, 2010 at 9:06 AM,

HTTP ERROR: 404 missing core name in path after integrating nutch

2010-02-25 Thread Ian M. Evans
Hi everyone, Last night I was able to get solr up and running. Ran and was able to access: http://localhost:8983/solr/admin This morning, I started on the nutch crawling instructions over at: http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ After adding the following to /solr/con

Re: Strange search behavior

2010-02-25 Thread Otis Gospodnetic
Jan, If you go to Solr Admin Analysis page and enter your problematic query, what do you see? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/ - Original Message > From: Jan Simon Winkelmann > To: "solr-user@lu

Re: Index size

2010-02-25 Thread Otis Gospodnetic
It depends on many factors - how big those docs are (compare a tweet to a news article to a book chapter) whether you store the data or just index it, whether you compress it, how and how much you analyze the data, etc. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop e

Re: new/first searcher

2010-02-25 Thread Otis Gospodnetic
Hi, There is nothing wrong with using the same query for both events. As a matter of fact, it makes sense to use the same (type of) query for both events. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/ - Original M

RE: Free Webinar: Mastering Solr 1.4 with Yonik Seeley

2010-02-25 Thread Bernadette Houghton
Yonk, can you please advise whether this event will be recorded and available for later download? (It starts 5am our time ;-) ) Regards Bern -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Thursday, 25 February 2010 10:23 AM To: solr

RE: How to use dismax and boosting properly?

2010-02-25 Thread Jason Chaffee
I thought I tried that, but I guess I didn't restart Solr to pick up the configuration. That did the trick. Thanks! -Original Message- From: Nagelberg, Kallin [mailto:knagelb...@globeandmail.com] Sent: Thursday, February 25, 2010 1:10 PM To: 'solr-user@lucene.apache.org' Subject: RE:

Re: Seattle Hadoop/Scalability/NoSQL Meetup Tonight!

2010-02-25 Thread Nick Dimiduk
Not that I'm aware of. 2010/2/25 Tim Terlegård > 2010/2/25 Bradford Stephens : > > Thanks for coming, everyone! We had around 25 people. A *huge* > > success, for Seattle. And a big thanks to 10gen for sending Richard. > > > > Can't wait to see you all next month. > > Did anyone record the event

RE: How to use dismax and boosting properly?

2010-02-25 Thread Nagelberg, Kallin
Try setting the boost to 0 for the fields you don't want to contribute to the score. Kallin Nagelberg -Original Message- From: Jason Chaffee [mailto:jchaf...@ebates.com] Sent: Thursday, February 25, 2010 4:03 PM To: solr-user@lucene.apache.org Subject: How to use dismax and boosting pro

Advice on deployment

2010-02-25 Thread Shawn Heisey
We are currently using a commerical indexing product based on Lucene for our indexing needs, and would like to replace it with SOLR. The source database for this system has 40 million records, growing by about 30,000 items per day. It is a repository for all the metadata relating to an archive

Re: Solr 1.4 distributed search configuration

2010-02-25 Thread Mark Miller
Can you elaborate on "doesn't work" when you put it in the /search handler? You get an error in the logs? Nothing happens? On 02/25/2010 03:47 PM, Jeffrey Zhao wrote: Hi Mark, Thanks for your reply. I did make a new handler as following, but it does not work, anything wrong with my configurati

How to use dismax and boosting properly?

2010-02-25 Thread Jason Chaffee
I am using dismax and I have configured to search 3 different fields with one field getting an extra boost so that I the results of that field are at the top of result set. Then, I sort the results by another field to get the ordering. My problem is that the scores are being skewed by the addi

Re: If you could have one feature in Solr...

2010-02-25 Thread Smiley, David W.
1. Spatial search 2. Ease of managing a sharded index, multi-server Solr instance. I am aware these are in-progress, slated for Solr 1.5. I may find myself getting involved on these shortly because I'm working on a very large scale search project requiring both. ~ David On Feb 24, 2010, at 8:4

Re: Solr 1.4 distributed search configuration

2010-02-25 Thread Jeffrey Zhao
Hi Mark, Thanks for your reply. I did make a new handler as following, but it does not work, anything wrong with my configuration? Thanks, name="shards">202.161.196.189:8080/solr,localhost:8080/solr query facet spellcheck deb

Re: Solr 1.4 distributed search configuration

2010-02-25 Thread Mark Miller
On 02/25/2010 03:32 PM, Jeffrey Zhao wrote: How do define a new search handler with a shards parameter? I defined as following way but it doesn't work. If I put the shards parameter in default handler, it seems I got an infinite loop. explicit

Solr 1.4 distributed search configuration

2010-02-25 Thread Jeffrey Zhao
How do define a new search handler with a shards parameter? I defined as following way but it doesn't work. If I put the shards parameter in default handler, it seems I got an infinite loop. explicit 202.161.196.189:8080/solr,localhost:8080/solr

Re: If you could have one feature in Solr...

2010-02-25 Thread Ron Mayer
Erik Hatcher wrote: > Ron - I think SOLR-792 meets the need you describe. What do you think? > It's "tree faceting", allowing you to facet down 2 levels deep > arbitrarily on any two fields. Ideally we'd enhance it to be of > arbitrary depth too. Nice! It certainly handles my main use case. Th

Re: Extended stats via JMX

2010-02-25 Thread Shalin Shekhar Mangar
On Thu, Feb 25, 2010 at 10:56 AM, Dan Trainor wrote: > > Right now, being inexperienced with JMX and all, I was wondering if there > was a way to pull all Solr-specific items out of there. I see some general > counters pertaining to each of my Solr instances, but nothing along the > lines of loo

Re: Extended stats via JMX

2010-02-25 Thread Matthew Runo
https://issues.apache.org/jira/browse/SOLR-1750 might help you, since I don't think that all of stats.jsp is exposed via MBeans. I could be wrong about that though.. (apologies, our solr servers are firewalled and I can't connect via JMX at the moment) Thanks for your time! Matthew Runo Softwa

Solr Cell RTF Woes

2010-02-25 Thread Bill Engle
Any RTF file I tried to index in Solr 1.4 throws these errors out. I have no issues with doc, pdf. Any thoughts? Thanks. Apache Tomcat/6.0.18 - Error report