How can i indexing MS-Outlook files?

2008-12-13 Thread RaghavPrabhu
Hi Folks, I want to indexing MS-Outlook mails in my data directory.How can i perform this function? Please help me and give the solution as soon as possible. Thanks in advance Prabhu.K -- View this message in context: http://www.nabble.com/How-can-i-indexing-MS-Outlook-files--tp20988507p20

Boosting by date when only some records have one

2008-12-13 Thread Robert Krüger
Hi, I'm looking for a way to boost queries by date but not all documents have a date associated with them. My goal is to have something like a default boost for documents (e.g. 1.0) with a function for documents with dates that distribute the boosts between 1.0 - x to 1.0 + x based on a valid dat

Re: Stopping / Starting IndexReaders in Solr 1.3+

2008-12-13 Thread Kay Kay
Erik Hatcher wrote: Maybe the PingRequestHandler can help? It can check for the existence of a file (see solrconfig.xml for healthcheck) and return an error if it is not there. This wouldn't prevent Solr from responding to requests, but if a client used that information to determine whether to

Re: How can i indexing MS-Outlook files?

2008-12-13 Thread Kay Kay
You can check out the format of the MS-Outlook files. If they happen to be plain text - may be a little bit of parsing to remove the protocol headers would be needed. Otherwise - you can check with Thunderbird / OpenOffice teams to see how they parse the data when they import from MS-Outlook (

DIH Admin Page Commands

2008-12-13 Thread Ahmed Hammad
Hi, I would like to add a few utility commands to the DIH admin page. I frequently need these commands to manage the index. The commands are: full Import, delta Import, status, reload config, ... In addition to "Return to Admin Page" link. It will be a simple forms at the end of debug.jsp as foll

Re: SolrConfig.xml Replication

2008-12-13 Thread Shalin Shekhar Mangar
Jeff, SOLR-821 has a patch now. It'd be nice to get some feedback if you manage to try it out. On Thu, Dec 11, 2008 at 8:33 PM, Jeff Newburn wrote: > Thank you for the quick response. I will keep an eye on that to see how it > progresses. > > > On 12/10/08 8:03 PM, "Noble Paul നോബിള്‍ नोब्ळ्"

Re: DIH Admin Page Commands

2008-12-13 Thread Shalin Shekhar Mangar
Thanks for the suggestions. I'm sure these will be useful to a lot of users. Do you mind opening an issue in Jira? If you have a patch, that'd be awesome :-) On Sat, Dec 13, 2008 at 8:58 PM, Ahmed Hammad wrote: > Hi, > > I would like to add a few utility commands to the DIH admin page. I > frequ

Re: DIH Admin Page Commands

2008-12-13 Thread Ahmed Hammad
Thanks for your feedback :-) Sure, I will open issue and attach a patch. Best wishes, Ahmed On Sat, Dec 13, 2008 at 7:44 PM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > Thanks for the suggestions. I'm sure these will be useful to a lot of > users. > Do you mind opening an issue in

escaping/removing control characters?

2008-12-13 Thread Ryan McKinley
I'm indexing some mail archives and within the various formats/ encodings etc, some messages have invalid control characters. doc.setField( "body", content.toString() ); In the solr logs, I get: [java] SEVERE: java.io.IOException: Illegal character ((CTRL- CHAR, code 22)) [java] a

Re: unique so that if that combination exists in index it will return an error

2008-12-13 Thread Erik Hatcher
On Dec 13, 2008, at 12:10 PM, Kishore Veleti CoreObject wrote: In Lucene is there a way (like in databases) to define a combination of 3 fields as unique so that if that combination exists in index it will return an error and not add another document with that combination again? No, you'll

Re: unique so that if that combination exists in index it will return an error

2008-12-13 Thread KishoreVeleti CoreObjects
Thanks Erik for your reply. But I have another question. Supose say I already have my database primary key as uniqueId in SOLR. Besides uniqueKey "Id" field I have 3 more fields say field1, field2, field3. I want to make sure that in SOLR I will have only one document where field1, field2, field

Re: [RESULTS] Community Logo Preferences

2008-12-13 Thread Mathijs Homminga
Hi Ryan, I am of course very happy to see that people like the logo I submitted! But I agree that it looks a lot like the Solaris logo (although I wasn't aware of that when designing it). Anyway, good luck with the decisions! Regards, Mathijs Ryan McKinley wrote: Check the results from the

Re: escaping/removing control characters?

2008-12-13 Thread Yonik Seeley
On Sat, Dec 13, 2008 at 1:45 PM, Ryan McKinley wrote: > Is there any standard way to escape invalid xml control characters? Not that I know of... it's a shame that XML can't carry the full unicode range. Good reason to get binary or JSON indexing interface at some point... I think Noble was worki

DIH - duplicate xpaths on HttpDataSource

2008-12-13 Thread Erik Hatcher
I'm trying to index a blog with DIH, and have this: If I comment out the url line it all works fine, but if I put it in, no documents get indexed. Is there an issue with using the same xpath twice? Or something else I'm missing? This is using Solr trunk. Than

Re: ExtractingRequestHandler and XmlUpdateHandler

2008-12-13 Thread Grant Ingersoll
On Dec 12, 2008, at 11:53 PM, Jacob Singh wrote: Hi Grant, Thanks for the quick response. My Colleague looked into the code a bit, and I did as well, here is what I see (my Java sucks): http://svn.apache.org/repos/asf/lucene/solr/trunk/contrib/extraction/src/main/java/org/apache/solr/handler

Re: ExtractingRequestHandler and XmlUpdateHandler

2008-12-13 Thread Grant Ingersoll
Hi Jacob, I just updated the code such that it should now be possible to send in multiple values as literals, as in an HTML form that looks like: method="POST"> Choose a file to upload: Cheers, Grant On Dec 12, 2008, at 11:53 PM, Jacob Singh wrote: Hi Grant, Thanks for the q