Re: Indexing Doc, PDF, ... from filesystem (Newbie Question)

2007-08-21 Thread Vish D.
There seems to be some code out for Tika now (not packaged/announced yet, but...). Could someone please take a look at it and see if that could fit in? I am eagerly waiting for a reply back from tika-dev, but no luck yet. http://svn.apache.org/repos/asf/incubator/tika/trunk/src/main/java/org/apach

Re: Indexing Doc, PDF, ... from filesystem (Newbie Question)

2007-08-21 Thread Vish D.
va programmer so if you have questions about the internals > of the code, definitely direct those to Eric as I cannot help. I have > only implemented it in web applications. If you have any other > questions about the use of the patch I can answer those questions. > > Enjoy! > >

Re: Indexing Doc, PDF, ... from filesystem (Newbie Question)

2007-08-21 Thread Vish D.
(create extra elements, create '' for pdf files and '' for html files, etc..), etc... This strips out having to deal with if/else scripting outside of Solr. Rao On 8/21/07, Vish D. <[EMAIL PROTECTED]> wrote: > > Pete, > > > > Thanks for the gr

Re: Indexing Doc, PDF, ... from filesystem (Newbie Question)

2007-08-21 Thread Vish D.
On 8/21/07, Vish D. <[EMAIL PROTECTED]> wrote: > > On 8/21/07, Peter Manis <[EMAIL PROTECTED]> wrote: > > > > I am a little confused how you have things setup, so these meta data > > files contain certain information and there may or may not be a pdf, >

Re: Interest in Extending SOLR

2006-04-13 Thread Vish D.
Mike, I am currently evaluating different search engine technologies (esp., open source ones), and this is very interesting to me, for the following reasons: Our data is much like yours in that we have different types of data (abstracts, fulltext, music, etc...), which eventually fall under diffe

Re: Interest in Extending SOLR

2006-04-18 Thread Vish D.
Yonik/Chris, Do we have a eta on " Allow multiple independent Solr *webapps* in the same app server"? After reading up, silently, on the many emails on this topic, I agree with you that it would be worthwhile to test out the current implementation and see how it performs. But, it makes sense to r

Faceted Browsing questions

2006-06-23 Thread Vish D.
Hi all, I am trying to figure out how I can have some type of faceted browsing working. I am also in need of a way to get a list of unique field values within a query's results set (for filtering, etc...). When I say trying, I mean having it up and running without much coding, b/c of time reasons

Re: Faceted Browsing questions

2006-06-24 Thread Vish D.
Thank you Chris and Erik. That makes it a bit clearer, but I might need to sit down and look at the code (nines + DisMax...) a bit closer to see how it all works in Solr. Erik, when do you plan on having your implementation refactored with "good" use of code? Or, in general, when is Solr planning

Re: Faceted Browsing questions

2006-06-24 Thread Vish D.
Thanks! On 6/24/06, Erik Hatcher <[EMAIL PROTECTED]> wrote: On Jun 24, 2006, at 12:38 PM, Vish D. wrote: > Erik, when do you plan on having your implementation refactored > with "good" > use of code? This weekend :) I have imported more data than my hacked implementat

Re: Faceted Browsing questions

2006-06-28 Thread Vish D.
Erik, Any update on your progress? Eager to get my hands on on your latest code... :=) Thanks! On 6/28/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: : > well, the most obvious solution i can think of would be a patch adding an : > invert() method to DocSet, HashDocSet and BitDocSet. :) :

Re: Is solr scalable with respect to number of documents?

2006-09-27 Thread Vish D.
Are there any plans on implementing a MultiSearcher into solr? I have been following the list for a while, and read quite a few topic on multiple instances of solr, in order to accomodate multiple schemas as well as break down index sizes for performance reasons. I have a use case that sits right

Re: Is solr scalable with respect to number of documents?

2006-09-27 Thread Vish D.
I just noticed that link on the first reply from Yonik about FederatedSearch. I see that a lot of thought went in to it. I guess the question to ask would be, any progress on it, Yonik? :) On 9/27/06, Vish D. <[EMAIL PROTECTED]> wrote: Are there any plans on implementing a MultiSearche

LIUS/Fulltext indexing

2007-06-11 Thread Vish D.
Anyone have experience working with LIUS ( http://sourceforge.net/projects/lius/)? I can't seem to find any real documentation on it, even though it seems 'active' @ sourceforge. I need a way to index various types of fulltext, and LIUS seems very promising at first glance. What do you guys think?

Re: LIUS/Fulltext indexing

2007-06-12 Thread Vish D.
Sounds interesting. I can't seem to find any clear dates on the project website. Do you know? ...V1 shipping date? Thanks! On 6/12/07, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote: On 6/12/07, Yonik Seeley <[EMAIL PROTECTED]> wrote: >... I think Tika will be the way forward (some of the code

Re: LIUS/Fulltext indexing

2007-06-12 Thread Vish D.
Wonder if TOM could be useful to integrate? http://tom.library.upenn.edu/convert/sofar.html On 6/12/07, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote: On 6/12/07, Vish D. <[EMAIL PROTECTED]> wrote: > ...Sounds interesting. I can't seem to find any clear dates on the proje