Stefan,


You're right. I was attempting to post some quick pseudo-code, but that
<doc/> is pretty misleading, they should have been <str> elements, like <str
name="dblocation">/abc/def/ghi/123.xml</str>, or something to that affect.



Thanks,

Walter


On Mon, Jan 10, 2011 at 10:08 AM, Stefan Matheis <
matheis.ste...@googlemail.com> wrote:

> Hey Walter,
>
> what's against just putting your db-location in a 'string' field, and use
> it
> like any other value?
> There is no special field-type for something like a
> path/directory/location-information, afaik.
>
> Regards
> Stefan
>
> On Mon, Jan 10, 2011 at 4:50 PM, Walter Closenfleight <
> walter.p.closenflei...@gmail.com> wrote:
>
> > I'm very unclear on how to associate what I need to a Solr index entry.
> > Based on what I've read thus far, you can extract data from text files
> and
> > store that in a Solr document.
> >
> > I have hundreds of thousands of documents in a database/svn type system.
> > When I index a file, it is likely going to be local to the filesystem and
> I
> > know the location it will take on in the database. So, when I index, I
> want
> > to provide a path that it can find it when someone else does a search.
> >
> > 123.xml may look like:
> >
> > <mydoc>
> > <title>my title</title>
> > <para>Every foobar has its day</para>
> > <figure href="/abc/xxx.gif"><caption>My caption</caption>
> > </mydoc>
> >
> > and the proprietary location I want it to be associated with is:
> >
> > /abc/def/ghi/123.xml
> >
> > So, when a user does a search for "foobar", it returns some information
> > about 123.xml but most importantly the location should be available.
> >
> > I have yet to find (in the schema.xml or otherwise) where you can define
> > that path to store, and how you would pass along that parameter in the
> > indexing of that document.
> >
> > Instead, from the examples I can find, including the book, you store
> fields
> > from your data into the index. In the book's examples (a music database),
> > searching for "Cherub Rock" returns a list of with their duration, track
> > name, album name, and artist. In other words, the full text data you
> > retrieve is the only information the search index has to offer.
> >
> > Just for example, using the exampledocs post.jar, I'm envisioning
> something
> > like this:
> >
> > java -jar post.jar 123.xml -dblocation "/abc/def/ghi/123.xml" -othermeta1
> > "xxx" -othermeta2 "zzz"
> >
> > Then the Solr doc would look like:
> > <doc>
> > <field name="id">123</field>
> > <field name="dblocation">/abc/def/ghi/123.xml</field>
> > <field name="othermeta1">xxx</field>
> > <field name="othermeta2">zzz</field>
> > <field name="title">my title</field>
> > <field name="graphic">/abc/xxx.gif</field>
> > <field name="text">Every foobar has its day My caption</field>
> > </doc>
> >
> > This way, when a user searches for foobar, they get item 123 back, review
> > the search result and if they decide that's the data they want, they can
> > use
> > the dblocation field to retrieve the data for editing purposes (and then
> > re-index it following their edits).
> >
> > I'm guessing I just haven't found the right terms yet to look into, as
> I'm
> > very new to this. Thanks for any direction you can provide. Also, if Solr
> > appears to be the wrong tool for what I need, let me know as well!
> >
> > Thank you,
> > Walter
> >
>

Reply via email to