retrieved based on the job board. What do you think?
You can store the XPath expressions in a text file as strings, and
load/compile them as needed.
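
A minimal sketch of that idea in Python, assuming a hypothetical rules format of one `board=xpath` pair per line. The board names, the expressions, and the helper names `load_rules` and `extract_title` are made up for illustration. The standard library's `xml.etree.ElementTree` supports only a subset of XPath and has no separate compile step, so the expressions are kept as plain strings and looked up per job board.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical rules file: one "board=xpath" pair per line, '#' starts a comment.
RULES_TEXT = """\
# job board = XPath for the job title
exampleboard=.//h1[@class='job-title']
otherboard=.//div[@id='title']/span
"""

def load_rules(lines):
    """Parse 'board=xpath' lines into a dict, skipping blanks and comments."""
    rules = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first '=' only, since the XPath itself contains '='.
        board, _, expr = line.partition("=")
        rules[board.strip()] = expr.strip()
    return rules

def extract_title(rules, board, page_xml):
    """Apply the stored expression for `board` to a well-formed page."""
    root = ET.fromstring(page_xml)
    node = root.find(rules[board])
    return node.text.strip() if node is not None and node.text else None

rules = load_rules(io.StringIO(RULES_TEXT))
page = "<html><body><h1 class='job-title'> Java Developer </h1></body></html>"
title = extract_title(rules, "exampleboard", page)
```

With a real file, `load_rules(open(path))` would work the same way, since the function only iterates over lines.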
From: "Nagelberg, Kallin"
To: "solr-user@lucene.apache.org"
Sent: Wed, July 21, 2010 10:39:32 AM
Subject: RE: faceted search
> only a string which can be stored in a text file or db, and retrieved based on
> the job board. What do you think?
>
> From: "Nagelberg, Kallin"
> To: "solr-user@lucene.apache.org"
> Sent: Wed, July 2
To: "solr-user@lucene.apache.org"
Sent: Wed, July 21, 2010 10:39:32 AM
Subject: RE: faceted search with job title
Yeah, you should definitely just set up a custom parser for each site. It should be
easy to extract the title using Groovy's XML parsing along with TagSoup for sloppy
ht
r dom code. But
is a regex pattern flexible enough for all job boards?
Thanks.
ge here :P
-Kallin Nagelberg
-----Original Message-----
From: Savannah Beckett [mailto:savannah_becket...@yahoo.com]
Sent: Wednesday, July 21, 2010 12:20 PM
To: solr-user@lucene.apache.org
Cc: dave.sea...@magicalia.com
Subject: Re: faceted search with job title
Hmm... there must be a better way.
42:55 AM
Subject: RE: faceted search with job title
You'd probably need to do some post processing on the pages and set up rules
for each website to grab that specific bit of data. You could load the html
into an xml parser, then use xpath to grab content from a particular tag with a
class or id, based on the particular website.
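
A rough sketch of that approach in Python, assuming the page has already been cleaned into well-formed XML (real job pages often are not, so a tolerant HTML parser would have to run first). The site names, the `SITE_RULES` table, and `grab_title` are hypothetical names used only for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical per-site rules: which tag and attribute hold the job title.
SITE_RULES = {
    "site-a.example": ("h1", "class", "job-title"),
    "site-b.example": ("span", "id", "position"),
}

def grab_title(site, page_xml):
    """Return the text of the tag that this site's rule points at, or None."""
    tag, attr, value = SITE_RULES[site]
    root = ET.fromstring(page_xml)  # raises ParseError on malformed markup
    node = root.find(f".//{tag}[@{attr}='{value}']")
    if node is None:
        return None
    # itertext() also collects text inside nested tags such as <b>.
    return "".join(node.itertext()).strip()

page_a = "<html><body><h1 class='job-title'>Senior <b>Solr</b> Engineer</h1></body></html>"
page_b = "<html><body><div><span id='position'>Data Analyst</span></div></body></html>"
title_a = grab_title("site-a.example", page_a)
title_b = grab_title("site-b.example", page_b)
```

Keeping the rule as data (tag, attribute, value) means a new website only needs a new table entry rather than new parsing code.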
-Original M