Mathieu, have a look at Solr's DataImportHandler. It provides a
configuration-based approach to index different types of datasources
including relational databases and XML files. In particular have a look at
the XpathEntityProcessor (
http://wiki.apache.org/solr/DataImportHandler#head-f1502b1ed71d98ef0120671db5762e137e63f9d2)
which allows you to use xpath syntax to map xml data to index fields.

-Jay


On Tue, Jul 7, 2009 at 7:25 AM, Saeli Mathieu <saeli.math...@gmail.com>wrote:

> Hello.
>
> I'm a new user of Solr, I already used Lucene to index files and search.
> But my programme was too slow, it's why I was looking for another solution,
> and I thought I found it.
>
> I said I thought because I don't know if it's possible to use solar with
> this kind of XML files.
>
>  <lom xsi:schemaLocation="http://ltsc.ieee.org/xsd/lomv1.0
> http://ltsc.ieee.org/xsd/lomv1.0/lom.xsd";>
> <general>
> <identifier>
> <catalog>STRING HERE</catalog>
> <entry>
> STRING HERE
> </entry>
> </identifier>
> <title>
> <string language="fr">
> STRING HERE
> </string>
> </title>
> <language>fr</language>
> <description>
> <string language="fr">
> STRING HERE
> </string>
> </description>
> </general>
> <lifeCycle>
> <status>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </status>
> <contribute>
> <role>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </role>
> <entity>STRING HERE
> </entity>
> </contribute>
> </lifeCycle>
> <metaMetadata>
> <identifier>
> <catalog>STRING HERE</catalog>
> <entry>STRING HERE</entry>
> </identifier>
> <contribute>
> <role>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </role>
> <entity>STRING HERE
> </entity>
> <date>
> <dateTime>STRING HERE</dateTime>
> </date>
> </contribute>
> <contribute>
> <role>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </role>
> <entity>STRING HERE
> </entity>
> <entity>STRING HERE</entity>
> <entity>STRING HERE
> </entity>
> <date>
> <dateTime>STRING HERE</dateTime>
> </date>
> </contribute>
> <metadataSchema>STRING HERE</metadataSchema>
> <language>STRING HERE</language>
> </metaMetadata>
> <technical>
> <location>STRING HERE
> </location>
> </technical>
> <educational>
> <intendedEndUserRole>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </intendedEndUserRole>
> <context>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </context>
> <typicalAgeRange>
> <string language="fr">STRING HERE</string>
> </typicalAgeRange>
> <description>
> <string language="fr">STRING HERE</string>
> </description>
> <description>
> <string language="fr">STRING HERE</string>
> </description>
> <language>STRING HERE</language>
> </educational>
> <annotation>
> <entity>STRING HERE
> </entity>
> <date>
> <dateTime>STRING HERE</dateTime>
> </date>
> </annotation>
> <classification>
> <purpose>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </purpose>
> </classification>
> <classification>
> <purpose>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </purpose>
> <taxonPath>
> <source>
> <string language="fr">STRING HERE</string>
> </source>
> <taxon>
> <id>STRING HERE</id>
> <entry>
> <string language="fr">STRING HERE</string>
> </entry>
> </taxon>
> </taxonPath>
> </classification>
> <classification>
> <purpose>
> <source>STRING HERE</source>
> <value>STRING HERE</value>
> </purpose>
> <taxonPath>
> <source>
> <string language="fr">STRING HERE </string>
> </source>
> <taxon>
> <id>STRING HERE</id>
> <entry>
> <string language="fr">STRING HERE</string>
> </entry>
> </taxon>
> </taxonPath>
> <taxonPath>
> <source>
> <string language="fr">STRING HERE</string>
> </source>
> <taxon>
> <id>STRING HERE</id>
> <entry>
> <string language="fr">STRING HERE</string>
> </entry>
> </taxon>
> </taxonPath>
> </classification>
> </lom>
>
> I don't know how I can use this kind of file with Solr because the XML
> example are this one.
>
>  <add>
>  <doc>
>  <field name="id">SOLR1000</field>
>  <field name="name">Solr, the Enterprise Search Server</field>
>  <field name="manu">Apache Software Foundation</field>
>  <field name="cat">software</field>
>  <field name="cat">search</field>
>  <field name="features">Advanced Full-Text Search Capabilities using
> Lucene</field>
>  <field name="features">Optimized for High Volume Web Traffic</field>
>  <field name="features">Standards Based Open Interfaces - XML and
> HTTP</field>
>  <field name="features">Comprehensive HTML Administration
> Interfaces</field>
>  <field name="features">Scalability - Efficient Replication to other Solr
> Search Servers</field>
>  <field name="features">Flexible and Adaptable with XML configuration and
> Schema</field>
>  <field name="features">Good unicode support: h&#xE9;llo (hello with an
> accent over the e)</field>
>  <field name="price">0</field>
> <field name="popularity">10</field>
> <field name="inStock">true</field>
> <field name="incubationdate_dt">2006-01-17T00:00:00.000Z</field>
> </doc>
> </add>
>
> I understood Solr need this kind of architecture, by Architecture I mean
> <field + name="keyword">Value</field>
> or as you can see I can't use this kind of architecture because I'm not
> allow to change my XML files.
>
> I'm looking forward to read you.
>
> Mathieu Saeli
> --
> Saeli Mathieu.
>

Reply via email to