Just for testing purposes, I would:
1. Use curl to create new docs.
2. Use SolrJ to go to each database individually and collect the docs.
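
For step 1, here is a minimal sketch of an update message that could be posted with curl. The field names "identificador" and "Nom" are taken from the DIH configuration quoted below; the values are made-up test data, and the file name test-doc.xml is hypothetical.

```xml
<!-- test-doc.xml (hypothetical file name): one hand-made document that mimics a row from the Oracle entity -->
<add>
  <doc>
    <!-- same 's_' prefix scheme as the "carrers" entity in the quoted config -->
    <field name="identificador">s_123</field>
    <!-- made-up value for the tooltip column -->
    <field name="Nom">a</field>
  </doc>
</add>
```

Something like `curl http://localhost:8080/solr/update -H 'Content-Type: text/xml' --data-binary @test-doc.xml`, followed by posting `<commit/>` to the same URL, should index it. The host, port and /solr context path are assumptions about the Tomcat setup. If that document then shows up in a search, the schema accepts ids of this form, and the problem is more likely in the import itself.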



On Wed, Jul 7, 2010 at 12:45 PM, Xavier Rodriguez <xee...@gmail.com> wrote:

> Thanks for the quick reply!
>
> In fact it was a typo: the 200 rows I got were from postgres. I meant to
> say that the full-import was omitting the 100 oracle rows.
>
> When I run the full import, I run it as a single job, using the url
> command=full-import. I've tried to clear the index both using the clean
> command and by manually deleting it, but when I run the full-import, the
> number of indexed documents is the number of documents coming from postgres.
>
> To be sure that the id field is unique, I build the id by putting a letter
> before the id value. When indexed, the id looks like s_123, and that's the
> id 123 for an entity identified as "s". Other entities use different
> prefixes, but never "s".
>
> I used DIH to index the data. My configuration is the following:
>
> File db-data-config.xml
>
>  <dataSource
>        type="JdbcDataSource"
>        name="ds_ora"
>        driver="oracle.jdbc.OracleDriver"
>        url="jdbc:oracle:thin:@xxx.xxx.xxx.xxx:1521:SID"
>        user="user"
>        password="password"
>    />
>
>  <dataSource
>        type="JdbcDataSource"
>        name="ds_pg"
>        driver="org.postgresql.Driver"
>        url="jdbc:postgresql://xxx.xxx.xxx.yyy:5432/sid"
>        user="user"
>        password="password"
>    />
>
> <entity name="carrers" dataSource="ds_ora"
>         query="select 's_'||id as id_carrer, 'a' as tooltip from imi_carrers">
>     <field column="id_carrer" name="identificador" />
>     <field column="tooltip" name="Nom" />
> </entity>
>
>
> <entity name="hidrants" dataSource="ds_pg"
>         query="select 'h_'||id as id_hidrant, parc as tooltip from hidrants">
>     <field column="id_hidrant" name="identificador" />
>     <field column="tooltip" name="Nom" />
> </entity>
>
> ----------
>
> With that configuration, all the fields coming from ds_pg are indexed, and
> the fields coming from ds_ora are not. As I've said, what I find strange
> is that no error is logged in Tomcat: the number of documents created is
> the number of rows returned by "hidrants", while the number of rows
> returned is the sum of the rows from "hidrants" and "carrers".
>
> Thanks in advance.
>
> Xavi.
>
> On 7 July 2010 02:46, Erick Erickson <erickerick...@gmail.com> wrote:
>
> > First, do you have a unique key defined in your schema.xml? If you
> > do, some of those 300 rows could be replacing earlier rows.
> >
> > You say: "if I have 200
> > rows indexed from postgres and 100 rows from Oracle, the full-import
> > process only indexes 200 documents from oracle, although it shows
> > clearly that the query returned 300 rows."
> >
> > That really looks like a typo: if you have 100 rows from Oracle, how
> > did you get 200 rows from Oracle?
> >
> > Are you perhaps doing this in two different jobs and deleting the
> > first import before running the second?
> >
> > And if this is irrelevant, could you provide more details like how you're
> > indexing things (I'm assuming DIH, but you don't state that anywhere).
> > If it *is* DIH, providing that configuration would help.
> >
> > Best
> > Erick
> >
> > On Tue, Jul 6, 2010 at 11:19 AM, Xavier Rodriguez <xee...@gmail.com> wrote:
> >
> > > Hi,
> > >
> > > I have Solr installed on a Tomcat application server. This Solr
> > > instance has some data indexed from a postgres database. Now I need
> > > to add some entities from an Oracle database. When I run the
> > > full-import command, the documents indexed are only documents from
> > > postgres. In fact, if I have 200 rows indexed from postgres and 100
> > > rows from Oracle, the full-import process only indexes 200 documents
> > > from oracle, although it shows clearly that the query returned 300
> > > rows.
> > >
> > > I'm not doing a delta-import, simply a full import. I've tried to
> > > clean the index, reload the configuration, and manually remove
> > > dataimport.properties because it's the only metadata I found. Is
> > > there any other file to check or modify just to get all 300 rows
> > > indexed?
> > >
> > > Of course, I tried searching for one of those Oracle fields, with no results.
> > >
> > > Thanks a lot,
> > >
> > > Xavier Rodriguez.
> > >
> >
>
