Re: Data Import from RDBMS+File

2013-07-08 Thread Alexandre Rafalovitch
You can mix and match the data sources in nested entities, yes. Just make sure that you declare your data sources at the top and refer to them properly. As per documentation: "Ensure that the dataSource is of type DataSource<Reader> (FileDataSource, URLDataSource)". So you need to declare one at the top of

Re: Data Import from RDBMS+File

2013-07-08 Thread Raheel Hasan
OK, great. Can I use this EntityProcessor within JdbcDataSource? Like this: http://localhost/project_1/files/a.txt"; dataSource="data-source-name"> By the way, I currently load the field into "text_en_splitting" as defined in schema.xml... On Mon, Jul 8, 2013 at 7:

Re: Data Import from RDBMS+File

2013-07-08 Thread Alexandre Rafalovitch
http://wiki.apache.org/solr/DataImportHandler#PlainTextEntityProcessor or http://wiki.apache.org/solr/DataImportHandler#LineEntityProcessor ? The file name gets exposed as a ${entityname.fieldname} variable. You can probably copy/manipulate it with a transformer on the external entity before it hi
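For readers of this thread, the nested-entity approach discussed here might look roughly like the data-config.xml below. This is only a sketch: the data source names (db, files), the entity names (item, body), the column names (id, title, file_path), the Solr field names, the JDBC driver, the connection URL and the credentials are all assumptions made for illustration and are not taken from the original posts. The outer entity reads rows from table_1 over JDBC; the inner entity uses PlainTextEntityProcessor with a FileDataSource to read the file whose path comes from the outer row, exposed as ${item.file_path}.

```xml
<dataConfig>
  <!-- Both data sources are declared at the top and referred to by name. -->
  <!-- Driver, URL, user and password are placeholders. -->
  <dataSource name="db" type="JdbcDataSource"
              driver="com.mysql.jdbc.Driver"
              url="jdbc:mysql://localhost/project_1"
              user="solr" password="secret"/>
  <!-- A Reader-based data source, as PlainTextEntityProcessor requires. -->
  <dataSource name="files" type="FileDataSource" encoding="UTF-8"/>

  <document>
    <!-- Outer entity: one Solr document per row of table_1. -->
    <entity name="item" dataSource="db"
            query="SELECT id, title, file_path FROM table_1">
      <field column="id" name="id"/>
      <field column="title" name="title"/>

      <!-- Inner entity: reads the whole file named by the outer row. -->
      <!-- PlainTextEntityProcessor puts the content in the "plainText" column. -->
      <entity name="body" processor="PlainTextEntityProcessor"
              dataSource="files"
              url="${item.file_path}">
        <field column="plainText" name="text"/>
      </entity>
    </entity>
  </document>
</dataConfig>
```

If the files are served over HTTP rather than read from local disk, a URLDataSource could be declared in place of the FileDataSource. The "text" field here stands in for whatever field in schema.xml should receive the file content.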

Re: Data Import from RDBMS+File

2013-07-08 Thread Raheel Hasan
On this page (http://wiki.apache.org/solr/DataImportHandler), I can't see how it's possible. Perhaps there is another guide. Basically, this is what I am doing: indexing data from multiple tables into Solr (see here http://wiki.apache.org/solr/DIHQuickStart). I need to skip 1 very big heavy table as i

Re: Data Import from RDBMS+File

2013-07-08 Thread Alexandre Rafalovitch
Did you have a chance to look at DIH with nested entities yet? That's probably the way to go to start out. Or a custom client, of course. Or, ETL solutions that support Solr (e.g. Apache Flume - not personally tested yet). Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedI

Data Import from RDBMS+File

2013-07-08 Thread Raheel Hasan
Hi everyone, I am looking for a way to import/index data such that I load data from table_1 and, instead of joining with table_2, import the rest of the "joined" data from a file. The name of the file comes from a field in table_1. Is it possible? And is it easily possible? -- Reg