Re: DIH for sharded database?

2010-12-18 Thread Andy
--- On Sat, 12/18/10, Lance Norskog wrote: > You can have a file with 1,2,3 on > separate lines. There is a > line-by-line file reader that can pull these as separate > drivers. > Inside that entity the JDBC url has to be altered with the > incoming > numbers. I don't know if this will work. I'

Re: DIH for sharded database?

2010-12-18 Thread Lance Norskog
You can have a file with 1,2,3 on separate lines. There is a line-by-line file reader that can pull these as separate drivers. Inside that entity the JDBC url has to be altered with the incoming numbers. I don't know if this will work. It also may work for single-threaded DIH but not during multip

DIH for sharded database?

2010-12-18 Thread Andy
I have a table that is broken up into many virtual shards. So basically I have N identical tables: Document1 Document2 . . Document36 Currently these tables all live in the same database, but in the future they may be moved to different servers to scale out if the needs arise. Is there any way

Re: old index files not deleted on slave

2010-12-18 Thread Lance Norskog
This could be a quirk of the native locking feature. What's the file system? Can you fsck it? If this error keeps happening, please file this. It should not happen. Add the text above and also your solrconfigs if you can. One thing you could try is to change from the native locking policy to the

Re: Is there a way to create multiple using DIH and access the data pertaining to a particular ?

2010-12-18 Thread Lance Norskog
And, a use case: Tika blows up on some files. But we still want other data like file name etc. and an empty text field. So: Both documents have the same unique id. If the Tika autoparser uses PDF and the PDF works, the second document overwrites the

old index files not deleted on slave

2010-12-18 Thread feedly team
I have set up index replication (triggered on optimize). The problem I am having is the old index files are not being deleted on the slave. After each replication, I can see the old files still hanging around as well as the files that have just been pulled. This causes the data directory size to in

Re: Is there a way to create multiple using DIH and access the data pertaining to a particular ?

2010-12-18 Thread Lance Norskog
You can have multiple documents generated by the same data-config: It's the 'rootEntity="false" that makes the child entity a document. On Sat, Dec 18, 2010 at 7:43 AM, Dennis Gearon wrote: > Just curious, do these tables hav

Re: how to config DataImport Scheduling

2010-12-18 Thread Hamid Vahedi
I think it must work with any version of solr. because it works url base (see config file). Attention to this point: Successfully tested on Apache Tomcat v6(should work on any other servlet container) From: Ahmet Arslan To: solr-user@lucene.apache.org Sent

RE: Memory use during merges (OOM)

2010-12-18 Thread Burton-West, Tom
Thanks Robert, We will try the termsIndexInterval as a workaround. I have also opened a JIRA issue: https://issues.apache.org/jira/browse/SOLR-2290. Hope I found the right sections of the Lucene code. I'm just now in the process of looking at the Solr IndexReaderFactory and SolrIndexWriter a

Re: Is there a way to create multiple using DIH and access the data pertaining to a particular ?

2010-12-18 Thread Dennis Gearon
Just curious, do these tables have the same schema, like a set of shards would? If not, how do you map them to the index? Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so

Re: Is there a way to create multiple using DIH and access the data pertaining to a particular ?

2010-12-18 Thread Koji Sekiguchi
(10/11/11 1:57), bbarani wrote: Hi, I have a peculiar situation where we are trying to use SOLR for indexing multiple tables (There is no relation between these tables). We are trying to use the SOLR index instead of using the source tables and hence we are trying to create the SOLR index as th