You'll need a python lib that uses a python ZooKeeper client to be SolrCloud-aware so that you can do RDD like things, such as reading from all shards in a collection in parallel. I'm not aware of any Solr py libs that are cloud-aware yet, but it would be a good contribution to upgrade https://github.com/toastdriven/pysolr to be SolrCloud-aware
On Mon, Mar 30, 2015 at 11:31 PM, Chaushu, Shani <shani.chau...@intel.com> wrote: > Hi, > I saw there is a tool for reading solr into Spark RDD in JAVA > I want to do something like this in python, is there any package in python > for reading solr into spark RDD? > > Thanks , > Shani > > > --------------------------------------------------------------------- > Intel Electronics Ltd. > > This e-mail and any attachments may contain confidential material for > the sole use of the intended recipient(s). Any review or distribution > by others is strictly prohibited. If you are not the intended > recipient, please contact the sender and delete all copies.