The streaming API looks like it's meant to be run from the client app server, much like a standard Solr search. When I run a basic streaming operation, the memory consumption occurs in the app server JVM, not the Solr server JVM, which is the opposite of what I was expecting.
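To illustrate what I think is happening: as far as I can tell, the stream classes are decorators, so each one pulls tuples into whichever JVM calls open() and read(). Below is a toy model in plain Java, not Solr's code. The names ToyStream, ListStream and ToySortStream are made up for this sketch. It only shows why a sorting decorator has to buffer its whole input in the caller's heap before it can return the first tuple.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.Iterator;
import java.util.List;

public class ToyStreams {

    // Hypothetical stand-in for a tuple stream; not a Solr interface.
    interface ToyStream {
        void open();
        Integer read(); // returns null at end of stream
    }

    // Stands in for a source stream that hands tuples to the caller one by one.
    static class ListStream implements ToyStream {
        private final List<Integer> data;
        private Iterator<Integer> it;

        ListStream(List<Integer> data) { this.data = data; }

        public void open() { it = data.iterator(); }

        public Integer read() { return it.hasNext() ? it.next() : null; }
    }

    // A sorting decorator: it cannot emit anything until it has seen everything,
    // so open() drains the inner stream into a local buffer in this JVM.
    static class ToySortStream implements ToyStream {
        private final ToyStream inner;
        private final List<Integer> buffer = new ArrayList<>();
        private int pos;

        ToySortStream(ToyStream inner) { this.inner = inner; }

        public void open() {
            inner.open();
            buffer.clear();
            pos = 0;
            Integer t;
            while ((t = inner.read()) != null) {
                buffer.add(t); // whole input now sits in the caller's heap
            }
            buffer.sort(Comparator.naturalOrder());
        }

        public Integer read() { return pos < buffer.size() ? buffer.get(pos++) : null; }

        int bufferedCount() { return buffer.size(); }
    }

    public static void main(String[] args) {
        ToySortStream sorted = new ToySortStream(new ListStream(Arrays.asList(3, 1, 2)));
        sorted.open(); // all three tuples are buffered here, before any read()
        System.out.println("buffered after open(): " + sorted.bufferedCount());
        Integer t;
        while ((t = sorted.read()) != null) {
            System.out.println(t);
        }
    }
}
```

If the real streams behave like this toy, then the JVM that constructs and opens the outermost stream pays the memory cost of every buffering stage beneath it.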
(pseudo code)
Stream A = new CloudSolrStream();
Stream B = new CloudSolrStream();
Stream C = new HashJoinStream(A, B);
Stream D = new SortStream(C);
Stream E = new ReducerStream(D);
E.open();

The SortStream is processed in memory when open() is called. Can the processing be pushed off to the Solr cluster? Is that what the ParallelStream will do, using worker collections?

Confused,
Ted