Re: Easiest way to export the entire index

2020-01-31 Thread Vincenzo D'Amore
Hi Amanda, I did this https://github.com/freedev/solr-import-export-json and works with/without cursormark, in case your index does not have an unique key (primary key). On Fri, Jan 31, 2020 at 1:18 PM Amanda Shuman wrote: > Thanks all! > > I wasn't familiar with using curl at the command lin

Re: Easiest way to export the entire index

2020-01-31 Thread Amanda Shuman
Thanks all! I wasn't familiar with using curl at the command line at all, but I did try a basic curl yesterday based on this thread, admin console attribute syntax, and the tutorial in solr documentation ( https://lucene.apache.org/solr/guide/8_4/solr-tutorial.html) and was able to produce the fil

Re: Easiest way to export the entire index

2020-01-29 Thread Edward Ribeiro
HI Amanda, Below is crude prototype in Bash that fetches documents from Solr using cursorMark: https://gist.github.com/eribeiro/de1588aaa1759c02ea40cc281e8aedc8 This is a crude prototype, but should shed some light for your use case (I copied the code below too): Best, Edward --

Re: Easiest way to export the entire index

2020-01-29 Thread Steve Ge
@Amanda You can try using curl and write output to a file   curl http://localhost:8983/Solr?q={theSolrQuery) > out.json   theSolrQuery - you need to specify all attrs you want exported, not just * If you are on Windows, there is a Windows curl tool you can download to use Steve On Wed, Ja

Re: Easiest way to export the entire index

2020-01-29 Thread David Hastings
i do this often and just create a 30gb file using wget, On Wed, Jan 29, 2020 at 10:21 AM Emir Arnautović < emir.arnauto...@sematext.com> wrote: > Hi Amanda, > I assume that you have all the fields stored so you will be able to export > full document. > > Several thousands records should not be to

Re: Easiest way to export the entire index

2020-01-29 Thread Emir Arnautović
Hi Amanda, I assume that you have all the fields stored so you will be able to export full document. Several thousands records should not be too much to use regular start+rows to paginate results, but the proper way of doing that would be to use cursors. Adjust page size to avoid creating huge