Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Markus Jelsma
> On Wed, Apr 6, 2011 at 15:31 PM, Adam Estrada > > I wanted to know how large field's size affects performance. If you use replication then it's a huge impact on performance as the data gets sent over the network. It's also a memory hog so there's less memory and more garbage collection. Ind

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Ezequiel Calderara
On Wed, Apr 6, 2011 at 15:31 PM, Adam Estrada wrote: > Well...by default there is a pretty decent schema that you can use as a > template in the example project that builds with Solr. Tika is the library > that does the actual content extraction so it would be a good idea to try > the example pro

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Stefan Matheis
Ezequiel, Am 06.04.2011 20:38, schrieb Ezequiel Calderara: Anyone knows any storage for images that performs well, other than FS ? you may have a look on http://www.danga.com/mogilefs/ ? :) Regards Stefan

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Adam Estrada
Well...by default there is a pretty decent schema that you can use as a template in the example project that builds with Solr. Tika is the library that does the actual content extraction so it would be a good idea to try the example project out first. Adam 2011/4/6 Ezequiel Calderara > Another

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Jonathan Rochkind
On 4/6/2011 2:39 PM, Markus Jelsma wrote: Ha, there's a binary field type?! I've stored binary data in an ordinary "String" field type, and it's worked. But there were some headaches to get it to work, might have been smoother if I had realized there was actually a binary field type. How, you

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Markus Jelsma
> Hi, your answers were really helpfull > > I was thinking in putting the base64 encoded file into a string field. But > was a little worried about solr trying to stem it or vectorize or those > stuff. String field types are not analyzed. So it doesn't brutalize your data. Better use BinaryFiel

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Markus Jelsma
> Ha, there's a binary field type?! > > I've stored binary data in an ordinary "String" field type, and it's > worked. But there were some headaches to get it to work, might have > been smoother if I had realized there was actually a binary field type. How, you can't just embed control characte

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Ezequiel Calderara
Hi, your answers were really helpfull I was thinking in putting the base64 encoded file into a string field. But was a little worried about solr trying to stem it or vectorize or those stuff. Seen in the example of the schema.xml: Anyone knows any storage for images that performs well,

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Jonathan Rochkind
Ha, there's a binary field type?! I've stored binary data in an ordinary "String" field type, and it's worked. But there were some headaches to get it to work, might have been smoother if I had realized there was actually a binary field type. But wait I'm talking about Solr 'stored field', n

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Jonathan Rochkind
I put binary data in an ordinary Solr stored field, don't need any special schema. I have run into trouble making sure the data is not corrupted on the way in during indexing, depending on exactly what form of communication is being used to index (SolrJ, SolrJ with EmbeddedSolr, DIH, etc.), as

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Ryan McKinley
You can store binary data using a binary field type -- then you need to send the data base64 encoded. I would strongly recommend against storing large binary files in solr -- unless you really don't care about performance -- the file system is a good option that springs to mind. ryan 2011/4/6

Re: Solr: Images, Docs and Binary data

2011-04-06 Thread Ezequiel Calderara
Another question that maybe is easier to answer, how can i store binary data? Any example schema? 2011/4/6 Ezequiel Calderara > Hello everyone, i need to know if some has used solr for indexing and > storing images (upt to 16MB) or binary docs. > > How does solr behaves with this type of docs? H

Solr: Images, Docs and Binary data

2011-04-06 Thread Ezequiel Calderara
Hello everyone, i need to know if some has used solr for indexing and storing images (upt to 16MB) or binary docs. How does solr behaves with this type of docs? How affects performance? Thanks Everyone -- __ Ezequiel. Http://www.ironicnet.com