The indexed="true" side is quite efficient. The stored="true" side is less so, since the value is written out again for every document, but the strings you have here are pretty small and I wouldn't worry about it. Solr 4.1 (unreleased) does a great job here: it compresses the stored field data across documents.
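To give a rough feel for why compressing stored data across documents helps with highly repetitive values, here is a minimal sketch. It is not how Solr stores fields: the record layout is made up for illustration, the document count is arbitrary, and zlib is only a stand-in for whatever codec Solr actually uses.

```python
import zlib

# Hypothetical stored values: every document repeats the same two small strings.
docs = [("sha1", "message")] * 10_000

# Without compression, each value is written out again for every document.
# The NUL-separated layout is an assumption for this sketch, not Solr's format.
raw = b"".join(f"{digest}\x00{type_}\x00".encode() for digest, type_ in docs)

# Compressing a block that spans many documents lets the repeats collapse.
# zlib is a stand-in here, not Solr's codec.
compressed = zlib.compress(raw)

print("uncompressed bytes:", len(raw))
print("compressed bytes:  ", len(compressed))
```

The exact numbers depend on the codec and block size, so treat the output as an illustration of the trend rather than a prediction of on-disk size.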
~ David

Jie Sun wrote
> Hi -
> our indexed documents currently store solr fields like 'digest' or 'type',
> which most of our documents will end up with same value (such as 'sha1'
> for field 'digest', or 'message' for field 'type' etc).
>
> on each solr server, we usually have 100 of millions of documents indexed
> and with the same value on these fields (fields are stored and indexed).
>
> any suggestion what is the best approach if we suspect this will be very
> inefficient on disk space usage, or is it?
>
> thanks!
> Jie

-----
Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book
--
View this message in context: http://lucene.472066.n3.nabble.com/suggestion-howto-handle-highly-repetitive-valued-field-tp4026104p4026131.html
Sent from the Solr - User mailing list archive at Nabble.com.