Using arbitrary strings as ids affects, at the very least, the traffic between the shard(s) and a querying client, or between the shards and a frontend Solr instance. We actually hit such an issue, described here: https://issues.apache.org/jira/browse/SOLR-4903, which triggered the suggestion for id compaction: https://issues.apache.org/jira/browse/SOLR-4904
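To get a feel for the traffic side, here is a rough back-of-the-envelope sketch in Python. It only counts the raw UTF-8 size of the id values themselves; it does not model Solr's actual wire format, field names, or compression. The sample URLs, the row count, and the helper name id_payload_bytes are all made up for illustration.

```python
# Rough estimate of how many bytes the id values alone add to a response.
# The sample ids and the row count below are hypothetical.

def id_payload_bytes(ids):
    """Total UTF-8 size of the ids when each one is serialized as text."""
    return sum(len(str(doc_id).encode("utf-8")) for doc_id in ids)

rows = 1000  # assumed number of ids returned per shard request

# URL-style string ids versus plain integer ids for the same documents.
url_ids = [f"http://example.com/products/item-{i:07d}" for i in range(rows)]
int_ids = list(range(rows))

url_bytes = id_payload_bytes(url_ids)
int_bytes = id_payload_bytes(int_ids)

print(f"URL ids: {url_bytes} bytes, int ids: {int_bytes} bytes")
print(f"ratio: {url_bytes / int_bytes:.1f}x")
```

The gap grows with the number of rows requested and with the number of shards answering, since each shard sends its own list of ids back for merging.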
Lucene uses internal doc ids, so I would wildly guess it shouldn't matter what ids are used on the application level, but would love to hear more on this topic from someone who knows for sure.

Dmitry

On Fri, Oct 4, 2013 at 11:51 AM, Zahoor Mohamed <zah...@indix.com> wrote:

> Hi
>
> Does the size of ID field matter .. in terms of memory usage...and query
> performance...
>
> i.e. will Solr use more memory if you use a URL string as ID field instead
> of a int value?
>
> ./zahoor
>