The generic problem with all "semi-structured" documents is that the metadata has no consistent naming. I'm making up names here, but Word might have "created_on", PDF "created", etc. It's really frustrating, but each type has to be investigated to figure out which field you want to map to "created". Tika and SolrCell just map what they find.
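As a rough sketch of what such a mapping can look like once the real name is known: Solr Cell's fmap.<source>=<target> parameter renames an extracted field on the way in. The fragment below assumes the stock /update/extract handler in solrconfig.xml and reuses the made-up Word name "created_on" from above, so substitute whatever name actually shows up for your files.

```xml
<requestHandler name="/update/extract"
                class="solr.extraction.ExtractingRequestHandler">
  <lst name="defaults">
    <!-- Normalize extracted field names (lowercase, underscores) before fmap rules are applied -->
    <str name="lowernames">true</str>
    <!-- "created_on" is a hypothetical source name; replace it with the name you actually observe -->
    <str name="fmap.created_on">created</str>
  </lst>
</requestHandler>
```

The mapping is applied at index time, so documents indexed before the change have to be re-indexed to pick it up.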
One way to go about this is to map the dynamic glob pattern to a stored field, then look at what pops out. Not satisfactory, but...

<dynamicField name="*" type="string" stored="true" multiValued="true" />

Best,
Erick

On Wed, Apr 22, 2015 at 5:44 AM, Eric Meisler <eric.meis...@veritablelp.com> wrote:
> Sorry if my question was too vague. In my mind it wasn't but you led me in
> the right direction which gave me a new issue.
>
> I added the following to my schema.xml to bring back the Created Date:
> <field name="created" type="date" indexed="false" stored="true"/> but now I
> am getting back the created date for PDF files but not for Word documents
> (specifically .doc and .docx).
>
> Has anyone run into this issue? If I look at the properties for all three
> types of files the Create Date is called created so I am not sure what I am
> doing wrong.
>
> Thanks for the help in advance.
>
> Eric
>
>
>>>> Erick Erickson <erickerick...@gmail.com> 4/21/2015 11:45 AM >>>
> Not really sure what you're asking here, I must be missing something.
>
> The mapping is through the field name supplied, so as long as your input
> XML has something like
> <add>
>   <doc>
>     <field name="CreatedDate">your date here</field>
>   </doc>
> </add>
>
> it should be fine.
>
> You can use "date math" here as well, as:
> <field name="CreatedDate">NOW</field>
>
> Best,
> Erick
>
> On Tue, Apr 21, 2015 at 7:57 AM, Eric Meisler
> <eric.meis...@veritablelp.com> wrote:
>> I am a newbie and just started using Solr 4.10.3. We have successfully
>> indexed a network drive and are running searches. We now have a request to
>> show the "Created Date" for all documents (PDF/WORD/TXT/XLS) that come back
>> in our search results. I have successfully filtered on the last_modified
>> date but I cannot figure out or find out how to add a document's Created
>> Date to the schema.xml. We do not want to search on the created date since
>> last_modified date handles this but just want to display it. To my
>> understanding I need to add indexed="false" and stored="true" to the xml
>> field but I don't know how or understand how the xml name will map to the
>> document's created date property.
>>
>> This is my guess:
>> <field name="CreatedDate" type="date" indexed="false" stored="true"/>
>>
>> Can someone please supply the correct syntax for the xml and maybe a brief
>> comment on how solr maps to the actual document's property? Also, will I
>> need to re-index the drive to make this change apply?
>>
>> Thanks,
>> Eric