Thanks Emmanuel, I should have been clearer about my use of the word 'dynamic'. I actually meant PHP files that do not pull content from a database. I will take a look at the links you provided. Thanks again for putting me on the right path.
Steve

-----Original Message-----
From: Emmanuel Espina [mailto:espinaemman...@gmail.com]
Sent: Thursday, February 02, 2012 4:49 PM
To: solr-user@lucene.apache.org
Subject: Re: solr to index php files

What do you mean by static PHP files? As far as I know, PHP is for making pages dynamic.
If you want to index dynamic pages as if they were plain HTML, you will have to download them and add them to Solr.

Writing a small program with SolrJ and an HTTP library (http://hc.apache.org/httpclient-3.x/) to download the pages is the usual thing. Generating a list of URLs, downloading those files to a temporary location, and adding them to Solr is the typical approach.

To add them to Solr you can use http://wiki.apache.org/solr/ExtractingRequestHandler or parse them yourself using a library such as TagSoup (http://ccil.org/~cowan/XML/tagsoup/), which I haven't tested myself but which is apparently very robust.

2012/2/2 Reid, Stephen <sr...@novantas.com>:
> Hi,
>
> I am a beginner with Solr and would like to index dynamic php files
> (page.php?ID=233) and static php files and .shtml files. This is for a small
> website, which hits a small MySQL database on the backend, however some php
> files are static and are not part of the database.
>
> Can you tell me the best way to achieve this?
>
> Also, I know that XML data is returned by default, but how do I go about
> creating a custom page for the results?
>
>
> Thanks,
> Steve
>
>
>
> IMPORTANT NOTICE: This message is intended only for the addressee and
> may contain confidential, privileged information. If you are not the
> intended recipient, you may not use, copy or disclose any information
> contained in the message. If you have received this message in error,
> please notify the sender by reply e-mail and delete the message.
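
For anyone finding this thread later, the approach Emmanuel describes (generate a list of URLs, download each rendered page, hand it to the ExtractingRequestHandler) might look roughly like the sketch below. It uses Python's standard library rather than SolrJ and HttpClient, purely to keep the example short. The Solr address, the /update/extract path, the site name, and the function names are all assumptions for illustration and are not from the thread; adjust them to your own setup.

```python
import urllib.parse
import urllib.request

# Assumed locations, not taken from the thread: change to match your install.
SOLR_EXTRACT_URL = "http://localhost:8983/solr/update/extract"
SITE_BASE = "http://www.example.com"


def build_page_urls(base, static_paths, page_ids):
    """Return the URLs to fetch: static pages first, then page.php?ID=n pages."""
    base = base.rstrip("/")
    urls = [base + "/" + path.lstrip("/") for path in static_paths]
    for page_id in page_ids:
        query = urllib.parse.urlencode({"ID": page_id})
        urls.append(base + "/page.php?" + query)
    return urls


def build_extract_url(solr_extract_url, doc_id, commit=False):
    """Build the extract request URL; literal.id sets the Solr document id."""
    params = {"literal.id": doc_id}
    if commit:
        params["commit"] = "true"
    return solr_extract_url + "?" + urllib.parse.urlencode(params)


def index_page(page_url, solr_extract_url=SOLR_EXTRACT_URL):
    """Download the rendered page and POST its HTML to Solr for extraction."""
    # Fetching over HTTP means PHP has already run, so Solr only sees HTML.
    with urllib.request.urlopen(page_url, timeout=30) as response:
        html = response.read()
    request = urllib.request.Request(
        build_extract_url(solr_extract_url, page_url, commit=True),
        data=html,
        headers={"Content-Type": "text/html"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as solr_response:
        return solr_response.status
```

To use it, build the list with something like build_page_urls(SITE_BASE, ["about.php", "contact.shtml"], range(1, 300)) and call index_page on each URL. Using the page URL as the document id means re-running the script replaces existing documents instead of duplicating them, assuming id is the unique key in your schema. Committing on every page is simple but slow; for more than a few hundred pages you would probably commit once at the end.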