Re: Indexing Wikipedia with Solr/Lucene

2012-05-13 Thread András Bártházi
Hi, Using the RegexTransformer? I guess you can make a regular expression for the wikipedia text field to extract category and external links. Bye, Andras 2012/5/13 vineet yadav > Hi all, > I want to create Lucene/Solr index of

Fwd: Indexing Wikipedia with Solr/Lucene

2012-05-13 Thread vineet yadav
Hi all, I want to create Lucene/Solr index of wikipedia xml dump. I used Solr example(http://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia) to index wikipedia xml dump. Since in wikipedia, Category and external links are part of wikipedia text, I am not able to index category a