Hi, I’m using R (TM package) for text mining and I’m having problems
filtering articles out of my data set by local meta data.



Here is the code:



*data <- ("C:/… /19970331")*

* *

* *

*rs <- ReutersSource(data , encoding = "UTF-8")*

*RC <- VCorpus(DirSource(data), readerControl = list(reader =
readRCV1asPlain,*

*
language = "en_US",*

*
load = TRUE),*

*
     dbControl = list(useDb = TRUE,*

*
      dbName = "texts.db",*

*
      dbType = "DB1"))*

* *

* *

* *

*tm_index(RC, FUN = sFilter, doclevel = F, useMeta = T,  "Topics == 'MCAT'")
*

* *

* *



When I use  sFilter, I can only filter fields in yellow, I want to filter
fields in red, what am I doing wrong?



Thanks, Andy



This is meta data that is attached to each article



Available meta data pairs are:

  Author       :

  DateTimeStamp: 1997-03-31

  Description  :

  Heading      : USA: WHX begins tender offer for Dynamics Corp.

  ID           : 476871

  Language     : en_US

  Origin       : Reuters Corpus Volume 1

User-defined local meta data pairs are:

$Publisher

[1] "Reuters Holdings Plc"



$Topics

[1] "C18"  "C181" "CCAT"



$Industries

[1] "I22100" "I34000"



$Countries

[1] "USA"

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to