Try this:

gsub("(.*)[ \n]\\w{3}/\\d{2}", "\\1", avprix$DESCRIPTION)

On Wed, Apr 28, 2010 at 9:30 AM, arnaud Gaboury <arnaud.gabo...@gmail.com>wrote:

> TY so much david. We are getting close. But I need to keep "USD" in my
> object name (i.e "STANDARD LEAD USD")
>
>
>
> ***************************
> Arnaud Gaboury
> Mobile: +41 79 392 79 56
> BBM: 255B488F
> ***************************
>
>
> > -----Original Message-----
> > From: David Winsemius [mailto:dwinsem...@comcast.net]
> > Sent: Wednesday, April 28, 2010 2:25 PM
> > To: arnaud Gaboury
> > Cc: r-help@r-project.org
> > Subject: Re: [R] data frame manipulation and regex
> >
> >
> > On Apr 28, 2010, at 5:14 AM, arnaud Gaboury wrote:
> >
> > > Dear group,
> > >
> > > Here is my data.frame :
> > >
> > > avprix <-
> > > structure(list(DESCRIPTION = c("CORN Jul/10", "CORN May/10",
> > > "ROBUSTA COFFEE (10) Jul/10", "SOYBEANS Jul/10", "SPCL HIGH GRADE
> > > ZINC USD
> > > Jul/10",
> > > "STANDARD LEAD USD Jul/10"), prix = c(-1.5, -1082, 11084, 1983.5,
> > > -2464, -118), quantity = c(0, -3, 8, 2, -1, 0)), .Names =
> > > c("DESCRIPTION",
> > > "prix", "quantity"), row.names = c(NA, -6L), class = "data.frame")
> > >
> > >> avprix
> > >                      DESCRIPTION    prix quantity
> > > 1                     CORN Jul/10    -1.5        0
> > > 2                     CORN May/10 -1082.0       -3
> > > 3      ROBUSTA COFFEE (10) Jul/10 11084.0        8
> > > 4                 SOYBEANS Jul/10  1983.5        2
> > > 5 SPCL HIGH GRADE ZINC USD Jul/10 -2464.0       -1
> > > 6        STANDARD LEAD USD Jul/10  -118.0        0
> > >
> > > I need to remove the date (i.e. Jul/10 in this example) for each
> > > element of
> > > the DESCRIPTION column that contains the USD symbol. I am trying to
> > > do this
> > > using regular expressions, but must admit I am going nowhere.
> > > My elements in the DESCRIPTION column and the dates can change every
> > > day.
> >
> > This searches for the pattern USD and then replaces any three
> > characters , forward-slash, any two characters:
> >  > sub("USD+.*(.../..)", "", avprix$DESCRIPTION)
> > [1] "CORN Jul/10"                "CORN May/10"                "ROBUSTA
> > COFFEE (10) Jul/10"
> > [4] "SOYBEANS Jul/10"            "SPCL HIGH GRADE ZINC "
> > "STANDARD LEAD "
> >
> > This tightens up the matching by requiring that that the characters
> > after the slash be digits:
> >
> >  > sub("USD+.*(.../\\d{2})", "", avprix$DESCRIPTION)
> > [1] "CORN Jul/10"                "CORN May/10"                "ROBUSTA
> > COFFEE (10) Jul/10"
> > [4] "SOYBEANS Jul/10"            "SPCL HIGH GRADE ZINC "
> > "STANDARD LEAD "
> >
> > -- David.
> >
> >
> >  >
> > >
> > > TY for any help.
> > >
> > > ______________________________________________
> > > R-help@r-project.org mailing list
> > > https://stat.ethz.ch/mailman/listinfo/r-help
> > > PLEASE do read the posting guide http://www.R-project.org/posting-
> > guide.html
> > > and provide commented, minimal, self-contained, reproducible code.
> >
> > David Winsemius, MD
> > West Hartford, CT
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



-- 
Henrique Dallazuanna
Curitiba-Paraná-Brasil
25° 25' 40" S 49° 16' 22" O

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to