Thanks very much. The cut function is exactly what I was looking for. For the 
graph I forgot to include an example (picture attached). I think it is 
something different from what you have shown in the examples. I want to plot 
all the data in a line plot - exactly how it is shown in the attached graph.


Mark




________________________________
From: R. Michael Weylandt <michael.weyla...@gmail.com>
To: Mark D. <d.mar...@ymail.com>
Cc: "r-help@r-project.org" <r-help@r-project.org>
Sent: Saturday, 13 August 2011, 21:39
Subject: Re: [R] Plotting and quantiles


I believe you received an informative answer to both these questions from 
Daniel Maiter one hour and twenty five minutes after sending your question: I 
repeat it here just in case you didn't get it. 

--------------------------

Q1 is very opaque because you are not even saying what kind of plot you want.
For a regular scatterplot, you have multiple options.

a.) select only the data in the given intervals and plot the data

b.) plot the entire data, but restrict the graph region to the intervals you
are interested in, or

c.) winsorize the data (i.e., set values below the lower cutoff and above
the upper cutoff to the cutoff values

Which one you want to do depends on which one makes the most sense given the
purpose of your analysis

Say:

x<-rnorm(100)
y<-x+rnorm(100)

Then

a.) plot(y~x,data=data.frame(x,y)[
x<2&x>-2 , ])
#plots y against x only for xs between -2 and 2

b.) plot(y~x,xlim=c(-2,2))

#plots all y agains x, but restricts the plotting region to -2 to 2 on the
x-axis

c.)

x<-replace(x,x>2,2)
x<-replace(x,x<(-2),-2)
plot(y~x)

#sets all x-values below -2 and above 2 to these cutoffs



Q2: look at the cut() function.

?cut

HTH,
Daniel

---------------------

If you need more information, a different solution, or further clarification, 
please ask new questions. 

Michael Weylandt


On Sat, Aug 13, 2011 at 10:10 AM, Mark D. <d.mar...@ymail.com> wrote:

Dear R users,
>
>
>This is most likely very basic question but I am new to R and would really 
>appreciate some tips on those two problems.
>
>1) I need to plot variables from a data frame. Because of some few high 
>numbers my graph is really strange looking. How could I plot a fraction of the 
>samples (like 0.1 (10%), 0.2 up to for example 0.6) on x axis and values 
>'boundaries' (like any value '< 100',  '101-200' and '> 201') on the y axis? 
>This needs to be a simple line plot like the one I attached for an example. 
>The values would come from one column.
>
>
>2) I have a data frame with values and need to subset the rows based on the 
>values. I wanted to order them (with increasing values) and divide into 3-4 
>groups. I though about using quantile but I want the group to be something 
>like '1-25', '26-50', '51-75', '75-100' (ordered and for example 25th 
>percentile, 26-50th etc). I could just look for a median divide into two and 
>then again (or use quantiles 0.25, 0.5, 0.7 and 1 and then get rid of all rows 
>in 0.25 that are in 0.5 etc) but surely there must by a faster and simpler way 
>to do that (I need to do this a lot on different columns)?
>
>Thanks for your help,
>Mark
>______________________________________________
>R-help@r-project.org mailing list
>https://stat.ethz.ch/mailman/listinfo/r-help
>PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>and provide commented, minimal, self-contained, reproducible code.
>
>

<<attachment: viewer.png>>

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to