Thanks, Erik. I will try your code soon.

I did this first:

> data <-
read.table('E-TABM-1-processed-data-1342561271_log2_with_symbols.txt',
row.names = NULL ,header=TRUE, fill=TRUE)
> class(data[[3]])
[1] "factor"
> is.numeric(data[[3]])
[1] FALSE
>

So it is not numeric but 'factor' instead.
Can I convert this column to numeric?

Allen

On Thu, Jun 12, 2008 at 7:48 PM, Erik Iverson <[EMAIL PROTECTED]>
wrote:

>
>
> ss wrote:
>
>> It is:
>>
>>  > data <-
>> read.table('E-TABM-1-processed-data-1342561271_log2_with_symbols.txt',
>> row.names = NULL ,header=TRUE, fill=TRUE)
>>  > class(data[3])
>> [1] "data.frame"
>>  >
>>
>>
> Oops, should have said  class(data[[3]]) and
> is.numeric(data[[3]])
>
> See ?Extract
>
>
>> And if I try to use as.matrix(read.table()), I got:
>>
>>  >data
>> <-as.matrix(read.table('E-TABM-1-processed-data-1342561271_log2_with_symbols.txt',
>> + row.names = NULL ,header=TRUE, fill=TRUE))
>>  > data[1:4,1:4]
>>     Probe_ID       Gene_Symbol M16012391010920 M16012391010525
>> [1,] "A_23_P105862" "13CDNA73"  "-1.6"          " 0.16"       [2,]
>> "A_23_P76435"  "15E1.2"    "0.18"          " 0.59"       [3,] "A_24_P402115"
>> "15E1.2"    "1.63"          "-0.62"       [4,] "A_32_P227764" "15E1.2"
>>  "-0.76"         "-0.42"
>> You see they are surrounded by "".
>>
>> I don't see such if I just use >read.table
>>
>>
> That is because matrices (objects of class 'matrix') are of homogeneous
> type.  It changes everything to a character (including the numbers), which
> you certainly do NOT want.
>
> You want a data.frame, I will provide an example of what I think you are
> after.
>
> Try the following commands and see how they compare to your situation:
> these work for me.
>
> test <- data.frame(x = factor(rep(c("A", "B"), each = 13)), y = rnorm(26),
> z = rnorm(26))
>
> test
>
> class(test)
>
> is.numeric(test[[2]])
>
> is.numeric(test[[3]])
>
> rowMeans(test)
>
> rowMeans(test[2:3])
>
>   > data <-
>> read.table('E-TABM-1-processed-data-1342561271_log2_with_symbols.txt',
>> row.names = NULL ,header=TRUE, fill=TRUE)
>>  > data[1:4,1:4]
>>      Probe_ID Gene_Symbol M16012391010920 M16012391010525
>> 1 A_23_P105862    13CDNA73            -1.6            0.16
>> 2  A_23_P76435      15E1.2            0.18            0.59
>> 3 A_24_P402115      15E1.2            1.63           -0.62
>> 4 A_32_P227764      15E1.2           -0.76           -0.42
>>
>>
>> Thanks,
>>      Allen
>>
>>
>>
>> On Thu, Jun 12, 2008 at 7:34 PM, Erik Iverson <[EMAIL PROTECTED]<mailto:
>> [EMAIL PROTECTED]>> wrote:
>>
>>
>>
>>    ss wrote:
>>
>>        Hi Wacek,
>>
>>        Yes, data is data frame not a matrix.
>>
>>            is.numeric(data[3])
>>
>>        [1] FALSE
>>
>>
>>    what is class(data[3])
>>
>>
>>        But I looked at the column 3 and it looks okay though. There are
>>        few NAs and
>>        I did find
>>        anything strange.
>>
>>        Any suggestions?
>>
>>        Thanks,
>>             Allen
>>
>>
>>
>>        On Thu, Jun 12, 2008 at 7:01 PM, Wacek Kusnierczyk <
>>        [EMAIL PROTECTED]
>>        <mailto:[EMAIL PROTECTED]>> wrote:
>>
>>            ss wrote:
>>
>>                Thank you very much, Wacek! It works very well.
>>                But there is a minor problem. I did the following:
>>
>>                    data <-
>>
>>
>>  read.table('E-TABM-1-processed-data-1342561271_log2_with_symbols.txt',
>>                +row.names = NULL ,header=TRUE, fill=TRUE)
>>
>>            looks like you have a data frame, not a matrix
>>
>>
>>                    dim(data)
>>
>>                [1] 23963    85
>>
>>                    data[1:4,1:4]
>>
>>                     Probe_ID Gene_Symbol M16012391010920 M16012391010525
>>                1 A_23_P105862    13CDNA73            -1.6            0.16
>>                2  A_23_P76435      15E1.2            0.18            0.59
>>                3 A_24_P402115      15E1.2            1.63           -0.62
>>                4 A_32_P227764      15E1.2           -0.76           -0.42
>>
>>                    data1<-data[sapply(data, is.numeric)]
>>                    dim(data1)
>>
>>                [1] 23963    82
>>
>>                    data1[1:4,1:4]
>>
>>                 M16012391010525 M16012391010843 M16012391010531
>>                M16012391010921
>>                1            0.16           -0.23           -1.40
>>                 0.90
>>                2            0.59            0.28           -0.30
>>                 0.08
>>                3           -0.62           -0.62           -0.22
>>                -0.18
>>                4           -0.42            0.01            0.28
>>                -0.79
>>
>>                You will notice that, after using 'data[sapply(data,
>>                is.numeric)]' and
>>                getting
>>                data1, the first sample in data, called
>>                'M16012391010920', was missed
>>                in data1.
>>
>>                Any further suggestions?
>>
>>            surely there must be an entry in column 3 that makes it
>>            non-numeric.
>>            what does is.numeric(data[3]) say?  (NAs should not make a
>>            column
>>            non-numeric, unless there are only NAs there, which is not
>>            the case
>>            here.)  check your data for non-numeric entries in column 3,
>>            there can
>>            be a typo.
>>
>>            vQ
>>
>>
>>               [[alternative HTML version deleted]]
>>
>>        ______________________________________________
>>        R-help@r-project.org <mailto:R-help@r-project.org> mailing list
>>
>>        https://stat.ethz.ch/mailman/listinfo/r-help
>>        PLEASE do read the posting guide
>>        http://www.R-project.org/posting-guide.html
>>        and provide commented, minimal, self-contained, reproducible code.
>>
>>
>>

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to