Hi,

although it takes some 10 minutes to do the calculation (the original
dataframe has more than 100 000 rows..), it seems to work ok.
Thanks for your time and help,

Zuzana

On 25 March 2013 15:37, Rui Barradas <ruipbarra...@sapo.pt> wrote:

> Hello,
>
> I believe the following solves your problem. It's a bit more complicated
> but with the sample dataset you've provided the result is as wished.
>
>
>
> tmp <- lapply(split(subz, subz$time), function(x) {
>         i1 <- which(as.character(x$fix) == "noon")[1]
>         i2 <- which(as.character(x$fix) == "midnight")
>         x$day[i2] <- x$day[i1]
>         x})
> names(tmp) <- NULL
> result <- do.call(rbind, tmp)
>
>
>
> Hope this helps,
>
> Rui Barradas
>
> Em 25-03-2013 14:15, zuzana zajkova escreveu:
>
>> Hi Rui,
>>
>> thank you for your code, but unfortunately it doesn't work correctly. What
>> I got is this:
>>
>>
>>  subz
>>>
>>           jul                time    dtime      fix    ddawn    ddusk day
>> 101608 15006 2011-02-01 19:14:49 19.24694     noon 7.916667 19.88333   1
>> 101609 15006 2011-02-01 19:24:49 19.41361 midnight 7.916667 19.56667   1
>> 101610 15006 2011-02-01 19:24:49 19.41361     noon 7.916667 19.88333   1
>> 101611 15006 2011-02-01 19:34:49 19.58028 midnight 7.916667 19.56667   1
>>
>> 101612 15006 2011-02-01 19:34:49 19.58028     noon 7.916667 19.88333   1
>> 101613 15006 2011-02-01 19:44:49 19.74694 midnight 7.916667 19.56667   1
>> 101614 15006 2011-02-01 19:44:49 19.74694     noon 7.916667 19.88333   1
>>   101615 15006 2011-02-01 19:54:49 19.91361 midnight 7.916667 19.56667   1
>>
>> 101616 15006 2011-02-01 19:54:49 19.91361     noon 7.916667 19.88333   0
>> 101617 15006 2011-02-01 20:04:49 20.08028 midnight 7.916667 19.56667   0
>> 101618 15006 2011-02-01 20:04:49 20.08028     noon 7.916667 19.88333   0
>>
>> Where "day" for "time" 19:54:49 for midnight is 1 and for noon is 0. There
>> are supposed to be 0 both (as "dtime" 19.91361 > "ddusk" for noon
>> 19.88333)
>> Probably the problem would be adding 1 to he index in
>> subz$day[idx + 1] <- subz$day[idx]
>> So far, I haven't found solution...
>>
>> Zuzana
>>
>>
>>
>> On 22 March 2013 20:01, Rui Barradas <ruipbarra...@sapo.pt> wrote:
>>
>>  Hello,
>>>
>>> Try the following.
>>>
>>>
>>> idx <- which(subz$fix == "noon")
>>> if(idx[length(idx)] == nrow(subz)) idx <- idx[-length(idx)]
>>> subz$day[idx + 1] <- subz$day[idx]
>>>
>>>
>>> Hope this helps,
>>>
>>> Rui Barradas
>>>
>>> Em 22-03-2013 18:18, zuzana zajkova escreveu:
>>>
>>>  Hi,
>>>>
>>>> I would appreciate if somebody could help me with this small issue...
>>>> I have a dataframe like this (originaly has more than 100 000 rows):
>>>>
>>>>   subz
>>>>
>>>>>
>>>>>             jul                time    dtime      fix    ddawn
>>>>  ddusk day
>>>> 101608 15006 2011-02-01 19:14:49 19.24694     noon 7.916667 19.88333   1
>>>> 101609 15006 2011-02-01 19:24:49 19.41361 midnight 7.916667 19.56667   1
>>>> 101610 15006 2011-02-01 19:24:49 19.41361     noon 7.916667 19.88333   1
>>>> 101611 15006 2011-02-01 19:34:49 19.58028 midnight 7.916667 19.56667   0
>>>> 101612 15006 2011-02-01 19:34:49 19.58028     noon 7.916667 19.88333   1
>>>> 101613 15006 2011-02-01 19:44:49 19.74694 midnight 7.916667 19.56667   0
>>>> 101614 15006 2011-02-01 19:44:49 19.74694     noon 7.916667 19.88333   1
>>>> 101615 15006 2011-02-01 19:54:49 19.91361 midnight 7.916667 19.56667   0
>>>> 101616 15006 2011-02-01 19:54:49 19.91361     noon 7.916667 19.88333   0
>>>> 101617 15006 2011-02-01 20:04:49 20.08028 midnight 7.916667 19.56667   0
>>>> 101618 15006 2011-02-01 20:04:49 20.08028     noon 7.916667 19.88333   0
>>>>
>>>>   dput(subz)
>>>>
>>>>>
>>>>>  structure(list(jul = c(15006, 15006, 15006, 15006, 15006, 15006,
>>>> 15006, 15006, 15006, 15006, 15006), time = structure(c(1296587689,
>>>> 1296588289, 1296588289, 1296588889, 1296588889, 1296589489, 1296589489,
>>>> 1296590089, 1296590089, 1296590689, 1296590689), class = c("POSIXct",
>>>> "POSIXt"), tzone = "GMT"), dtime = c(19.2469444444444, 19.4136111111111,
>>>> 19.4136111111111, 19.5802777777778, 19.5802777777778, 19.7469444444444,
>>>> 19.7469444444444, 19.9136111111111, 19.9136111111111, 20.0802777777778,
>>>> 20.0802777777778), fix = structure(c(2L, 1L, 2L, 1L, 2L, 1L,
>>>> 2L, 1L, 2L, 1L, 2L), .Label = c("midnight", "noon"), class = "factor"),
>>>>       ddawn = c(7.91666666666667, 7.91666666666667, 7.91666666666667,
>>>>       7.91666666666667, 7.91666666666667, 7.91666666666667,
>>>> 7.91666666666667,
>>>>       7.91666666666667, 7.91666666666667, 7.91666666666667,
>>>> 7.91666666666667
>>>>       ), ddusk = c(19.8833333333333, 19.5666666666667, 19.8833333333333,
>>>>       19.5666666666667, 19.8833333333333, 19.5666666666667,
>>>> 19.8833333333333,
>>>>       19.5666666666667, 19.8833333333333, 19.5666666666667,
>>>> 19.8833333333333
>>>>       ), day = c(1, 1, 1, 0, 1, 0, 1, 0, 0, 0, 0)), .Names = c("jul",
>>>> "time", "dtime", "fix", "ddawn", "ddusk", "day"), row.names =
>>>> 101608:101618, class = "data.frame")
>>>>
>>>> where "day" is calculated as
>>>>
>>>> subz$day <- ifelse( subz$dtime > subz$ddusk | subz$dtime < subz$ddawn,
>>>> 0,
>>>> 1
>>>> )
>>>>
>>>> The way I would like to calculate "day" is this
>>>> - for the same "time",  the "day" is calculated for "noon" as mentioned
>>>> above but  for "midnight" is just copying the same value as for "noon".
>>>> So for the same "time" the "day" value should be the same for "noon" and
>>>> "midnight".
>>>> Something like this:
>>>>
>>>>     jul                time    dtime      fix    ddawn    ddusk day
>>>> 101608 15006 2011-02-01 19:14:49 19.24694     noon 7.916667 19.88333   1
>>>> 101609 15006 2011-02-01 19:24:49 19.41361 midnight 7.916667 19.56667   1
>>>> 101610 15006 2011-02-01 19:24:49 19.41361     noon 7.916667 19.88333   1
>>>> 101611 15006 2011-02-01 19:34:49 19.58028 midnight 7.916667 19.56667   1
>>>> 101612 15006 2011-02-01 19:34:49 19.58028     noon 7.916667 19.88333   1
>>>> 101613 15006 2011-02-01 19:44:49 19.74694 midnight 7.916667 19.56667   1
>>>> 101614 15006 2011-02-01 19:44:49 19.74694     noon 7.916667 19.88333   1
>>>> 101615 15006 2011-02-01 19:54:49 19.91361 midnight 7.916667 19.56667   0
>>>> 101616 15006 2011-02-01 19:54:49 19.91361     noon 7.916667 19.88333   0
>>>> 101617 15006 2011-02-01 20:04:49 20.08028 midnight 7.916667 19.56667   0
>>>> 101618 15006 2011-02-01 20:04:49 20.08028     noon 7.916667 19.88333   0
>>>>
>>>> Where I get stuck, is I don't know how to get the value for "midnight".
>>>>
>>>> Any suggestion is welcome. Thanks
>>>>
>>>> Zuzana
>>>>
>>>>          [[alternative HTML version deleted]]
>>>>
>>>> ______________________________****________________
>>>> R-help@r-project.org mailing list
>>>> https://stat.ethz.ch/mailman/****listinfo/r-help<https://stat.ethz.ch/mailman/**listinfo/r-help>
>>>> <https://stat.**ethz.ch/mailman/listinfo/r-**help<https://stat.ethz.ch/mailman/listinfo/r-help>
>>>> >
>>>> PLEASE do read the posting guide http://www.R-project.org/**
>>>> posting-guide.html 
>>>> <http://www.R-project.org/**posting-guide.html<http://www.R-project.org/posting-guide.html>
>>>> >
>>>>
>>>> and provide commented, minimal, self-contained, reproducible code.
>>>>
>>>>
>>>>
>>

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to