Le 25/04/2023 à 17:39, Bill Dunlap a écrit :
x <- c(1, 1, 2, 2,  2, 3)
y <- c(1, 2, 1, 3, NA, 3)
str(xyTable(x,y))
List of 3
  $ x     : num [1:6] 1 1 2 2 NA 3
  $ y     : num [1:6] 1 2 1 3 NA 3
  $ number: int [1:6] 1 1 1 NA NA 1


How many (2,3)s do we have?  At least one, the third entry, but the fourth
entry, (2,NA), is possibly a (2,3) so we don't know and make the count NA.
I suspect this is not the intended logic, but a byproduct of finding value
changes in a sorted vector with the idiom x[-1]!=x[-length(x).  Also the
following does follow that logic:

x <- c(1, 1, 2, 2,  5, 6)
y <- c(2, 2, 2, 4, NA, 3)
str(xyTable(x,y))
List of 3
  $ x     : num [1:5] 1 2 2 5 6
  $ y     : num [1:5] 2 2 4 NA 3
  $ number: int [1:5] 2 1 1 1 1
Not really. If we take

  x <- c(1, 1, 2, 2,  5, 6, 5, 5)
  y <- c(2, 2, 2, 4, NA, 3, 3, 4)

we get

  str(xyTable(x,y))

List of 3
 $ x     : num [1:7] 1 2 2 5 5 NA 6
 $ y     : num [1:7] 2 2 4 3 4 NA 3
 $ number: int [1:7] 2 1 1 1 NA NA 1

How many (5, 3) we have? At least 1 but (5, NA) is possibly (5,3) so we should have NA but we have 1. How many (5, 4) we have? At least 1 but (5, NA) is possibly (5,4) and we do get NA. So restored logic is not consistent. Without talking about a pair (NA, NA) appeared and not producing (5, NA) pair.

Best,
Serguei.





table() does not use this logic, as one NA in a vector would make all the
counts NA.  Should xyTable have a way to handle NAs the way table() does?

-Bill

On Tue, Apr 25, 2023 at 1:26 AM Viechtbauer, Wolfgang (NP) <
wolfgang.viechtba...@maastrichtuniversity.nl> wrote:

Hi all,

Posted this many years ago (
https://stat.ethz.ch/pipermail/r-devel/2017-December/075224.html), but
either this slipped under the radar or my feeble mind is unable to
understand what xyTable() is doing here and nobody bothered to correct me.
I now stumbled again across this issue.

x <- c(1, 1, 2, 2,  2, 3)
y <- c(1, 2, 1, 3, NA, 3)
table(x, y, useNA="always")
xyTable(x, y)

Why does xyTable() report that there are NA instances of (2,3)? I could
understand the logic that the NA could be anything, including a 3, so the
$number value for (2,3) is therefore unknown, but then the same should
apply so (2,1), but here $number is 1, so the logic is then inconsistent.

I stared at the xyTable code for a while and I suspect this is coming from
order() using na.last=TRUE by default, but in any case, to me the behavior
above is surprising.

Best,
Wolfgang

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

        [[alternative HTML version deleted]]

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

Reply via email to