Hi,

I've got a list of ~20000 elements that look like this:

[1]
"A00096:A00096:A00096:A00096:A02178:A02178:A07776"

[2]
"A00046:A00076:A01101:A04146:A05671:A07169"

[3]
"A00038:A00932:A02185:A02370:A02818:A02818:A02818:A02818:A04732:A07142:A07142"

[4]
"A00096:A01352:A01352:A02023:A05001:A05001:A07776"

[5]
"A00036:A00047:A00059:A00503:A00904:A00904:A00904:A01023:A01023:A01399:A02029:A03941:A07679"
[6]
"A00041:A00533:A00855:A02178:A02178:A02178:A05671:A05671:A05671:A05671:A05671:A05671:A05671"
...

And I would like to have a table with the frequency of occurrences for
matching subvectors in all elements, i.e., not
only the number of times a vector is found but also how many times a
subvector (of at least 2 ids) is found.

How can I do that?
Thanks in advance,
Albert.

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to