Hi, I've got a list of ~20000 elements that look like this:
[1] "A00096:A00096:A00096:A00096:A02178:A02178:A07776" [2] "A00046:A00076:A01101:A04146:A05671:A07169" [3] "A00038:A00932:A02185:A02370:A02818:A02818:A02818:A02818:A04732:A07142:A07142" [4] "A00096:A01352:A01352:A02023:A05001:A05001:A07776" [5] "A00036:A00047:A00059:A00503:A00904:A00904:A00904:A01023:A01023:A01399:A02029:A03941:A07679" [6] "A00041:A00533:A00855:A02178:A02178:A02178:A05671:A05671:A05671:A05671:A05671:A05671:A05671" ... And I would like to have a table with the frequency of occurrences for matching subvectors in all elements, i.e., not only the number of times a vector is found but also how many times a subvector (of at least 2 ids) is found. How can I do that? Thanks in advance, Albert. [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.