On Sep 15, 2011, at 6:00 PM, nikki22 wrote:
Hi, newbie here,
I was wondering if anyone can out, I have checked through the help
archives
but cannot find my answer.
I have three nexus files mostly with the same taxa but some are
different. I
want to concatenate the DNA sequences from these three files into
one large
nexus file but I cannot seem to accomplish it!
This is an example:
file1
taxon1 aaggttcc
taxon2 aagtttcg
taxon3 aaggttcc
file2
taxon2 ctgctg
taxon3 ctggtc
taxon4 ctgctg
file3
taxon1 ggttggtta
taxon2 ggttggttc
taxon4 ggttggttc
dat.mrg <- merge(
merge(file1,file2, by="V1", all=TRUE),
file3,
by="V1", all=TRUE)
> dat.mrg$pseqs <- with(dat.mrg, paste(V2.x , V2.y , V2,
sep=""))
> dat.mrg$pseqs <- gsub("NA", "??????", dat.mrg$pseqs)
> dat.mrg[ , c(1,5)]
V1 pseqs
1 taxon1 aaggttcc??????ggttggtta
2 taxon2 aagtttcgctgctgggttggttc
3 taxon3 aaggttccctggtc??????
4 taxon4 ??????ctgctgggttggttc
this is what i want to end up with (where ? means missing data) in
nexus
format:
taxon1 aaggttcc??????ggttggtta
taxon2 aagtttcgctgctgggttggttc
taxon3 aaggttccctggtc????????
taxon4 ??????????????ggttggttc
Thanks in advance for any help!
David Winsemius, MD
West Hartford, CT
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.