On Sep 15, 2011, at 6:00 PM, nikki22 wrote:

Hi, newbie here,

I was wondering if anyone can out, I have checked through the help archives
but cannot find my answer.

I have three nexus files mostly with the same taxa but some are different. I want to concatenate the DNA sequences from these three files into one large
nexus file but I cannot seem to accomplish it!

This is an example:

file1
taxon1 aaggttcc
taxon2 aagtttcg
taxon3 aaggttcc

file2
taxon2 ctgctg
taxon3 ctggtc
taxon4 ctgctg

file3
taxon1 ggttggtta
taxon2 ggttggttc
taxon4 ggttggttc


dat.mrg <- merge(
             merge(file1,file2, by="V1", all=TRUE),
             file3,
              by="V1", all=TRUE)

> dat.mrg$pseqs <- with(dat.mrg, paste(V2.x , V2.y , V2, sep=""))
> dat.mrg$pseqs <- gsub("NA", "??????", dat.mrg$pseqs)
>  dat.mrg[ , c(1,5)]
      V1                   pseqs
1 taxon1 aaggttcc??????ggttggtta
2 taxon2 aagtttcgctgctgggttggttc
3 taxon3    aaggttccctggtc??????
4 taxon4   ??????ctgctgggttggttc


this is what i want to end up with (where ? means missing data) in nexus
format:

taxon1 aaggttcc??????ggttggtta
taxon2 aagtttcgctgctgggttggttc
taxon3 aaggttccctggtc????????
taxon4 ??????????????ggttggttc

Thanks in advance for any help!




David Winsemius, MD
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to