Dear all, I have DNA sequence data which are fasta-formatted as
>gene A;..... AAAAACCCC TTTTTGGGG CCCTTTTTT >gene B;.... CCCCCAAAA GGGGGTTTT I want to load only the lines that start with ">" where the annotation information for the gene is contained. In principle, I can remove the sequences before loading or after loading all the lines. I just wonder if there's a way to load only lines with a particular pattern. The skip argument in read.table() doesn't work for my purpose. Thanks in advance, Jimmy -- View this message in context: http://www.nabble.com/how-to-load-only-lines-that-start-with-a-particular-symbol-tp25461693p25461693.html Sent from the R help mailing list archive at Nabble.com. ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.