I do not know what happend but the text didn't get formatted correctly
on the list. But this is how the out put should really have been:
a g a t a g a t c g c a t c g a - - - - - - dog
a c g c t t c g a t a c g c t a g c t t a - cat
a g a t a t a c g g g t t - - - - - - - - - mouse
That is, I want the edited sequence data and the name on the same line.
-Thanks
-Mike
In article <[EMAIL PROTECTED]>, [EMAIL PROTECTED] (John W. Krahn)
wrote:
> Mike Robeson wrote:
> >
> > Hello,
>
> Hello,
>
> > I am a relatively new PERL beginner and have been trying to work with
> > simple bioinformatics stuff. I have so far written some very useful but
> > simple bioinformatics scripts. However.... recently I have been trying
> > to work on a script to no avail. I have a text file whose contents are:
> >
> > >dog
> > agatagatcgcatcga
> > >cat
> > acgcttcgatacgctagctta
> > >mouse
> > agatatacgggt
> >
> > .... and so on...
> >
> > I would like to turn that into this:
> >
> > a g a t a g a t c g c a t c g a - - - - - - - - - - - - - - -
> > dog
> > a c g c t t c g a t a c g c t a g c t t a - - - - - - - - - -
> > cat
> > a g a t a t a c g g g t t - - - - - - - - - - - - - - - - - - -
> > mouse
> >
> > Notice that the sequence of letters varies however I need the lines in
> > the newly formed file to be equal in length by adding the appropriate
> > amount of dashes. For those in the know I am trying to convert a FASTA
> > file into a DCSE file.
> >
> > I have been beating my head for the past 2 weeks and I cannot figure
> > out how to do this. I do not expect a complete answer (I would like to
> > try figuring this out on my own as much as possible) but rather some
> > guidance. Any detailed pseudo-code would be appreciated!!
>
> According to your data this should work:
>
> #!/usr/bin/perl
> use warnings;
> use strict;
>
> my $len = 30; # pad out to this length
> while ( <DATA> ) {
> unless ( s/^\s*>// ) {
> chomp;
> my @char = ( split( // ), ( '-' ) x ( $len - length ) );
> $_ = "@char\n";
> }
> print;
> }
>
> __DATA__
> >dog
> agatagatcgcatcga
> >cat
> acgcttcgatacgctagctta
> >mouse
> agatatacgggt
>
>
>
> John
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]