ProcessesΒΆ
The workflow has the following steps in it:
- Input fasta pairs are aligned to one another with muscle.
- Gap characters in aligned fasta are replaces with X character.
- The pairs are split into individual fasta files with
faSplit bynametool. - Each fasta for the pair is then fragmented using the
pyfasta splitwhere-kflag is used to define window size and-oflag is used to define overlap between fragments. -Fragmented files are then subjected toblastp. blastpoutput is then parsed to obtain a table of fragment to fragment match and percent identity shared by a pair of fragments.- The window size vs percent ID plots are generated to visualise the breakpoints of diversification and recombination.