Supplementary material for "Evolution of sequence-diverse disordered regions in a protein family: order within the chaos"

2020-03-19T11:09:15Z (GMT) by Thomas Shafee KIM JOHNSON Tony Bacic

A set of supplementary data files

Accompanying publication: Evolution of sequence-diverse disordered regions in a protein family: order within the chaos


Supp data file 1

Excel file for the 2644 fasciclin domains, names and annotation information. In order to keep names short for phylogenies, FLAs given arbitrary identifier numbers, and fasciclin domains within them indicated by their (e.g. “>X1234_FLA.2.3” -> Fasciclin domain cluster 1, arbitrary FLA identifier number 1234, FLA fasciclin domain 2 out of 3). Numbers and colours given for fasciclin, AG, non-AG and inter-proline clusters.


Supp data file 2

Multiple sequence alignments as fasta files for all 2644 fasciclin domains, as well as separately for each cluster A-R.


Supp data file 3

Phylogenies as newick files for all 2644 fasciclin domains, as well as separately for each cluster A-R.


Supp data file 4

An [R] script to perform the analyses shown in the publication. See also github repo TS404/FLAnnotator.