Random Matrix Analysis of Protein Families
Issued Date
2022-01-01
Resource Type
ISSN
19386737
eISSN
19385862
Scopus ID
2-s2.0-85133370159
Journal Title
ECS Transactions
Volume
107
Issue
1
Start Page
18877
End Page
18891
Rights Holder(s)
SCOPUS
Bibliographic Citation
ECS Transactions Vol.107 No.1 (2022) , 18877-18891
Suggested Citation
Kumari R., Bhadola P., Deo N. Random Matrix Analysis of Protein Families. ECS Transactions Vol.107 No.1 (2022) , 18877-18891. 18891. doi:10.1149/10701.18877ecst Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/84633
Title
Random Matrix Analysis of Protein Families
Author(s)
Author's Affiliation
Other Contributor(s)
Abstract
Proteins are vital for almost all biochemical and cellular processes. Although there is an enormous growth in the protein sequence data, the statistical characterization, structure and function of many of these sequences are still unknown. The statistical and spectral analysis of the Pearson correlation matrices between positions based on physiochemical properties of amino acids of seven protein families is performed and compared with the random Wishart matrix model results. A detailed analysis shows that the protein families significantly diverge from the Marchenko-Pastur distribution with many eigenvalues (outliers) outside the Wishart lower and upper bound. It is shown that level spacing distribution of protein families is similar to the Gaussian orthogonal ensemble. Further, the number variance varies as log of the system size indicating the presence of long range correlations within the protein families.