Statistical analysis of proteins families: a network and random matrix approach

dc.contributor.authorKumari R.
dc.contributor.authorBhadola P.
dc.contributor.authorDeo N.
dc.contributor.correspondenceKumari R.
dc.contributor.otherMahidol University
dc.date.accessioned2024-10-19T18:13:23Z
dc.date.available2024-10-19T18:13:23Z
dc.date.issued2024-10-01
dc.description.abstractAbstract: We present a novel method for analyzing the structural organization of protein families by integrating random matrix theory (RMT) and network theory with the physiochemical properties of amino acids and multiple sequence alignment. RMT distinguishes significant interactions between amino acids from background noise, pinpointing coevolving positions likely crucial for protein structure and function. This property-based approach captures both short and long-range correlations, unlike previous methods that treat amino acids as mere characters. The eigenvector components of eigenvalues outside the RMT bound deviate from typical RMT observations, offering critical system information. We quantify the information content of each eigenvector using an entropic estimate, showing that the smallest eigenvectors are highly localized and informative. These eigenvectors form clusters of biologically and structurally significant positions, validated by experiments. By creating networks of amino acid interactions for each property, we uncover key motifs and interactions. This method enhances our understanding of protein evolution, interactions, and potential targets to modulate enzymatic actions. We study two protein families Cadherin-4 and Betalactamase families which display two extreme characteristics one nearly random and the other very structured or organised. Graphical abstract: (Figure presented.)
dc.identifier.citationEuropean Physical Journal B Vol.97 No.10 (2024)
dc.identifier.doi10.1140/epjb/s10051-024-00781-6
dc.identifier.eissn14346036
dc.identifier.issn14346028
dc.identifier.scopus2-s2.0-85206220280
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/101669
dc.rights.holderSCOPUS
dc.subjectMaterials Science
dc.subjectPhysics and Astronomy
dc.titleStatistical analysis of proteins families: a network and random matrix approach
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85206220280&origin=inward
oaire.citation.issue10
oaire.citation.titleEuropean Physical Journal B
oaire.citation.volume97
oairecerif.author.affiliationUniversity of Delhi
oairecerif.author.affiliationMahidol University

Files

Collections