info:eu-repo/semantics/article
Large Ankyrin repeat proteins are formed with similar and energetically favorable units
Fecha
2020-06Registro en:
Galpern, Ezequiel Alejandro; Freiberger, Maria Ines; Ferreiro, Diego; Large Ankyrin repeat proteins are formed with similar and energetically favorable units; Public Library of Science; Plos One; 15; 6; 6-2020; 1-16
1932-6203
CONICET Digital
CONICET
Autor
Galpern, Ezequiel Alejandro
Freiberger, Maria Ines
Ferreiro, Diego
Resumen
Ankyrin containing proteins are one of the most abundant repeat protein families present in all extant organisms. They are made with tandem copies of similar amino acid stretches that fold into elongated architectures. Here, we built and curated a dataset of 200 thousand proteins that contain 1.2 million Ankyrin regions and characterize the abundance, structure and energetics of the repetitive regions in natural proteins. We found that there is a continuous roughly exponential variety of array lengths with an exceptional frequency at 24 repeats. We described that individual repeats are seldom interrupted with long insertions and accept few deletions, in line with the known tertiary structures. We found that longer arrays are made up of repeats that are more similar to each other than shorter arrays, and display more favourable folding energy, hinting at their evolutionary origin. The array distributions show that there is a physical upper limit to the size of an array of repeats of about 120 copies, consistent with the limit found in nature. The identity patterns within the arrays suggest that they may have originated by sequential copies of more than one Ankyrin unit.