Effects of sequence weighting on amino acid frequencies in multiple sequence alignments: application to residue conservation and correlation analyses

Lucas Carrijo de Oliveira

dc.contributor	Lucas Bleicher
dc.contributor	http://lattes.cnpq.br/1342208759733891
dc.creator	Lucas Carrijo de Oliveira
dc.date.accessioned	2021-03-01T16:53:21Z
dc.date.accessioned	2022-10-03T22:44:29Z
dc.date.available	2021-03-01T16:53:21Z
dc.date.available	2022-10-03T22:44:29Z
dc.date.created	2021-03-01T16:53:21Z
dc.date.issued	2016-06-30
dc.identifier	http://hdl.handle.net/1843/35089
dc.identifier.uri	http://repositorioslatinoamericanos.uchile.cl/handle/2250/3809570
dc.description.abstract	When a multiple sequence alignment is analysed at the residue level, conserved positions are not the only informative pattern: other patterns also indicate functional importance and reflect functional divergence within a homologous protein family due to gene duplication. In families that have subfamilies with distinct functional specificities, some positions can be conserved only in a particular subfamily, or the conserved amino acid can be different for each of the subfamilies. This suggests that the role of such a residue relates not to the global function of the family but to the functional specificity of that subfamily. In these cases, it is reasonable to expect that such specificities are determined not by a single residue but by a group of residues, and this group will emerge from residue correlation analysis provided that a sufficient number of proteins share the same specificities. However, some protein families have subfamilies that are underrepresented in terms of the number of sequences in the alignments. At the same time, these alignments are usually full of redundant sequences, often mutants or variants of the same sequence, originating mainly from model organisms. This redundancy tends to bias statistically based analyses such as correlation methods. The present work therefore aims to compare the effects of two distinct approaches to reducing redundancy in multiple sequence alignments: sequence weighting and filtering by maximum identity. It also proposes approaches to make correlation calculations compatible with sequence weighting, in order to improve analyses of residue conservation and correlation. Sequence weighting highlighted the frequencies of amino acids specific to less sampled subfamilies while decreasing the frequencies of amino acids present in redundant sequences.
The adapted calculations were able to detect such differences, providing a good alternative for conservation and correlation analysis in alignments that are less representative of the actual protein diversity found in nature.
dc.publisher	Universidade Federal de Minas Gerais
dc.publisher	Brazil
dc.publisher	ICB - INSTITUTO DE CIÊNCIAS BIOLÓGICAS
dc.publisher	Programa de Pós-Graduação em Bioinformática
dc.publisher	UFMG
dc.rights	http://creativecommons.org/licenses/by-nc-nd/3.0/pt/
dc.rights	Open Access
dc.subject	Bioinformatics
dc.title	Effects of sequence weighting on amino acid frequencies in multiple sequence alignments: application to residue conservation and correlation analyses
dc.type	Dissertation

This item belongs to the following institution

Universidade Federal de Minas Gerais (Brazil)