masterThesis
Distribution of human coding DNA lengths via information theory
Date
2021-02-12
Registered in:
CORREIA, Jonathan Pessoa. Distribuição dos tamanhos de DNA humano codificante via teoria da informação. 2021. 73f. Dissertação (Mestrado em Física) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.
Author
Correia, Jonathan Pessoa
Abstract
We analyze the coding sequence of Homo sapiens DNA via a model that naturally embraces correlations among the bases in DNA sequences of living organisms. The
model is based on the optimization of the Shannon entropy, which is the core of all statistical arguments. In our work, we propose the double-exponential distribution function
for the length of DNA measured in base pairs (bp). The results show that the short-range correlations (SRC), always present in coding DNA sequences, are appropriately
captured by the double-exponential distribution, which adequately describes the cumulative length distribution of DNA bases. Based on this model, we use an empirical
cumulative distribution function and the database of proteins compiled by the Ensembl
Project to show consistency with the data.
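
As an illustration of the empirical cumulative distribution function mentioned above, the following minimal sketch computes one from a list of sequence lengths in bp. The function name `ecdf` and the sample lengths are hypothetical, chosen only for this example; they are not taken from the dissertation or from the Ensembl database, and the sketch does not fit the double-exponential model itself.

```python
def ecdf(lengths):
    """Return the sorted lengths and, for each, the fraction of samples
    that are less than or equal to it (a plain empirical CDF).

    With repeated values, the last entry for a given value carries the
    ECDF at that value.
    """
    xs = sorted(lengths)
    n = len(xs)
    return xs, [(i + 1) / n for i in range(n)]


# Hypothetical lengths in base pairs, for illustration only (not Ensembl data).
toy_lengths_bp = [300, 1200, 450, 900, 1500]

xs, fs = ecdf(toy_lengths_bp)
for x, f in zip(xs, fs):
    print(f"{x:>5d} bp  F = {f:.2f}")
```

A model cumulative distribution could then be evaluated at the same `xs` and compared with `fs` point by point, which is one common way to check consistency between a proposed distribution and data.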