Tesis
Agrupamento de sequências de miRNA utilizando aprendizado não-supervisionado baseado em grafos
Fecha
2016-08-12Registro en:
Autor
Kasahara, Viviani Akemi
Institución
Resumen
Cluster analysis is the organization of a collection of patterns into clusters based on similarity
which is determined by using properties of data. Clustering techniques can be useful in a
variety of knowledge domains such as biotechnology, computer vision, document retrieval and
many others. An interesting area of biology involves the concept of microRNAs (miRNAs) that
are approximately 22 nucleotide-long non-coding RNA molecules that play important roles in
gene regulation. Clustering miRNA sequences can help to understand and explore sequences
belonging to the same cluster that has similar biological functions. This research work
investigates and explores seven unsupervised clustering algorithms based on graphs that can be
divided into three categories: algorithm based on region of influence, algorithm based on
minimum spanning tree and spectral algorithm. To assess the contribution of the proposed
algorithms, data from miRNA families stored in the online miRBase database were used in the
conducted experiments. The results of these experiments were presented, analysed and
evaluated using clustering validation indexes as well as visual analysis.