Tese
Agrupamento de interações não lineares em análise fatorial
Fecha
2020-02-19Autor
Erick da Conceição Amorim
Institución
Resumen
Factor analysis is a powerful tool for dimension reduction in a multivariate statistical study. This Thesis is dedicated to extend the factor model with non-linear interactions proposed in 2013. The main contribution of our work is to present two approaches to cluster the non-linear interactions and thus develop new models that are not restricted to the extreme scenarios where all non-null interactions are different or all are the same. The first strategy to handle the clusters involves a finite mixture of degenerated components. The second option is especified via the Dirichlet process. A comprehensive simulation study is developed to explore the proposals and it shows their good performances. A sentitivity analysis is carried out to evaluate advantages of estimating a smoothness parameter defined in a covariance function of the Gaussian process establishing the non-linearity of the interactions. In terms of application, the methodology is illustrated with the analysis of gene expression related to four breast cancer data sets. Here, the genes belonging to disjoint genome regions, with copy number alteration, are connected to the main factors and their non-linear interactions are estimated and clustered. The mutual investigation and comparison of these four breast cancer data sets is rarely found in the literature.