dc.creatorWang, K.
dc.creatorAbid, M.A.
dc.creatorRasheed, A.
dc.creatorCrossa, J.
dc.creatorHearne, S.
dc.creatorHuihui Li
dc.date2023-01-12T01:20:12Z
dc.date2023-01-12T01:20:12Z
dc.date2023
dc.date.accessioned2023-07-17T20:10:01Z
dc.date.available2023-07-17T20:10:01Z
dc.identifierhttps://hdl.handle.net/10883/22385
dc.identifier10.1016/j.molp.2022.11.004
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/7514132
dc.descriptionGenomic prediction is an effective way to accelerate the rate of agronomic trait improvement in plants. Traditional methods typically use linear regression models with clear assumptions; such methods are unable to capture the complex relationships between genotypes and phenotypes. Non-linear models (e.g., deep neural networks) have been proposed as a superior alternative to linear models because they can capture complex non-additive effects. Here we introduce a deep learning (DL) method, deep neural network genomic prediction (DNNGP), for integration of multi-omics data in plants. We trained DNNGP on four datasets and compared its performance with methods built with five classic models: genomic best linear unbiased prediction (GBLUP); two methods based on a machine learning (ML) framework, light gradient boosting machine (LightGBM) and support vector regression (SVR); and two methods based on a DL framework, deep learning genomic selection (DeepGS) and deep learning genome-wide association study (DLGWAS). DNNGP is novel in five ways. First, it can be applied to a variety of omics data to predict phenotypes. Second, the multilayered hierarchical structure of DNNGP dynamically learns features from raw data, avoiding overfitting and improving the convergence rate using a batch normalization layer and early stopping and rectified linear activation (rectified linear unit) functions. Third, when small datasets were used, DNNGP produced results that are competitive with results from the other five methods, showing greater prediction accuracy than the other methods when large-scale breeding data were used. Fourth, the computation time required by DNNGP was comparable with that of commonly used methods, up to 10 times faster than DeepGS. Fifth, hyperparameters can easily be batch tuned on a local machine. Compared with GBLUP, LightGBM, SVR, DeepGS and DLGWAS, DNNGP is superior to these existing widely used genomic selection (GS) methods. Moreover, DNNGP can generate robust assessments from diverse datasets, including omics data, and quickly incorporate complex and large datasets into usable models, making it a promising and practical approach for straightforward integration into existing GS platforms.
dc.description279-293
dc.languageEnglish
dc.publisherCell Press
dc.relationhttps://github.com/AIBreeding/DNNGP/blob/main/example-data.tgz
dc.rightsCIMMYT manages Intellectual Assets as International Public Goods. The user is free to download, print, store and share this work. In case you want to translate or create any other derivative work and share or distribute such translation/derivative work, please contact CIMMYT-Knowledge-Center@cgiar.org indicating the work you want to use and the kind of use you intend; CIMMYT will contact you with the suitable license for that purpose
dc.rightsOpen Access
dc.source1
dc.source16
dc.source1674-2052
dc.sourceMolecular Plant
dc.subjectAGRICULTURAL SCIENCES AND BIOTECHNOLOGY
dc.subjectDeep Learning
dc.subjectGenomic Selection
dc.subjectMulti-Omics Data
dc.subjectPrediction Method
dc.subjectMARKER-ASSISTED SELECTION
dc.subjectMETHODS
dc.subjectDATA
dc.subjectLEARNING
dc.subjectwheat
dc.titleDNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants
dc.typeArticle
dc.typePublished Version
dc.coverageUSA


Este ítem pertenece a la siguiente institución