dc.creatorRozadilla, Gastón
dc.creatorMoreiras Clemente, Jorgelina
dc.creatorMccarthy, Cristina Beryl
dc.date.accessioned2021-09-10T01:25:55Z
dc.date.accessioned2022-10-15T13:45:03Z
dc.date.available2021-09-10T01:25:55Z
dc.date.available2022-10-15T13:45:03Z
dc.date.created2021-09-10T01:25:55Z
dc.date.issued2020-07
dc.identifierRozadilla, Gastón; Moreiras Clemente, Jorgelina; Mccarthy, Cristina Beryl; HoSeIn: A Workflow for Integrating Various Homology Search Results from Metagenomic and Metatranscriptomic Sequence Datasets; Bio-protocol; Bio-protocol; 10; 14; 7-2020; 1-34
dc.identifier2331-8325
dc.identifierhttp://hdl.handle.net/11336/140050
dc.identifierCONICET Digital
dc.identifierCONICET
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/4392924
dc.description.abstractData generated by metagenomic and metatranscriptomic experiments is both enormous and inherently noisy. When using taxonomy-dependent alignment-based methods to classify and label reads, the first step consists in performing homology searches against sequence databases. To obtain the most information from the samples, nucleotide sequences are usually compared to various databases (nucleotide and protein) using local sequence aligners such as BLASTN and BLASTX. Nevertheless, the analysis and integration of these results can be problematic because the outputs from these searches usually show inconsistencies, which can be notorious when working with RNA-seq. Moreover, and to the best of our knowledge, existing tools do not criss-cross and integrate information from the different homology searches, but provide the results of each analysis separately. We developed the HoSeIn workflow to intersect the information from these homology searches, and then determine the taxonomic and functional profile of the sample using this integrated information. The workflow is based on the assumption that the sequences that correspond to a certain taxon are composed of:1)sequences that were assigned to the same taxon by both homology searches;2)sequences that were assigned to that taxon by one of the homology searches but returned no hits in the other one.
dc.languageeng
dc.publisherBio-protocol
dc.relationinfo:eu-repo/semantics/altIdentifier/url/https://bio-protocol.org/e3679
dc.relationinfo:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.21769/BioProtoc.3679
dc.rightshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.rightsinfo:eu-repo/semantics/restrictedAccess
dc.subjectMETAGENOMICS
dc.subjectMETATRANSCRIPTOMICS
dc.subjectNEXT GENERATION SEQUENCING
dc.subjectHOMOLOGY SEARCH
dc.titleHoSeIn: A Workflow for Integrating Various Homology Search Results from Metagenomic and Metatranscriptomic Sequence Datasets
dc.typeinfo:eu-repo/semantics/article
dc.typeinfo:ar-repo/semantics/artículo
dc.typeinfo:eu-repo/semantics/publishedVersion


Este ítem pertenece a la siguiente institución