Revising transcriptome assemblies with phylogenetic information

Citation metadata

From: PLoS ONE(Vol. 16, Issue 1)
Publisher: Public Library of Science
Document Type: Report
Length: 5,763 words
Lexile Measure: 1630L

Document controls

Main content

Abstract :

A common transcriptome assembly error is to mistake different transcripts of the same gene as transcripts from multiple closely related genes. This error is difficult to identify during assembly, but in a phylogenetic analysis such errors can be diagnosed from gene phylogenies where they appear as clades of tips from the same species with improbably short branch lengths. treeinform is a method that uses phylogenetic information across species to refine transcriptome assemblies within species. It identifies transcripts of the same gene that were incorrectly assigned to multiple genes and reassign them as transcripts of the same gene. The treeinform method is implemented in Agalma, available at, and the general approach is relevant in a variety of other contexts.

Source Citation

Source Citation   

Gale Document Number: GALE|A648156946