Identification of human-specific transcript variants induced by DNA insertions in the human genome.

Bibliographic Collection: 
MOCA Reference, APE
Publication Type: Journal Article
Authors: Kim, Dong Seon; Hahn, Yoonsoo
Year of Publication: 2011
Journal: Bioinformatics
Volume: 27
Issue: 1
Pagination: 14-21
Date Published: 2011 Jan 1
Publication Language: eng
ISSN: 1367-4811
Keywords: Alternative Splicing, Animals, Base Sequence, DNA, DNA Transposable Elements, Evolution, Molecular, Exons, Expressed Sequence Tags, Genetic Variation, Genome, Human, Genomics, Humans, Molecular Sequence Data, Mutagenesis, Insertional, Pan troglodytes, Polyadenylation, Polymorphism, Genetic, Promoter Regions, Genetic, RNA Splice Sites, RNA, Messenger, Species Specificity, Transcription, Genetic

MOTIVATION: Many genes in the human genome produce a wide variety of transcript variants resulting from alternative exon splicing, differential promoter usage, or altered polyadenylation site utilization that may function differently in human cells. Here, we present a bioinformatics method for the systematic identification of human-specific novel transcript variants that might have arisen after the human-chimpanzee divergence.

RESULTS: The procedure involved collecting genomic insertions that are unique to the human genome when compared with orthologous chimpanzee and rhesus macaque genomic regions, and that are expressed in the transcriptome as exons evidenced by mRNAs and/or expressed sequence tags (ESTs). Using this procedure, we identified 112 transcript variants that are specific to humans; 74 were associated with known genes and the remaining transcripts were located in unannotated genomic loci. The original source of inserts was mostly transposable elements including L1, Alu, SVA, and human endogenous retroviruses (HERVs). Interestingly, some non-repetitive genomic segments were also involved in the generation of novel transcript variants. Insert contributions to the transcripts included promoters, terminal exons and insertions in exons, splice donors and acceptors and complete exon cassettes. Comparison of personal genomes revealed that at least seven loci were polymorphic in humans. The exaptation of human-specific genomic inserts as novel transcript variants may have increased human gene versatility or affected gene regulation.

DOI: 10.1093/bioinformatics/btq612
Alternate Journal: Bioinformatics