Title | Transcriptome sequencing reveals thousands of novel long non-coding RNAs in B cell lymphoma. |
Publication Type | Journal Article |
Year of Publication | 2015 |
Authors | Verma A, Jiang Y, Du W, Fairchild L, Melnick A, Elemento O |
Journal | Genome Med |
Volume | 7 |
Pagination | 110 |
Date Published | 2015 Nov 01 |
ISSN | 1756-994X |
Keywords | Animals, B-Lymphocytes, Cell Line, Tumor, Dogs, Humans, Lymphoma, Large B-Cell, Diffuse, RNA, Long Noncoding, Sequence Analysis, RNA, Transcriptome |
Abstract | BACKGROUND: Gene profiling of diffuse large B cell lymphoma (DLBCL) has revealed broad gene expression deregulation compared to normal B cells. While many studies have interrogated well known and annotated genes in DLBCL, none have yet performed a systematic analysis to uncover novel unannotated long non-coding RNAs (lncRNA) in DLBCL. In this study we sought to uncover these lncRNAs by examining RNA-seq data from primary DLBCL tumors and performed supporting analysis to identify potential role of these lncRNAs in DLBCL. METHODS: We performed a systematic analysis of novel lncRNAs from the poly-adenylated transcriptome of 116 primary DLBCL samples. RNA-seq data were processed using de novo transcript assembly pipeline to discover novel lncRNAs in DLBCL. Systematic functional, mutational, cross-species, and co-expression analyses using numerous bioinformatics tools and statistical analysis were performed to characterize these novel lncRNAs. RESULTS: We identified 2,632 novel, multi-exonic lncRNAs expressed in more than one tumor, two-thirds of which are not expressed in normal B cells. Long read single molecule sequencing supports the splicing structure of many of these lncRNAs. More than one-third of novel lncRNAs are differentially expressed between the two major DLBCL subtypes, ABC and GCB. Novel lncRNAs are enriched at DLBCL super-enhancers, with a fraction of them conserved between human and dog lymphomas. We see transposable elements (TE) overlap in the exonic regions; particularly significant in the last exon of the novel lncRNAs suggest potential usage of cryptic TE polyadenylation signals. We identified highly co-expressed protein coding genes for at least 88 % of the novel lncRNAs. Functional enrichment analysis of co-expressed genes predicts a potential function for about half of novel lncRNAs. Finally, systematic structural analysis of candidate point mutations (SNVs) suggests that such mutations frequently stabilize lncRNA structures instead of destabilizing them. CONCLUSIONS: Discovery of these 2,632 novel lncRNAs in DLBCL significantly expands the lymphoma transcriptome and our analysis identifies potential roles of these lncRNAs in lymphomagenesis and/or tumor maintenance. For further studies, these novel lncRNAs also provide an abundant source of new targets for antisense oligonucleotide pharmacology, including shared targets between human and dog lymphomas. |
DOI | 10.1186/s13073-015-0230-7 |
Alternate Journal | Genome Med |
PubMed ID | 26521025 |
PubMed Central ID | PMC4628784 |