Pages

Saturday 8 April 2017

Research achievements during my PhD.

CPEB2 subfamily 3'UTRs
Background. Developmental biology in higher organisms critically depends on RNA mediated gene expression regulation. As a consequence, the length and complexity of 3' untranslated regions (3'UTRs) of mRNAs is dramatically expanded in mammals. The aim of my Ph.D. was to make sense of the different signatures present in the 3’UTRs, in particular miRNA targets and highly conserved elements.
Research. I showed that CPEB2, CPEB3 and CPEB4 transcripts are regulated by a shared ancestral miRNA signature1. Moreover, I showed that the miRNA binding sites preceded the generation of highly conserved elements in their 3'UTRs (Figure 1).


Figure 1. The 3’UTRs of CPEB2, CPEB3 and CPEB4 share an ancestral miRNA signature that preceded the generation of highly conserved elements. a, Evolution of the CPEB2 subfamily consisting of two duplications preceding vertebrate speciation. The 3’UTR of the ancestral and current CPEBs for different vertebrates is depicted. The binding sites for different miRNA are indicated. b, Alignment of segments of human CPEB2, CPEB3 and CPEB4 3’UTRs. c, Alignment of the same segment of CPEB2 3’UTRs now between different vertebrates. The binding sites for mir-26 and mir-92 are shown.

3'UTR annotation
The explosion of genomic and transcriptomic data created a bottleneck in the gene annotation pipelines at the level of data analysis. While working on the CPEBs, I noticed that the long 3'UTR isoforms of these and other genes were mis-annotated. Importantly, these isoforms were highly conserved across vertebrates and abundantly expressed in the brain (Figure 2). I then moved on to develop different approaches to annotate 3'UTRs in vertebrates. I found a few hundred mis-annotated genes in mice and humans and a few thousand in other vertebrates2. I also showed that long 3'UTRs tend to be misclassified as long non-coding RNAs.


Figure 2. Identification of 3’ ends for transcripts encoding different K+ channels. a, Genomic region encoding the 3’UTRs of Kcnq3 (left panel) and Kcnb1 (right panel). The annotated transcripts are shown on top (Ensembl) together with mapping ESTs, the conservation score for the genomic region (PhyloP) and RNASeq reads from brain (B), testis (T) and heart (H). The proposed 3’ ends are indicated with colored arrowheads. b, Northern blots of brain (B), testis (T) and heart (H) using probes for Kcnq3 (left) and Kcnb1 (right) are shown. The bands corresponding to the proposed new 3’ ends are indicated by the arrowheads.

Perspective. In the study of the CPEB transcripts, I showed that miRNA binding sites embedded in highly conserved stretches of 3’UTRs are functional and more interestingly, they also preceded the formation of these highly conserved elements. Also, the annotations that we proposed for conserved 3’UTRs were later validated by others3 and with time incorporated to the commonly used databases.

1.        Morgan, M., Iaconcig, A. & Muro, A. F. CPEB2, CPEB3 and CPEB4 are coordinately regulated by miRNAs recognizing conserved binding sites in paralog positions of their 3’-UTRs. Nucleic Acids Res. 38, 7698–7710 (2010).
2.        Morgan, M., Iaconcig, A. & Muro, A. F. Identification of 3’ gene ends using transcriptional and genomic conservation across vertebrates. BMC Genomics 13, 708 (2012).
3.        Miura, P. et al. Widespread and extensive lengthening of 3 ′ UTRs in the mammalian brain Widespread and extensive lengthening of 3 9 UTRs in the mammalian brain. 812–825 (2013). doi:10.1101/gr.146886.112

No comments:

Post a Comment