Professional Documents
Culture Documents
The transcriptome is the total set of transcripts, mRNA and non-coding RNA, in one or a
population of cells under specific conditions. The transcriptome analysis lay the foundation of
gene structure and function research. Based on next-generation high-throughput sequencing
technologies, RNA-seq found its applications in many research fields including fundamental
science research, medical research and drug development.
Services
We will first test the quality of total RNA provided by the customer. If the sample is qualified, we
The basic data analysis includes image recognition, base calling, filtering adapter sequences and
detecting contaminations of samples.
We will first test the quality of total RNA provided by the customer. If the sample is qualified, we
1
will then conduct the following technical route: sample preparation→sequencing.
The basic data analysis includes image recognition, base calling, filtering adapter sequences and
detecting contaminations of samples.
We will first test the quality of total RNA or size-fractionated RNA (eg. 200-700 nt) provided by
the customer. If the sample is qualified, we will then conduct the following technical route: sample
preparation→sequencing.
Experimental pipeline
2
Application of RNA-Seq
capacity RNA-Seq
Detected signals Digital signals
Detected range Nearly all the transcripts
Detected accuracy From several copies to 100,000 copies
Resolution Allele specific expression, alternative splicing
Case Study
Marc Sultan et al. reported that RNA-Seq can detect 25% more genes than
those by microarrays. A global survey of messenger RNA splicing events
identified 94,241 splice junctions (4096 of which were previously unidentified)
in a study of human embryonic kidney and B cell.
3
detected by microarrays are shown with light red (HEK) and dark red (B cells) bars.
After comparing 5’ RACE results with RNA-Seq results, researchers found both
methods identified 5’ boundaries within 50 bp of one another for 786 genes
(77.9%). RNA-Seq could identify the 3’ boundary precisely.
In rice RNA-seq project in the Beijing Genomics Insitute, we found RNA-seq can find more low
abundance genes than traditional methods. (Fig. 6-A)
A B
4
Fig6.Transcriptome study can detect more low abundance
transcripts than cDNA sequencing
A.The length distribution of newly identified transcripts.
B.A comparison of expression level between novel transcripts and cDNA genes.
15
con=0.9775
10
qPCR(log2)
5
0
-5
-10
-10 -5 0 5 10 15
RNA-seq(log2)
5
Reference
Sultan, M, Schulz, M. H.A global view of gene activity and alternative splicing by deep sequencing of the human
transcriptome. et al., Science 321 (5891), 956 (2008).
Maher, C. A,Kumar-Sinha, C,Cao, X. Transcriptome sequencing to detect gene fusions in cancer. et al., Nature
458 (7234), 97 (2009).
Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, The Transcriptional Landscape of the YeastGenome Defined
by RNA Sequencing. et al.,Science 320 (5881), 1344 (2008).
Wilhelm BT, Marguerat S, Watt S.Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide
resolution. et al., Nature 453 (7199), 1239 (2008).
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B.Mapping and quantifying mammalian transcriptomes
by RNA-Seq. et al., Nat Methods 5 (7), 621 (2008).
FAQ
6
RNA-Seq analysis
The transcriptome is the total set of transcripts, mRNA and non-coding RNA, in one or a
population of cells under specific conditions. The transcriptome analysis lay the foundation of
gene structure and function research. Based on next-generation high-throughput sequencing
technologies, RNA-seq found its applications in many research fields including fundamental
science research, medical research and drug development.
Services
We will first test the quality of total RNA provided by the customer. If the sample is qualified, we
The basic data analysis includes image recognition, base calling, filtering adapter sequences and
detecting contaminations of samples.
We will first test the quality of total RNA provided by the customer. If the sample is qualified, we
7
will then conduct the following technical route: sample preparation→sequencing.
The basic data analysis includes image recognition, base calling, filtering adapter sequences and
detecting contaminations of samples.
We will first test the quality of total RNA or size-fractionated RNA (eg. 200-700 nt) provided by
the customer. If the sample is qualified, we will then conduct the following technical route: sample
preparation→sequencing.
Experimental pipeline
capacity RNA-Seq
Detected signals Digital signals
Detected range Nearly all the transcripts
Detected accuracy From several copies to 100,000 copies
Resolution Allele specific expression, alternative splicing
Case Study
Marc Sultan et al. reported that RNA-Seq can detect 25% more genes than
those by microarrays. A global survey of messenger RNA splicing events
identified 94,241 splice junctions (4096 of which were previously unidentified)
in a study of human embryonic kidney and B cell.
9
Identify 5’and 3’UTRs in yeast
After comparing 5’ RACE results with RNA-Seq results, researchers found both
methods identified 5’ boundaries within 50 bp of one another for 786 genes
(77.9%). RNA-Seq could identify the 3’ boundary precisely.
In rice RNA-seq project in the Beijing Genomics Insitute, we found RNA-seq can find more low
abundance genes than traditional methods. (Fig. 6-A)
A B
10
Detect gene fushion
15
con=0.9775
10
qPCR(log2)
5
0
-5
-10
-10 -5 0 5 10 15
RNA-seq(log2)
11
Reference
Sultan, M, Schulz, M. H.A global view of gene activity and alternative splicing by deep sequencing of the human
transcriptome. et al., Science 321 (5891), 956 (2008).
Maher, C. A,Kumar-Sinha, C,Cao, X. Transcriptome sequencing to detect gene fusions in cancer. et al., Nature
458 (7234), 97 (2009).
Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, The Transcriptional Landscape of the YeastGenome Defined
by RNA Sequencing. et al.,Science 320 (5881), 1344 (2008).
Wilhelm BT, Marguerat S, Watt S.Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide
resolution. et al., Nature 453 (7199), 1239 (2008).
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B.Mapping and quantifying mammalian transcriptomes
by RNA-Seq. et al., Nat Methods 5 (7), 621 (2008).
FAQ
12
Small RNA analysis
RNA is one of the most important parts of the bio-material which constructs the framework of
life with DNA and protein together. Small RNA regulates life, such as the development and
growth of cell, the transcription and translation of gene, as well as the gene silence. Small RNA
sequencing is based on solexa technology, the deep sequencing yield numerous small fragments
from 18 to 30nt, we compare them with the known and relative species, find out the difference
between different samples and predict the novel miRNA, furthermore study its function.
Services
Bioinformatics analysis
Items of basic bioinformatics analysis
Length distribution of small RNA
Mapping small RNA sequences to genome sequences and exploring features of distribution along each chromosome
Differential small RNA between two samples
Comparing small RNA sequences with known miRNAs deposited at miRBase (miRBase13.0)
Identification of rRNA, tRNA, snRNA, snoRNA against Rfam (9.1) and Genebank
Identifying repeats associated with small RNAs
Identifying mRNA degradated fragments and siRNA candidates
Items of basic bioinformatics analysis
Annotating and classifying miRNA
Prediction of novel miRNA
Expression of miRNA
Differential expression analysis of miRNA gene and construction of miRNA expression profiles
Clustering analysis of differentially expressed miRNA
Target prediction of miRNA (only for plant)
Technical features
High-throughput: more than 2.5 millions reads can be obtained through the single-pass
sequencing.
13
High resolution: differences between single base pair can be detected.
High accuracy: digital signals to accurately detect the number of copies ranging from several to
hundreds of thousands.
Experimental pipeline
Case study
Extract about 20 μg of total RNA from an animal tissue, conduct the high-throughput sequencing
and do bioinformatics analysis.
The length of small RNA is centered on 22 nt (more than 90%). This illustrates small RNA
sequencing is reliable. The length distribution of small RNA from the tissue is shown in Fig5-2.
After removing contaminants of adaptor and low quality sequences, 3,333,504 reads are
14
generated. Align the sequence to database of miRBase, mRNA/EST and rRNA, and identify known
and candidate miRNAs (Fig.5-3).
Among these data, the most of unique_reads is exon, but miRNA is the major part of total reads.
These miRNA data make the the result more reliable for predicting the novel miRNA.
Different miRNAs show different expression patterns in the same tissue (Fig. 5-4).That is
relative to the difference of the tissue and the selective expression of gene.
Expression level(10K)
Figure 5-4 Expression profile for part of miRNAs in the same tissue
Figure 5-5 Expression profile for part of miRNAs in the different tissue
15
As shown in Fig. 5-5, the expression of miRNA is tissue-specific (A, B, C, D, E, F, G, H, I, J
indicate different tissues respectively, has-let-7b and has-miR-22 indicate different miRNA genes).
As a special kind of RNA, there usually is U in 5’ end, but not G. The position of 2 and 4 base is
short at U. Generally speaking, all positions are short at G but the fourth position. miRNAs have
high conservation in sequence, high time orderency and tissue specificity. The count of all variants
of a miRNA gene can be used as a digital measure its expression level. (Fig. 5-6)
Percent
Except acting as sequence specific guides to regulate mRNA stability or inhibit protein
synthesis, lots of recent studies discovered some novel small RNA types which bound with
different Agonaute proteins and involved in some important biological process, such as chromatin
maintenance and transposon control. These small RNAs always derived from highly repeated
elements and called repeat-associated small RNAs (always interchangeable with piRNA).
According to type of Agonaute proteins they bind to, these small RNAs can be future divided into
different classes. (Fig. 5-7)
16
Prediction of new miRNA candidates
miRNA precursors have characteristic fold-back structure, which can be used to predict novel
miRNAs. By folding the flanking genome sequence of small RNAs, followed by analysis of its
structural features, we can identify novel miRNA candidates. (Fig. 5-8)
One type of gene at different condition has differential expression. Expression level of known
miRNA between different samples and use Log2-ratio drawing, Scatter plot drawing to campare
known miRNA expressed in different samples. (Figure 5-9,5-10)
1000000
Expression level(Day7)
10000
100
1
1
0
10
00
00
10
00
10
Expression level(Day2)
17
Clustering differentially expressed miRNA between two samples
Analysis clusterly miRNA gene which standardized to 1TPM by sequence similarlity. Cluster
the similar sequence .Red indicated up trend, green indicated down trend ,and gray the gene which
hasn’t expressed in any sample.
References
Y Zhang,X Zhou,X Ge, et al.(2009)Insect-Specific microRNA Involved in the Development of the Silkworm
Bombyx mori.PLos One.
Xi Chen,QB Li,J Wang,et al. (2009)Identifucation and characterization of novel amphioxus microRNAs by
Solexa sequencing. Genome Biology.
JM Guo,Y Miao,BX Xiao,et al. (2009)Differential expression of microRNA species in humann gastric cancer
versus non-tumorous tissues.J Gastroenterol Hepatol.
Xi Chen,Yi Ba,LJ Ma,et al. (2008)Characterization of microRNAs in serum: a novel class of biomarkers for
diagnosis of cancer and other diseases.Cell Research.
18
XH Wang,S Tang,SY Le,et al. (2008)Aberrant Expression of Oncogenic and Tumor-Supperessive MicorRNAs in
Cervical Cancer Is Required for Cancer cell Growth.PLos One.
Mi S, Cai T, Hu Y, Chen Y, Hodges E, et al. (2008) Sorting of Small RNAs into Arabidopsis Ar-gonaute
Complexes Is Directed by the 5’ Terminal Nucleotide. Cell.
Montgomery TA, Howell MD, Cuperus JT, Li D, Hansen JE, et al. (2008) Specificity of ARGONAUTE7-miR390
Interaction and Dual Functionality in TAS3 Trans-Acting siRNA Formation. Cell.
Morin RD, O’Connor MD, Griffith M, Kuchenbauer F, Delaney A, et al. (2008) Application of massively parallel
sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res.
Hafner M, Landgraf P, Ludwig J, Rice A, Ojo T, et al. (2008) Identification of microRNAs and other small
regulatory RNAs using cDNA library sequencing. Methods 44(1): 3-12.
Ibarra I, Erlich Y, Muthuswamy SK, Sachidanandam R, Hannon GJ (2007) A role for microRNAs in maintenance
of mouse mammary epithelial progenitor cells. Genes Dev 21(24): 3238-3243.
19