CAGE - What can it do for you?

  • Highly sensitive and accurate quantitative transcriptome analysis not only as a gene, but as a Transcription Start Site (TSS).
  • Reliable way to discover Transcription Factor binding motif based on the true TSS nearby the actual promoter.
  • Unique and powerful tool to discover alternative promoters for all endogenous genes.
  • Help you to discover new biomarker and/or bidirectional enhancer RNA.
  • Clarifying the heterogeneity and complexity of transcriptomes from the view of Cause of Transcription instead of expression profiling.
  • Linking expression values to promoter sites for a better understanding of how signaling pathways regulate transcription.
  • PCR-free operation procedures provide unbiased quantification of transcript amount.

Difference from RNA-Seq and other gene expression analysis techniques

Different from Microarray and RNA-seq, CAGE is able to accurately identify transcriptional start sites (TSSs) and the corresponding promoter regions through sequencing the 3’ end of cDNA (5’ end of RNA). This makes CAGE a powerful tool to analyze the gene regulation in the TSSs level, enabling analysis of the gene regulated by multiple alternative promoters. Therefore, CAGE can serve as a new perspective approach for genome annotation, by elucidating transcriptional signaling cascades, and performing other functions.

Comparison among major gene expression analysis techniques
CAGE RNA-seq SAGE Microarray
de novo Gene Finding good good good N/A
Gene Expression Quantification superior 1
※ free of PCR bias unaffected by gene size
good good average
Determining Promoter Site superior average N/A N/A
Motif Finding for Transcription Factor Binding Site superior average average 2
※ depending on known 5' end sequence information
average 2
※ depending on known 5' end sequence information
Identification of Bidirectional Enhancer RNA superior N/A N/A N/A
Determining Transcription Start/1st Exon Site superior average N/A N/A
Determining Gene Structure (intron/exon, alternative splicing variants) N/A average3
※ depending on sequence depth
Duration of Work Process average average average good
Library Preparation Complicatedness ※Long Time 4 (8 days) average average easy
Data Analysis Tools average good average good

("N/A" means not applicable)

  • 1 free of PCR bias and unaffected by gene size
  • 2 depending on known 5' end sequence information
  • 3 depending on sequence depth
  • 4 8 days

What is the CAGE method?

Cap Analysis of Gene Expression (CAGE) is a method for promoter identification and transcription profiling developed by RIKEN (Patent Number: US 6174669, US 6221599, US8809518, etc.). CAGE utilizes a “cap-trapping” technology based on the biotinylation of the 7-methylguanosine cap of Pol II transcripts, to pull down the 5’-complete cDNAs reversely transcribed from the captured transcripts. Through a massive parallel sequencing of the 5’ end of cDNA and analysis of the sequenced tags, transcription start sites and transcripts amount are inferred on a genome-wide scale. Thus, CAGE provides an effective genome-wide transcriptional profiling as an alternative to microarray and RNA-seq.

Item Specification Comment
Total RNA required 5 μg / sample Total RNA preferable
RNA entry QC Bioanalyzer We perform entry QC on all samples
DNA amount of CAGE library Several ng DNA fragments ready for illumine NGS sequencer
Sequencing platform Illumina HiSeq 2000/2500
Number of reads per lane guaranteed 75 M reads/lane
Approx. 100 - 150 M reads / lane in average
Standard conditions: 8 - 12 samples per lane.
Optional extra sequencing Number of lanes per analysis Additional lanes are available with additional charge.
Mapping rate About 75% of tags map to unique mapping position 4 M mappable CAGE tags / sample is guaranteed.
Sequence data Provided with Illumina file format Delimited text files holding sequence information and quality scores.
Data Analysis Mapping positions Tables/flat files: number of raw reads, number of extracted tags, number of mapped tags, etc.

Sample Data Download

Downlod the raw fastq data, processed fastq data, reference genome, and analysis results
* The file may take some time to download as the total data size is 32.6GB.

How does CAGE work?

First strand cDNA synthesis is performed using our proprietarily owned technology facilitating thermostable reverse transcription and random priming. Random Priming allows for the finding of non-polyadenylated RNA as well. RNAs without the 5’-cap such as rRNA, truncated RNA, and incompletely reversely transcribed RNAs are eliminated by cap-trapping.

After selection of full-length cDNA, adaptors are ligated to the 5'-ends of full-length enriched cDNAs as well as the 3'-ends in the strand-specific manner. CAGE Tags are ready for high-throughput sequencing after the 2nd strand synthesis.

CAGE tags are mapped to the reference genome to identify TSS and their related promoter regions. In addition, comparison of mapping positions to defined transcripts provides an annotation of the CAGE tag. The number of CAGE tags mapped at specific locations in the genome provides quantification on promoter activities on a genome-wide scale.

Hence, CAGE can contribute to projects in Genome Annotation, Gene Discovery, Gene Expression Profiling, and Promoter Identification. The Reference List showed examples on how CAGE has been successfully used in studies related to transcriptome analysis and promoter identification. Our CAGE protocols are based on a well-established procedure capable of obtaining a large number of tags.

Why we can trust the CAGE analysis?

Extensively used within the

User voices

Shipping and Packing

No charge for shipping within Japan.
Shipping outside Japan, extra fee will be charged. Please refer our Shipping, Packing Charge Information

Shipping and Packing Charge
The library will be shipped at RT as dried pellet.


  1. Young RS et al, The frequent evolutionary birth and death of functional promoters in mouse and human, Genome Res, 25, 1546-1557 (2015)
  2. Roy S et al, Redefining the transcriptional regulatory dynamics of classically and alternatively activated macrophages by deepCAGE transcriptomics, Nucleic Acids Res, 43(14), 6969-6982 (2015)
  3. Yu NY et al, Complementing tissue characterization by integrating transcriptome profiling from the Human Protein Atlas and from the FANTOM5 consortium, Nucleic Acids Res, 43(14), 6787-6798 (2015)
  4. and more
  5. Mina M et al, Promoter-level expression clustering identifies time development of transcriptional regulatory cascades initiated by ErbB receptors in breast cancer cells, Sci Rep, 5, 11999 (2015)
  6. Ramilowski JA et al, A draft network of ligand-receptor-mediated multicellular signalling in human, Nat Commun, 6, 7866 (2015)
  7. Aitken S et al, Transcriptional dynamics reveal critical roles for non-coding RNAs in the immediate-early response, PLoS Comput Biol, 11(4), e1004217 (2015)
  8. Haberle V et al, CAGEr: precise TSS data retrieval and high-resolution promoterome mining for integrative analyses, Nucleic Acids Res, 43(8), e51 (2015)
  9. Taguchi A et al, Characterization of Novel Transcripts of Human Papillomavirus Type 16 Using Cap Analysis Gene Expression Technology, Journal of Virology, 89(4), 2448-2452 (2015)
  10. Arner E et al, Gene regulation. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells, Science, 347(6225), 1010-1014 (2015)
  11. Joshi A et al, Transcription factor, promoter, and enhancer utilization in human myeloid cells, J Leukoc Biol, 97(5), 985-995 (2015)
  12. Fort A et al, Nuclear transcriptome profiling of induced pluripotent stem cells and embryonic stem cells identify noncoding loci resistant to reprogramming, Cell Cycle, 14(8), 1148-1155 (2015)
  13. Lizio M et al, Gateways to the FANTOM5 promoter level mammalian expression atlas, Genome Biol, 16, 22 (2015)
  14. Vitezic M et al, CAGE-defined promoter regions of the genes implicated in Rett Syndrome, BMC Genomics, 15, 1177 (2014)
  15. Verardo R et al, Specific mesothelial signature marks the heterogeneity of mesenchymal stem cells from high-grade serous ovarian cancer, Stem Cells, 32(11), 2998-3011 (2014)
  16. Grison A et al, Mesencephalic dopaminergic neurons express a repertoire of olfactory receptors and respond to odorant-like molecules, BMC Genomics, 15, 729 (2014)
  17. Fort A et al, Deep transcriptome profiling of mammalian stem cells supports a regulatory role for retrotransposons in pluripotency maintenance, Nature Genetics, 46(6), 558-566 (2014)
  18. Hasegawa A et al, MOIRAI: a compact workflow system for CAGE analysis, BMC Bioinformatics, 15, 144 (2014)
  19. Yamaga R et al, Systemic identification of estrogen-regulated genes in breast cancer cells through cap analysis of gene expression mapping, Biochem Biophys Res Commun, 447(3), 531-536 (2014)
  20. Prasad P et al, High-throughput transcription profiling identifies putative epigenetic regulators of hematopoiesis, Blood, 123(17), e46-e57 (2014)
  21. Ohmiya H et al, RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE), BMC Genomics, 15, 269 (2014)
  22. Kawaji H et al, Comparison of CAGE and RNA-seq transcriptome profiling using a clonally amplified and single molecule next generation sequencing, Genome Res, 24(4), 708-717 (2014)
  23. Rönnerblad M et al, Analysis of the DNA methylome and transcriptome in granulopoiesis reveals timed changes and dynamic enhancer methylation, Blood, 123(17), e79-e89 (2014)
  24. Motakis E et al, Redefinition of the human mast cell transcriptome by deep-CAGE sequencing, Blood, 123(17), e58-e67 (2014)
  25. Schmidl C et al, The enhancer and promoter landscape of human regulatory and conventional T cell subpopulations, Blood, 123(17), e68-e78 (2014)
  26. Schmidl C et al, Transcription and enhancer profiling in human monocyte subsets, Blood, 123(17), e90-e99 (2014)
  27. Morikawa H et al, Differential roles of epigenetic changes and Foxp3 expression in regulatory T cell-specific transcriptional regulation, Proc Natl Acad Sci USA, 111(14), 5289-5294 (2014)
  28. Severin J et al, Interactive visualization and analysis of large-scale NGS data-sets using ZENBU, Nature Biotechnology, 32(3), 217-219 (2014)
  29. Rye M et al, Chromatin states reveal functional associations for globally defined transcription start sites in four human cell lines, BMC Genomics, 15, 120 (2014)
  30. Murata M et al, Detecting expressed genes using CAGE, Methods Mol Biol, 1164, 67-85 (2014)
  31. Pascarella G et al, NanoCAGE analysis of the mouse olfactory epithelium identifies the expression of vomeronasal receptors and of proximal LINE elements, Front Cell Neurosci, 8, 41 (2014)
  32. Nepal C et al, Dynamic regulation of the transcription initiation landscape at single nucleotide resolution during vertebrate embryogenesis, Genome Res, 23(11), 1938-1950 (2013)
  33. Sompallae R et al, Sompallae RA comprehensive promoter landscape identifies a novel promoter for CD133 in restricted tissues, cancers, and stem cells, Front Genet, 4, 209 (2013)
  34. Sompallae R, A comprehensive promoter landscape identifies a novel promoter for CD133 in restricted tissues, cancers, and stem cells, Front Genet, 4, 209 (2013)
  35. Harbers M et al, Comparison of RNA- or LNA-hybrid oligonucleotides in template-switching reactions for high-speed sequencing library preparation, BMC Genomics, 14, 665 (2013)
  36. Tang DT et al, Suppression of artifacts and barcode bias in high-throughput transcriptome analyses utilizing template switching, Nucleic Acids Res, 41(3), e44 (2013)
  37. ENCODE Project Consortium, An integrated encyclopedia of DNA elements in the human genome, Nature, 489(7414), 57-74 (2012)
  38. Derrien T et al, The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression, Genome Res, 22(9), 1775-1789 (2012)
  39. Djebali S et al, Landscape of transcription in human cells, Nature, 489(7414), 101-108 (2012)
  40. Itoh M et al, Automated workflow for preparation of cDNA for cap analysis of gene expression on a single molecule sequencer, PLoS One, 7(1), e30809 (2012)
  41. Takahashi H et al, CAGE (cap analysis of gene expression): a protocol for the detection of promoter and transcriptional networks, Methods Mol Biol, 786, 181-200 (2014)
  42. Schroder K et al, Conservation and divergence in Toll-like receptor 4-regulated gene expression in primary human versus mouse macrophages, Proc Natl Acad Sci USA, 109(16), E944-E953 (2012)
  43. Plessy C et al, Promoter architecture of mouse olfactory receptor genes, Genome Res, 22(3), 486-497 (2012)
  44. Takahashi H et al, 5' end-centered expression profiling using cap-analysis gene expression and next-generation sequencing, Nat Protoc7, 542-561 (2012)
  45. Chien CH et al, Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data, Nucleic Acids Res, 39(21), 9345-9356 (2011)
  46. Kanamori-Katayama M et al, Unamplified cap analysis of gene expression on a single-molecule sequencer, Genome Res, 21(7), 1150-1159 (2011)
  47. Hoskins RA et al, Genome-wide analysis of promoter architecture in Drosophila melanogaster, Genome Res, 21(2), 182-192 (2011)
  48. Salimullah M et al, NanoCAGE: a high-resolution technique to discover and interrogate cell transcriptomes, Cold Spring Harb Protoc, 2011(1), pdb.prot5559 (2011)
  49. Schaefer U et al, High sensitivity TSS prediction: estimates of locations where TSS cannot occur, PLoS One, 5(11), e13934 (2010)
  50. Vitezic M et al, Building promoter aware transcriptional regulatory networks using siRNA perturbation and deepCAGE, Nucleic Acids Res, 38(22), 8141-8148 (2010)
  51. Hestand MS et al, Tissue-specific transcript annotation and expression profiling with complementary next-generation sequencing technologies, Nucleic Acids Res, 38(16), e165 (2010)
  52. Plessy C et al, Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan, Nat Methods, 7(7), 528-534 (2010)
  53. Atanur SS et al, The genome sequence of the spontaneously hypertensive rat: Analysis and functional significance, Genome Res, 20(6), 791-803 (2010)
  54. FANTOM Consortium et al, The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line, Nat Genet, 41(5), 553-62 (2009)
  55. Faulkner GJ et al, The regulated retrotransposon transcriptome of mammalian cells, Nat Genet, 41(5), 563-571 (2009)
  56. Balwierz PJ et al, Methods for analyzing deep sequencing expression data: constructing the human and mouse promoterome with deepCAGE data, Genome Biol, 10(7), R79 (2009)
  57. Valen E et al, Genome-wide detection and analysis of hippocampus core promoters using DeepCAGE, Genome Res, 19(2), 255-265 (2009)
  58. Hofmann O et al, Genome-wide analysis of cancer/testis gene expression, Proc Natl Acad Sci USA, 105(51), 20422-20427 (2008)
  59. ENCODE Project Consortium et al, Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project, Nature, 447(7146), 799-816 (2007)
  60. Shimokawa K et al, Large-scale clustering of CAGE tag expression data, BMC Bioinformatics, 8, 161 (2007)
  61. Kawaji H et al, Dynamic usage of transcription start sites within core promoters, Genome Biol, 7(12), R118 (2006)
  62. Carninci P et al, Genome-wide analysis of mammalian promoter architecture and evolution, Nat Genet, 38(6), 626-635 (2006)
  63. Kodzius R et al, CAGE: cap analysis of gene expression, Nat Methods, 3(3), 211-222 (2006)
  64. Shiraki T et al, Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage, Proc Natl Acad Sci USA, 100(26), 15776-15781 (2003)
  65. Carninci P et al, High-efficiency full-length cDNA cloning by biotinylated CAP trapper, Genomics, 37(3), 327-336 (1996)