Choosing the proper RNA-seq library preparation method is essential, because it directly affects data quality, depth of coverage, and accuracy of results for your study. In fact, different strategies—such as poly(A) selection or rRNA depletion—can dramatically influence which types of RNA are captured, how much usable data you generate, and how cost-effective your sequencing will be.
A study by Pfizer found that to achieve similar exonic coverage, rRNA depletion required 220% more reads from blood and 50% more from colon tissue compared to poly(A) selection.
Poly(A) selection focuses on mature, polyadenylated mRNAs, making it ideal for gene expression studies. In contrast, rRNA depletion captures both coding and various non-coding transcript types—including non-polyadenylated RNAs and pre-mRNA—providing broader discovery potential.
More usable exonic reads mean lower sequencing costs and faster downstream analysis. For example, using poly(A) enrichment with high-RIN samples can drastically reduce intronic noise, while using rRNA depletion in mRNA-only projects may introduce unnecessary non-coding reads that complicate alignment and inflate file sizes. Conversely, a higher fraction of intronic reads from rRNA depletion can increase data storage and processing demands.
As a result, selecting the right library prep method is not just a technical decision—it’s a strategic one. It determines experimental success, budget optimization, and ultimately, how impactful your research findings will be.
Figure 1. polyA+ selection and rRNA depletion protocols. (Shanrong Zhao et al,.2018)
Poly(A) enrichment is a targeted RNA-seq library prep method that uses oligo(dT) primers attached to beads to capture polyadenylated (poly(A)+) RNA—mainly mature messenger RNAs (mRNAs). Interestingly, the effectiveness of poly(A) capture can also depend on the cell type. Neuronal tissues, for instance, often produce mRNAs with longer poly(A) tails, potentially improving enrichment efficiency in neural studies.
Here's why it's widely used:
Enriches for mature, coding transcripts: Since mRNA makes up ~5% of total RNA and poly(A)-tails are added after splicing, poly(A) selection effectively focuses on functional, protein-coding RNA, reducing background noise from rRNA and pre-mRNA (Chen, L. et al,.2020).
Improved data quality with fewer reads: In one benchmark study (Wei Zhao et al,. 20214), only ~13.5 million poly(A)-selected reads were needed to detect as many genes as a typical microarray, compared to 35–65 million reads when using total RNA-seq (rRNA depletion).
Faster workflows with low input requirements: Poly(A) enrichment works efficiently with small RNA amounts and gives a higher fraction of exonic reads—about 70–71% usable reads in blood and colon samples—greatly lowering sequencing costs.
Bias toward longer poly(A) tails: Some research suggests poly(A) selection can preferentially capture long-tailed mRNAs, potentially misrepresenting shorter-tail or tail-variable transcripts.
Misses non-polyadenylated RNAs: Important RNA types—like some non-coding RNAs, histone mRNAs, and certain regulatory transcripts—lack poly(A) tails and are excluded using this method.
Quantifying protein-coding gene expression, especially when RNA integrity is high (RIN > 8).
Cost-sensitive projects, where maximizing useful exonic data per sequencing dollar is essential.
Low-input samples or smaller studies, where starting material is limited. In fact, many single-cell RNA-seq technologies, such as Smart-seq2 and 10x Genomics Chromium, rely on poly(A) priming due to its compatibility with ultra-low input RNA.
Internal Link: If you’re interested in learning more about our specialized poly(A) RNA-seq services, check out our Poly(A) RNA Sequencing Service.
rRNA depletion is a library preparation method for RNAseq that removes ribosomal RNA (rRNA)—which accounts for ~80–90% of total RNA—allowing both polyadenylated and non-polyadenylated transcripts to be captured. This broader approach is ideal for projects that require full transcriptome coverage, including non-coding RNAs, pre-mRNA, degraded samples, and even microbial RNA. It is also the preferred method for host-pathogen studies and microbial community profiling, as most bacteria and viruses lack poly(A) tails and would be completely missed by poly(A) enrichment.
Figure 2. Concordance of gene expression between the polyA+ selection and rRNA depletion RNA-seq data. (Shanrong Zhao et al,.2018)
A comprehensive comparison study in Scientific Reports (Zhao et al., 2018) evaluated poly(A)+ selection versus rRNA depletion across blood and colon samples. Results showed rRNA depletion captured more unique transcript features, including non-coding and immature RNAs; however, it required 50–220% more sequencing reads to reach the same level of exonic coverage.
Feature | Poly(A) Selection | rRNA Depletion |
---|---|---|
Exonic read yield | High (70–71%) | Lower (22–46%) |
Transcript types detected | Coding mRNAs | Coding + non-coding RNAs |
Bias uniformity | Higher 3′ bias | More even coverage |
Sample quality needed | High (RIN ≥8) | Accepts degraded/FFPE |
Sequence depth required | Lower | Higher |
In short, rRNA depletion is the method of choice when your research demands a wide overview of the transcriptome—including non-coding and degraded RNA—with balanced coverage and flexibility.
Below is a side-by-side comparison of the two main RNAseq library prep methods—poly(A) enrichment and rRNA depletion—with data-driven insights to guide method selection.
Feature | Poly(A) Enrichment | rRNA Depletion |
---|---|---|
Usable exonic reads (blood) | 71 % | 22 % |
Usable exonic reads (colon tissue) | 70 % | 46 % |
Extra reads needed for same exonic coverage | — | +220 % (blood), +50 % (colon) |
Transcript types captured | Mature, coding mRNAs | Coding + noncoding (lncRNAs, snoRNAs, premRNA) |
3′–5′ read bias | Pronounced 3′ bias | More uniform coverage |
Performance with low-quality/FFPE samples | Reduced efficiency | Robust—even with degraded RNA |
Sequencing cost per usable read | Lower, fewer total reads needed | Higher, due to extra sequencing depth |
Bioinformatics complexity | Lower – mostly exonic reads | Higher – includes intronic/noncoding reads |
Usable reads efficiency:
Poly(A) enrichment yields a high percentage of reads mapping to exons (≈70%), minimizing wasted sequencing effort. In contrast, rRNA depletion yields fewer exonic reads—leading to more sequencing and thus higher cost to reach the same depth.
Sequencing & budget considerations:
To match exonic coverage from poly(A) selection, rRNA depletion must sequence 220% more in blood and 50% more in colon tissue—equating to significant cost increase.
Transcriptome scope:
Poly(A) selection targets mature, coding RNAs. By comparison, rRNA depletion offers wider transcriptome coverage, uncovering non-coding RNAs and immature transcripts.
Bias considerations:
Poly(A) methods can introduce a 3′bias due to oligo-dT priming. rRNA depletion provides more uniform coverage across transcripts, enabling more accurate isoform and splicing analyses.
Sample quality flexibility:
Poly(A) selection needs high-quality RNA (e.g., RIN ≥8), while rRNA depletion performs well even with degraded or FFPE samples.
Computational demand:
With rRNA depletion, downstream analysis must handle intronic and non-coding reads, which increases bioinformatics complexity and storage needs. These larger file sizes not only require more disk space but also slow down steps like alignment, transcript assembly, and differential expression analysis—especially in high-throughput projects.
Selecting between poly(A) enrichment and rRNA depletion depends on your sample quality, target RNA types, and research objectives. Another critical but often overlooked factor is the organism type. Poly(A) selection works well for eukaryotes but fails completely in prokaryotes, where rRNA depletion is the only viable option.
Here's a refined decision matrix to guide your method choice:
Sample or Goal | Poly(A) Enrichment | rRNA Depletion |
---|---|---|
High-quality RNA (RIN ≥8) | ✅ Ideal—efficient capture of mature mRNA | ✅ Works, but yields more non-coding reads |
Degraded RNA / FFPE samples (RIN <7) | ⚠️ Not recommended—strong 3′ bias, low yield | ✅ Recommended—handles fragmented RNA without relying on poly(A) tails |
Protein-coding mRNA quantification | ✅ Best choice—high exonic read fraction (~70%) | ⚠️ Less efficient—requires 50–220% deeper sequencing to match coverage |
Non-coding RNAs (lncRNA, snoRNAs, histone RNA) | ⚠️ Misses non-polyadenylated RNAs | ✅ Captures both polyA and non-polyA transcripts |
Low-input or ultra-low input samples | ✅ Works well—efficient with few cells or small RNA | ⚠️ May lose small non-coding RNAs unless optimized kits are used |
For FFPE and degraded samples, rRNA depletion avoids bias and maintains coverage across transcripts, even when RNA is fragmented.
In contrast, poly(A) enrichment needs intact poly(A) tails—otherwise, you'll see strong 3′ end bias and low data yield.
If your focus is strictly protein-coding genes, and RNA quality is high, choose poly(A) for cost-effective, high-quality data.
If your project includes non-coding RNAs or novel transcripts, rRNA depletion allows a broader capture.
Poly(A) enrichment requires fewer reads (13–15M reads) to detect most genes.
rRNA depletion needs significantly more sequences—35–65M reads or more—to reach similar depth.
Check RNA quality (RIN/RQN):
Consider primary targets:
Evaluate budget and sequencing depth:
In light of the data and case studies shared earlier, here’s a clear recommendation summary to help guide your method selection:
Choose Poly(A) Enrichment When:
Choose rRNA Depletion When:
If... | Then Use... | Why |
---|---|---|
RIN ≥ 8 & focus on mRNA only | Poly(A) Enrichment | High exonic yield, lower cost and simpler analysis |
RIN < 7 or FFPE | rRNA Depletion | Independent of poly(A) tails and better suited for degraded RNA |
Include non-coding or novel RNA | rRNA Depletion | Captures both polyA and non-polyA transcripts |
Budget constraints | Poly(A) Enrichment | Requires fewer reads (~13–15M) vs. 35–65M for rRNA depletion |
References: