RAP-Seq Service

RNA Antisense Purification Sequencing — genome-wide mapping of lncRNA-chromatin interactions, RNA-RNA interactions, and RNA-protein complexes.

Long non-coding RNAs (lncRNAs) play critical roles in gene regulation, chromatin organization, and nuclear architecture — often by binding to specific genomic regions, interacting with other RNAs, or recruiting protein complexes to their sites of action. Understanding where a lncRNA binds across the genome is essential for deciphering its molecular function.

Our RAP-Seq (RNA Antisense Purification Sequencing) service provides end-to-end solutions for capturing and sequencing lncRNA interaction partners using biotinylated antisense oligonucleotide probes. We offer RAP-DNA for genome-wide chromatin interaction mapping, RAP-RNA for RNA-RNA interactome identification, and integrated bioinformatic analysis to deliver publication-ready results.

  • Targeted lncRNA-chromatin interaction mapping — identify genome-wide DNA binding sites of any lncRNA of interest
  • RAP-DNA and RAP-RNA dual-readout capability from the same capture experiment
  • Long tiled probes (120 nt, every 15 nt) for robust capture of structured and low-abundance lncRNAs
  • Multiple crosslinking strategies — formaldehyde (FA), DSG+FA, AMT — for direct and indirect interaction capture
  • End-to-end bioinformatics — from raw sequencing data through peak calling to functional annotation
Submit Your Request Now

RAP-Seq principle: biotinylated antisense probes capture target lncRNA with bound chromatin for high-throughput sequencing

Overview Comparison Advantages Workflow Bioinformatics Strategy Applications Demo Case FAQ

RAP-Seq Overview

RNA Antisense Purification followed by high-throughput sequencing (RAP-Seq) is a powerful method for mapping the molecular interactions of a target RNA — most commonly a lncRNA — at genome-wide scale. The method uses biotinylated antisense DNA probes (typically 120 nucleotides) tiled across the entire target RNA to capture it from crosslinked cellular lysate, along with its bound DNA, RNA, and protein partners. The enriched nucleic acids are then eluted and analyzed by high-throughput sequencing to identify interaction sites.

RAP-Seq encompasses two complementary readout strategies: RAP-DNA identifies genomic DNA loci bound by the target RNA, providing a genome-wide chromatin interaction map; RAP-RNA identifies RNA molecules that base-pair with or are indirectly associated with the target RNA, revealing its RNA-RNA interactome. When combined with control probes (e.g., scrambled sequence or input normalization) and appropriate crosslinking conditions, RAP-Seq delivers high-confidence interaction maps that distinguish direct from indirect contacts.

RAP-Seq is closely related to other antisense capture methods including ChIRP-MS and RAP-MS, but differs in its sequencing-based readout. Our service integrates seamlessly with complementary approaches such as RNA Interaction analysis, RIC-seq, and RADICL-seq for comprehensive RNA-centric interaction profiling.

RAP-Seq vs. ChIRP-Seq vs. CHART-Seq: Method Comparison

Feature RAP-Seq ChIRP-Seq CHART-Seq
Probe Length 120 nt long ssDNA probes ~20 nt short oligos ~25 nt short oligos
Probe Tiling Tiled every ~15 nt across full RNA Tiled every ~100 bp across full RNA Targets RNase H-accessible regions only
Probe Design Requirement No prior knowledge of binding domains or accessible regions needed No prior knowledge needed Requires RNase H mapping to identify accessible regions
Background Noise Low — long probes provide high binding affinity and specificity Moderate — shorter probes may have higher off-target binding Low — but requires pre-selection of accessible regions
Crosslinking FA, DSG+FA, AMT, or UV GA or FA FA
Readout DNA-seq (RAP-DNA) + RNA-seq (RAP-RNA) from same capture DNA-seq (ChIRP-seq); separate MS for proteins (ChIRP-MS) DNA-seq (CHART-seq)
Controls Scramble probe, input, no-crosslink, RNase-treated Scramble probe, input, RNase A+H treatment Scramble probe, input, RNase H treatment
Structured RNA Capture Robust — long probes overcome secondary structure Moderate — shorter probes may fail at structured regions Limited — only targets accessible regions
Low-Abundance RNA Good — long probes increase capture efficiency Moderate Moderate

RAP-Seq offers significant advantages for lncRNAs with complex secondary structure or low expression levels, where longer probes provide more robust capture and lower background. The dual DNA/RNA readout capability from a single capture experiment provides complementary information about both chromatin targeting and RNA interaction partners.

Technical Advantages of Our RAP-Seq Service

Long Tiled Probe Design for Robust Capture

Our probe design pipeline generates 120-nt biotinylated ssDNA probes tiled every ~15 nt across the full-length target RNA sequence. This tiling strategy ensures efficient capture even in the presence of extensive RNA secondary structure, protein-RNA interactions, or partial RNA degradation — all common challenges with lncRNAs. The long probes provide higher binding affinity and lower background noise compared with short-oligo methods. Each probe set is computationally filtered against the reference genome to remove cross-reactive sequences and optimized for GC content (40–60%). Even/odd probe pool splitting enables independent biological replicate validation.

Dual RAP-DNA and RAP-RNA Readout from a Single Capture

RAP-Seq uniquely enables parallel sequencing of both the bound DNA (RAP-DNA) and co-purified RNA (RAP-RNA) fractions from the same capture experiment. This dual-readout capability provides a comprehensive view of lncRNA function: RAP-DNA reveals the genomic loci where the lncRNA is localized (chromatin binding sites), while RAP-RNA identifies RNA molecules that directly or indirectly interact with the target. Integrated analysis of both datasets enables researchers to distinguish between lncRNAs that act as chromatin guides, RNA sponges, or scaffold molecules.

Versatile Crosslinking Strategies for Interaction Specificity

Different crosslinking chemistries capture distinct classes of molecular interactions. Our service offers multiple crosslinking options: formaldehyde (FA) crosslinks RNA-protein and protein-DNA contacts, capturing both direct and indirect RNA-chromatin interactions; DSG+FA dual crosslinking enhances capture of protein-mediated RNA-chromatin contacts; AMT (4'-aminomethyltrioxsalen) specifically crosslinks directly base-paired RNA-RNA duplexes without protein crosslinking. This modular crosslinking approach allows researchers to dissect direct from indirect interactions and build mechanistic models of lncRNA function.

RAP-Seq Workflow Overview

Our RAP-Seq service follows a streamlined 5-step workflow from probe design and validation through to bioinformatic analysis. Each step is optimized for robust lncRNA capture and high-confidence interaction mapping.

  • Probe Design and Synthesis — Target RNA sequence (full-length transcript) is obtained from reference databases. Biotinylated ssDNA probes (120 nt) are designed tiled every ~15 nt across the entire transcript. Probes are filtered for repetitive elements, GC content optimization (40–60%), and cross-genome alignment to remove off-target potential. Probes are synthesized as biotinylated ssDNA via PCR amplification and in vitro transcription with biotinylated primers.
  • Cell Crosslinking and Lysis — Live cells are crosslinked in situ using the selected crosslinking chemistry (FA, DSG+FA, or AMT). Crosslinked cells are lysed in denaturing conditions, and chromatin is fragmented by DNase I digestion and sonication to 100–300 bp fragments. The lysate is pre-cleared with streptavidin beads to reduce non-specific background.
  • Antisense Probe Hybridization and Capture — Biotinylated probes are hybridized to the target RNA in the crosslinked lysate under stringent conditions. Streptavidin-coated magnetic beads capture the probe-RNA-chromatin complexes. After extensive washing, bound complexes are eluted. Even and odd probe pools are used in separate captures for independent biological validation.
  • Library Preparation and Sequencing — Eluted DNA (RAP-DNA) and RNA (RAP-RNA) are separately processed. DNA is purified and used for library preparation directly. RNA is reverse transcribed and converted into strand-specific sequencing libraries. Libraries are sequenced on Illumina NovaSeq 6000 or NovaSeq X Plus (PE150 for DNA, PE150/SE50 for RNA).
  • Bioinformatic Analysis — Raw sequencing data are processed through our dedicated RAP-Seq bioinformatics pipeline: adapter trimming, read alignment, peak calling (MACS2 or custom), genomic feature annotation, motif analysis, and functional enrichment. RAP-RNA data undergoes transcript identification and quantification. Integrated RAP-DNA + RAP-RNA analysis reveals the full interaction landscape.

RAP-Seq workflow from probe design to sequencing and bioinformatic analysis

Bioinformatics and Data Analysis

Our bioinformatics pipeline for RAP-Seq data processing is designed to deliver high-confidence interaction sites with comprehensive functional annotation. The pipeline handles both RAP-DNA and RAP-RNA data streams, with tailored analytical modules for each readout type.

Analysis Package Content Description
Standard Analysis
1. Raw Data QC and Preprocessing FastQC quality assessment, adapter trimming (Cutadapt), read filtering for quality (Q ≥ 30), PCR duplicate removal. Alignment statistics including mapping rate, duplication rate, and library complexity metrics.
2. Read Alignment RAP-DNA: alignment to reference genome (Bowtie2 / BWA-MEM). RAP-RNA: alignment to reference transcriptome and genome (STAR). Spike-in normalization where applicable. Multi-mapping read resolution.
3. Peak Calling (RAP-DNA) Identification of significantly enriched genomic intervals (MACS2, SICER, or custom peak caller). Input normalization and scramble probe subtraction. Irreproducible Discovery Rate (IDR) analysis across even/odd probe replicates. Peak annotation to genomic features (promoters, gene bodies, enhancers, intergenic).
4. Target RNA Identification (RAP-RNA) Transcript quantification and identification of enriched RNAs compared to scramble control. Identification of specific RNA-RNA interaction partners including mRNA, lncRNA, snoRNA, snRNA targets. Base-pairing prediction for directly crosslinked RNAs (AMT datasets).
5. Genomic Feature and Functional Annotation Peak-to-gene assignment, GREAT analysis for biological process enrichment, KEGG and Reactome pathway analysis. Chromatin state enrichment analysis using Roadmap Epigenomics or ENCODE reference epigenomes.
Advanced Analysis
6. Motif and Sequence Analysis De novo motif discovery (MEME, HOMER) at RAP-DNA peak regions. Sequence composition analysis, GC content profiling, and repeat element enrichment. Transcription factor binding motif enrichment for mechanistic inference.
7. Comparative and Differential Binding Analysis Differential peak analysis between conditions (e.g., differentiation time points, disease vs. control, knockdown vs. WT). Quantitative comparison of RAP-DNA enrichment across experimental groups using DiffBind or similar.
8. Multi-Omics Integration Integration with RNA-seq, ChIP-seq, ATAC-seq, and Hi-C data to contextualize lncRNA binding sites within the chromatin landscape. Correlation analysis with gene expression changes upon lncRNA perturbation.
9. Network and Functional Inference Construction of lncRNA-centric interaction networks integrating DNA binding sites, RNA interaction partners, and protein interaction data. Functional enrichment of target genes and pathway over-representation analysis.

Our analysis team provides a comprehensive report with publication-ready figures including genome browser tracks, peak heatmaps, motif logos, functional enrichment plots, and integrated network visualizations. Data are delivered in standard formats (BAM, bigWig, BED, peak BED, interaction tables) compatible with downstream analysis tools.

Analytical Strategy for RAP-Seq Experiments

Successful RAP-Seq experiments require careful experimental design addressing probe design optimization, crosslinking strategy selection, and appropriate controls. Our analytical strategy is built on three pillars: rigorous probe design and validation, optimized crosslinking and capture conditions, and integrated multi-readout data interpretation.

Probe Design and Validation Strategy

The quality of RAP-Seq data depends critically on probe design. Our approach includes:

  • Transcript isoform-aware design — Probes are designed against the biologically relevant transcript isoform, avoiding regions subject to alternative splicing. For lncRNAs with multiple isoforms, we design probes targeting shared exons or isoform-specific regions depending on the research question.
  • Repetitive element filtering — Probes containing repetitive sequences (SINEs, LINEs, LTRs, simple repeats) are computationally masked and removed to prevent off-target capture of repetitive genomic elements.
  • Cross-genome specificity filtering — Designed probes are aligned against the reference genome to identify and remove probes with off-target binding potential. Only probes with unique genomic alignment are retained.
  • Even/odd pool splitting — Probes are divided into two independent pools (even and odd) based on tiling position. Independent captures with each pool serve as biological replicates, enabling IDR analysis for high-confidence peak calling.
  • RT-qPCR validation — Enrichment of the target RNA is validated by RT-qPCR using primers spanning non-probe regions before proceeding to sequencing.

Crosslinking Strategy Selection

Crosslinking strategy is selected based on the biological question:

  • Formaldehyde (FA) — Recommended for standard chromatin interaction mapping. Captures RNA-protein and protein-DNA contacts. Best for RAP-DNA.
  • DSG + FA dual crosslinking — Recommended for weak or transient interactions. Enhances protein-mediated RNA-chromatin contacts.
  • AMT crosslinking — Used for direct RNA-RNA interaction mapping (RAP-RNA only). Does not crosslink proteins.

Analytical strategy for RAP-Seq experiments from probe design to data integration

Applications

RAP-Seq has broad applications across lncRNA biology, chromatin regulation, and gene expression control. The following application areas are particularly well-suited to our approach.

LncRNA-Chromatin Interaction Mapping

RAP-DNA is the method of choice for identifying the genomic binding sites of chromatin-associated lncRNAs. Whether studying X-chromosome inactivation (Xist), imprinting control (Kcnq1ot1, Airn), enhancer-associated lncRNAs, or chromatin looping regulators (Firre), RAP-Seq provides genome-wide maps of lncRNA occupancy at high resolution. This application is foundational for understanding how lncRNAs contribute to gene silencing, activation, and three-dimensional genome organization.

RNA-RNA Interactome Discovery

RAP-RNA enables systematic identification of RNA molecules that interact with a target lncRNA, including mRNA, other lncRNAs, small nuclear RNAs (snRNAs), small nucleolar RNAs (snoRNAs), and repetitive RNA elements. This application has revealed that many lncRNAs interact with nascent pre-mRNAs to localize to active chromatin, and that RNA-RNA base-pairing can direct lncRNAs to specific genomic regions. AMT-crosslinked RAP-RNA specifically captures directly base-paired interactions.

LncRNA Functional Characterization

Integrating RAP-DNA binding maps with transcriptomic data (RNA-seq after lncRNA knockdown or knockout) provides mechanistic insights into lncRNA function. Genes proximal to lncRNA binding sites that change expression upon lncRNA perturbation represent likely direct targets. Combined with motif analysis and chromatin state data, RAP-Seq can distinguish between lncRNAs that act as guides (recruiting chromatin modifiers to specific loci), scaffolds (bridging proteins and chromatin), or decoys (sequestering proteins from chromatin).

Nuclear RNA Localization and Dynamics

RAP-Seq can be applied across different cellular states — differentiation time courses, disease models, or drug treatments — to study how lncRNA chromatin localization changes dynamically. Comparative RAP-DNA analysis between conditions reveals condition-specific binding sites and identifies genomic regions where lncRNA occupancy correlates with transcriptional changes. Spatial genomic context analysis (through integration with Hi-C or similar data) can reveal whether lncRNAs bind preferentially at topologically associating domain (TAD) boundaries, active/inactive compartments, or specific chromatin states.

Mechanistic Studies of LncRNA in Development and Disease

Many disease-associated lncRNAs exert their effects through chromatin binding. RAP-Seq can identify the genomic targets of lncRNAs implicated in cancer (MALAT1, HOTAIR, NEAT1, NORAD, PVT1), cardiovascular disease, neurological disorders, and developmental defects. By mapping lncRNA binding sites in disease-relevant cell types and comparing with disease-associated genetic variants, researchers can connect lncRNA dysfunction to specific gene regulatory circuits. The combination of RAP-DNA and RAP-RNA data provides a comprehensive mechanistic view of lncRNA function in disease contexts.

Deliverables

Sample Requirements

Sample Type Recommended Amount Notes
Adherent cells ≥2 × 10⁷ cells per capture 2–3 biological replicates recommended; 1× T175 flask typically yields ~1 × 10⁷ cells
Suspension cells ≥2 × 10⁷ cells per capture Crosslinking performed in suspension prior to lysis
Fresh tissue ≥100 mg per capture Minced and crosslinked immediately after collection; tissue disaggregation required
Cell pellet (snap-frozen) ≥5 × 10⁷ cells Must be crosslinked before freezing; pre-crosslinked pellets accepted
Target RNA information Full-length transcript sequence Transcript ID (RefSeq/Ensembl) or FASTA sequence required for probe design

Important Notes:

  • All cell samples must be processed as live cells for in vivo crosslinking. Formaldehyde-fixed cell pellets are the preferred format for sample shipment.
  • Each RAP-Seq experiment requires a minimum of 2 capture reactions (even and odd probe pools) plus 1–2 control captures (scramble probe and/or input), requiring ≥6 × 10⁷ total cells for a standard design.
  • The target lncRNA expression level should be confirmed by RT-qPCR prior to RAP-Seq. Recommended minimum expression: >1 copy per cell (or Cq < 32 in 10 ng cDNA from 10⁶ cell equivalents).
  • For low-expressed lncRNAs, additional input material may be required. Please consult with our team for feasibility assessment.
  • Control samples (scramble probe capture, input chromatin) are essential for background subtraction and are included as standard in our service.
  • For species without an annotated reference genome, de novo sequencing and assembly may be required for probe design — please consult with our project team.

Demo Results

Representative RAP-Seq data outputs from typical lncRNA chromatin interaction mapping experiments.

RAP-DNA peak calling results — Genome-wide identification of significantly enriched genomic intervals from RAP-DNA sequencing, showing peak distribution across chromosomes and enrichment relative to input and scramble controls.

Genome browser tracks — IGV browser view of RAP-DNA signal at representative genomic loci, comparing even and odd probe pools, scramble control, and input, with annotated gene tracks for context.

Peak annotation and genomic feature distribution — Bar chart and pie chart showing the distribution of RAP-DNA peaks across genomic features: promoters, gene bodies (intronic/exonic), intergenic regions, enhancers, and repetitive elements.

Motif discovery and transcription factor enrichment — Top de novo motifs identified at RAP-DNA peak regions with E-values and best-matched transcription factor binding motifs.

RAP-RNA interaction partners heatmap — Heatmap showing enriched RNA interaction partners identified in RAP-RNA data, with clustering by RNA type (mRNA, lncRNA, snRNA, snoRNA).

Integrated RAP-DNA and RNA-seq analysis — Scatter plot or genome browser view correlating RAP-DNA binding with gene expression changes upon lncRNA perturbation, identifying potential direct target genes.

RAP-DNA peak calling results across genome RAP-DNA peak calling results

Genome browser tracks showing RAP-DNA signal Genome browser tracks

Peak annotation and genomic feature distribution Peak annotation and genomic feature distribution

Motif discovery and transcription factor enrichment Motif discovery and transcription factor enrichment

RAP-RNA interaction partners heatmap RAP-RNA interaction partners heatmap

Integrated RAP-DNA and RNA-seq analysis Integrated RAP-DNA and RNA-seq analysis

Case Study: RAP Reveals Intronic Determinants of Charme LncRNA Chromatin Retention and Myogenic Function

A 2020 study published in Cell Reports by Desideri and colleagues used RNA Antisense Purification (RAP) combined with mass spectrometry and functional genomics to investigate how the muscle-specific lncRNA Charme (Chromatin architect of muscle expression) is retained on chromatin and how its intronic sequences coordinate nuclear activity through RNA-binding protein interactions.

Charme is a muscle-specific lncRNA essential for myogenesis — the process of muscle cell differentiation. Previous work had shown that Charme exists in two isoforms generated by alternative splicing: a nuclear, chromatin-retained isoform (pCharme) that retains a large ~11 kb intron-1, and a cytoplasmic, spliced isoform (mCharme). The mechanism by which pCharme is retained on chromatin and the functional role of the retained intron in mediating chromatin interactions were unknown.

Study design for Charme lncRNA RAP analysis showing probe design and experimental workflowFigure 1. Experimental strategy for RAP-based identification of Charme lncRNA interaction partners.
Biotinylated antisense DNA probes were designed tiled across the pCharme lncRNA sequence. C2C12 myoblast cells were crosslinked with formaldehyde, and RAP was performed to capture pCharme along with its bound protein and nucleic acid partners. Mass spectrometry identified MATR3 and PTBP1 as the principal protein interactors. Adapted from Desideri et al. 2020 (CC BY 4.0).

RAP-based approach: The authors designed biotinylated antisense DNA probes tiled across the pCharme lncRNA sequence. C2C12 mouse myoblasts were crosslinked with formaldehyde to stabilize RNA-protein and protein-DNA contacts. RAP was performed by hybridizing the probe set to the crosslinked lysate, capturing the probe-RNA-chromatin complexes on streptavidin beads, and eluting the bound material. Captured proteins were identified by quantitative mass spectrometry (RAP-MS), while the RAP capture methodology used is directly applicable to RAP-DNA and RAP-RNA sequencing readouts for comprehensive interaction mapping. RNA-seq was performed after Charme knockdown and intron-1 deletion to assess downstream transcriptional effects. CRISPR-Cas9 was used to generate mice carrying a deletion of the alternatively spliced intron-1 to validate its functional importance in vivo.

Charme lncRNA chromatin retention mechanism identified by RAPFigure 2. RAP-identified mechanism of Charme lncRNA chromatin retention.
The pCharme lncRNA isoform retains intron-1, which serves as a docking platform for MATR3 and PTBP1 binding. The MATR3/pCharme interaction is required for chromatin retention of the lncRNA. Deletion of intron-1 by CRISPR-Cas9 releases pCharme from chromatin and causes cardiac defects in mice, phenocopying the full-length Charme knockout. Adapted from Desideri et al. 2020 (CC BY 4.0).

Key findings: (1) RAP-MS identified MATR3 (a multifunctional RNA/DNA binding protein) and PTBP1 (a splicing regulator) as the principal nuclear interactors of pCharme. (2) Both proteins bind specifically to sequences within the retained intron-1 of pCharme, forming nuclear ribonucleoprotein aggregates. (3) PTBP1 acts as a splicing repressor of intron-1 — its depletion increases the mCharme/pCharme ratio by promoting intron-1 removal. (4) MATR3 is required for pCharme's chromatin localization — upon MATR3 depletion, pCharme is released from chromatin and redistributes to the nucleoplasm. (5) Conversely, pCharme influences MATR3's own chromatin binding — Charme depletion causes MATR3 to relocate from chromatin to the nucleoplasm. (6) CRISPR-Cas9 deletion of intron-1 in mice caused pCharme release from chromatin and resulted in cardiac defects, matching the phenotype of full-length Charme knockout. This study demonstrates how RAP-based approaches can dissect the molecular mechanisms of lncRNA chromatin function and identify critical sequence determinants of lncRNA nuclear activity.

FAQs — Frequently Asked Questions

References:

  1. Engreitz JM, Pandya-Jones A, McDonel P, et al. The Xist lncRNA exploits three-dimensional genome architecture to spread across the X chromosome. Science. 2013;341(6147):1237973.
  2. Engreitz JM, Sirokman K, McDonel P, et al. RNA-RNA interactions enable specific targeting of noncoding RNAs to nascent pre-mRNAs and chromatin sites. Cell. 2014;159(1):188-199.
  3. Desideri F, Cipriano A, Petrezselyova S, et al. Intronic Determinants Coordinate Charme lncRNA Nuclear Activity through the Interaction with MATR3 and PTBP1. Cell Reports. 2020;33(12):108548.
  4. McHugh CA, Chen CK, Chow A, et al. The Xist lncRNA interacts directly with SHARP to silence transcription through HDAC3. Nature. 2015;521(7551):232-236.

For Research Use Only. This service is intended for exploratory and mechanistic research applications, including lncRNA-chromatin interaction mapping, RNA-RNA interactome discovery, and lncRNA functional characterization. It is not intended for clinical diagnosis, treatment selection, patient stratification, or therapeutic decision-making.



Inquiry
  • For research purposes only, not intended for clinical diagnosis, treatment, or individual health assessments.
RNA
Research Areas
Copyright © CD Genomics. All rights reserved.
Top