DNA-seq and RNA-Seq are often used in combination to provide more comprehensive genomic and transcriptomic data to advance disease research, personalized medicine, agricultural breeding, and many other fields.
However, there are major differences between the two:
The purpose of DNA-seq is to determine the order of the nucleotides (A, T, C, G) in DNA molecules, so as to comprehensively analyze and study specific genomes. It is capable of sequencing the genome (the entire set of genetic material) of an organism or deciphering the sequence of a particular gene; it can reveal the structure of genes as well as determine which genes are present in the chromosomes and where they are located, which is convenient for us to isolate and identify genes of interest or unknown.
1. It can reveal the genetic information of an individual or a species, including gene sequences, gene mutations, gene polymorphisms and so on. This information is of great significance for us to understand the pathogenesis of hereditary diseases, predict the risk of diseases, and formulate personalized treatment plans.
2. Through DNA-seq, it can detect whether an individual carries a gene mutation related to a specific disease. For example, in prenatal diagnosis, fetal chromosomal disorders such as Down's syndrome and Edward's syndrome can be screened by detecting fetal free DNA in maternal peripheral blood. In addition, DNA-seq can be used for tumor diagnosis, which provides a basis for tumor typing, prognosis assessment and treatment selection by analyzing the genetic variation of tumor cells.
3. In the fields of ecology and biosystematics, DNA-seq technology is used to analyze the genome sequences of different species, revealing the kinship relationship between species, evolutionary history and biodiversity, which is of great significance for the conservation, utilization and management of biological resources and management of biological resources.
1. Gene expression analysis: RNA-seq can comprehensively determine the expression levels of all genes in cells or tissues, helping us to understand the changes in gene expression under different physiological and pathological conditions. By analyzing transcriptome data, we can study the activity and expression patterns of genes and their roles in specific biological processes (e.g., development, immune response, stress response, etc.).
2. Disease Mechanism Research and Diagnosis: RNA-seq can be used to analyze the changes in gene expression related to diseases and help reveal the molecular mechanisms of diseases. For example, by comparing the transcriptome differences between cancer tissues and normal tissues, RNA-Seq is able to detect abnormal expression of tumor-related genes, providing important information for early diagnosis, prognosis assessment and personalized treatment of cancer. In addition, RNA-seq can also be used to identify mutations or abnormal expression of genes related to genetic diseases, neurodegenerative diseases, etc.
3. Transcript and gene structure research: RNA-seq not only quantitatively analyzes the level of gene expression, but also reveals the whole picture of transcripts, including different splice variants, gene fusions, non-coding RNAs and so on. By deeply analyzing the structure of transcripts, RNA-Seq provides an important tool for gene annotation, functional studies and new gene discovery.
WGS means sequencing the entire DNA genome, i.e., the complete genome sequence from the first DNA to the last DNA in the cells of a species, completely detected and in order. This technology is therefore capable of identifying almost any type of mutation in the genome, including SNVs, insertion deletions, structural variants and CNV-coding sequences. With WGS we can explore species without reference genomes or use it for some organisms with poor reference quality.
WES, on the other hand, focuses on sequencing the mRNA coding regions (exons), which usually represent a very small portion of the genome (e.g., 3% in humans), omitting regulatory regions such as promoters and enhancers. WES allows the detection of SNVs and insertion deletions in protein-coding genes as well as other functional elements such as microRNA sequences.
Both WGS and WES can be used to find rare mutations/common variants associated with a disease or phenotype, but WES is much cheaper and faster than WGS.
Targeted sequencing focuses only on a subset of the target gene (or target region) and is faster and cheaper than the previous two. Targeted sequencing focuses on only one area, is easy to analyze and is sensitive enough to detect low-frequency variants, especially in tumor samples where targeted sequencing is often needed.
Unlike DNA-seq, RNA-seq requires reverse transcription of extracted RNA into cDNA before amplification.The most common applications of RNA-seq are to detect changes in gene expression, variable splicing, post-transcriptional modifications, gene fusions, and also to detect mutations and SNPs.
If you want to know more about RNA-seq, please click "RNA Sequencing Techniques".
(1) By reading the DNA fragments in the genome, accurate nucleotide sequences can be obtained, which can directly reflect the genetic background and genetic information of the individual.
(2) DNA-seq can detect various types of genetic variation, such as : SNP, Indel, CNV, structural variation including chromosome rearrangement, inversion, translocation, etc. Through these information, we can analyze the genetic background of individuals, identify disease-related gene mutations, and study the evolutionary relationship of species.
(1) By calculating the reads count or transcript expression of specific genes (such as FPKM, TPM and other standardized values), we can understand the activity of genes in specific cells or tissues.
(2) Different exon combinations of a gene can produce multiple transcripts ; a gene may have many different splicing forms to produce multiple transcripts.
(3) RNA-seq data are usually compared to the target reference genome to determine the location and structure of each transcript. Each reading corresponds to a specific gene or transcript, so the expression level of the gene can be speculated by counting these readings.
(4) By comparing RNA sequence data under different conditions (such as health and disease, different tissues or different experimental groups), DEGs can be identified and important changes in biological processes can be revealed. For example, in cancer research, RNA-seq can identify genes with significantly altered expression levels in cancer cells, providing clues for early diagnosis and targeted therapy of cancer.
(5) RNA-seq not only focuses on mRNAs encoding proteins, but also analyzes non-coding RNAs, such as lncRNAs and miRNAs.
Overview of RNA-Seq (Kukurba KR et al., 2015)
Through the use of these different tools, we can also see that DNA-seq and RNA-seq have different focuses and have very different analysis processes.
In addition to the above differences in these processes, RNA-seq also requires additional steps such as RNA separation, reverse transcription (general sequencing technology is based on DNA and cannot directly detect RNA), and sometimes RNA needs to be fragmented before sequencing.
The fields of application of both DNA-seq and RNA-seq are cross-cutting, especially in disease research, but there are still differences. For example, generally, cancer patients need RNA-seq after DNA-seq. DNA testing is done to detect gene mutations in order to find possible therapeutic targets. However, DNA testing may miss something, so RNA-seq is also needed.
In particular, the presence of fusion genes is found to be an important form of mutation that drives tumor development. Currently, the FDA or NMPA has approved a number of corresponding targeted drugs for clinical treatment against fusion targets such as ALK, RET, ROS1, and NTRK1/2/3. Therefore, it is very important to detect fusion genes more efficiently in order to obtain highly effective targeting information.
DNA-seq and RNA-seq are two important technologies for detecting fusion genes, but they are very different. DNA-seq is based on the DNA level, but the breakpoints of fusion genes usually occur in long intronic regions, and the breakpoints of fusion genes vary across patients and diseases in real life. However, DNA-seq cannot accurately cover the long intronic regions, which contain a large number of repetitive sequences, making it difficult to identify fusion genes.
Moreover, DNA-seq is limited in its ability to detect some DNA fragments with high GC content, as the designed probes cannot effectively capture these fragments. Additionally, the intronic sequences of most genes in organisms contain very similar repetitive sequences, making it difficult for DNA-seq to accurately identify these sequences in different genes. Lastly, genes undergo a complex post-transcriptional splicing process, which affects the location of fusion gene breakpoints. Based on these reasons, DNA-seq is defective in detecting fusion genes and needs to be supplemented with RNA-seq.
On the other hand, RNA-seq can be more effective for detecting fusion genes. This is because the expression of fusion genes at the RNA level connects the two exons before and after, and the fusion breakpoint is relatively fixed. This feature facilitates the design of primers or probes more efficiently and accurately. At the RNA level, we can also detect the expression of fusion genes more effectively and detect functional changes, which can lead to more accurate follow-up treatment or medication.
Furthermore, evidence shows that detecting fusion genes at the RNA level is easier and more reliable than at the DNA level.
Studies have shown that MET inhibitors achieved good anti-tumor effects in patients with MET exon 14 jumping mutations, which suggests that MET exon 14 jumping mutations have the potential to serve as a new and important target for the treatment of NSCLC patients, and therefore the accurate detection of MET exon 14 jumping mutations is of great significance in guiding the clinic.Kurtis et al. respectively used DNA-seq and RNA-seq on tumor tissues from NSCLC patients, DNA-seq detected MET exon 14 jump mutations in 11 out of 856 NSCLC samples (1.3%), while RNA-seq detected MET exon 14 jump mutations in 17 out of 404 samples (4.2%), and RNA-seq was more sensitive than DNA- seq detection was more sensitive(Davies KD et al,.2019).
The causative factor of NSCLC is also mostly the emergence of fusion genes, and the fusion genes found to cause NSCLC are ALK, ROS1, RET, NTRK, NRG1, etc. These discovered gene fusions also provide us with precise therapeutic targets for the treatment of NSCLC. Then accurate detection of fusion genes has significance for us, most of them were previously detected by using DNA-seq, but it was found that DNA NGS could not detect all gene fusion events very accurately, the researchers further detected more than 2500 lung adenocarcinomas by using RNA NGS, and found that 195 (7.7%) gene fusions were detected and 119 (4.7%) MET exon 14 jump mutations. Moreover, when testing tissues previously tested by DNA NGS, only 254 of the 275 materials were available for RNA testing, but of these 254, alterations in mitogenic drivers that were not detected by DNA NGS were detected in 36 (14%) of the 254. 33 of the 36 patients were amenable to treatment, and the final 10 patients received specialized targeted therapy. Eight patients showed favorable clinical outcomes. This suggests that RNA NGS can detect gene fusion events not detected by DNA NGS(Benayed R et al,.2019).
More applications, refer to "Overview of RNA Sequencing Applications"
Summary table comparing DNA sequencing (DNA-seq) and RNA sequencing (RNA-seq):
Aspect | DNA Sequencing (DNA-seq) | RNA Sequencing (RNA-seq) |
---|---|---|
Purpose | Analyzes the DNA sequence to study genes and mutations. | Analyzes RNA to study gene expression and splicing. |
Applications | 1. Genome analysis 2. Disease diagnosis (mutations) 3. Evolution studies |
1. Gene expression analysis 2. Disease research (e.g., cancer) 3. Splicing & transcript discovery |
Technology | Whole genome sequencing (WGS), exome sequencing (WES), targeted sequencing | Whole transcriptome sequencing (WTS), mRNA-seq, small RNA-seq |
Sample Type | DNA is stable and easy to extract. | RNA is unstable and degrades quickly, requiring careful handling. |
Data Output | DNA sequence reads showing mutations and variations. | RNA reads showing gene expression and splicing patterns. |
Data Analysis Tools | Tools like BWA, GATK, and CNVnator for mutations and variations. | Tools like STAR, DESeq2, and Kallisto for expression analysis. |
Sequencing Platforms | Illumina, Ion Torrent, PacBio, Oxford Nanopore | Illumina, Ion Torrent, PacBio, Oxford Nanopore |
Use in Disease | Detects genetic mutations related to diseases. | Detects changes in gene expression linked to diseases. |
Fusion Gene Detection | Less effective for fusion genes. | More effective for detecting fusion genes in cancers. |
Cost | WGS is expensive, WES is cheaper, targeted is the cheapest. | RNA-seq can be expensive, especially for full transcriptome. |
References: