A pseudogene can be defined as a non-functional replica of genomic DNA that bears striking similarity to functional genes but has forfeited its capacity to carry out its original function. These entities are commonly present within eukaryotic genomes and are regarded as vestiges of gene families that emerged throughout the course of evolution. Initially perceived as inert remnants of genes that had relinquished their protein-coding capabilities, pseudogenes have recently emerged as objects of research interest due to their potential functional roles, notably as non-coding RNAs (ncRNAs). They have been implicated in a diverse array of biological processes and diseases, including cancer.
CD Genomics provides sequencing-based non-coding RNAs solution, revealing the complex networks of interaction with other DNA/RNA/proteins.
Pseudogenes vs Genes
Genes | Pseudogenes | |
---|---|---|
Definition | Segments of DNA that encode functional proteins or RNA molecules. | Gene-like sequences have lost their ability to produce functional proteins or RNA molecules. |
Function | Determine traits and characteristics, involved in protein synthesis. | Provide genetic variation, regulatory functions, and evolutionary insights. |
Protein Production | Encode functional proteins or RNA molecules. | Unable to produce functional proteins or RNA molecules. |
Evolution | Subject to natural selection, can contribute to genetic disorders. | Arise from genes through duplication or retrotransposition. |
Expression | Actively transcribed and translated into proteins or RNA molecules. | May have promoter regions, exons, and introns, but mutations prevent expression. |
Biological Significance | Essential for cellular functions, determine traits, affected by mutations. | Can still have residual functions, regulate other genes, or produce noncoding RNAs. |
Pseudogenes can arise through different mechanisms. The two main classifications you mentioned are non-processing pseudogenes (replicative) and processing pseudogenes (retrotransposons).
Non-processing pseudogenes, also known as duplicative or non-functionalization pseudogenes, typically arise through gene duplication events. During these events, a functional gene is duplicated, resulting in two copies of the gene. Over time, mutations accumulate in one of the copies, rendering it non-functional. These mutations may include deletions, insertions, or point mutations that disrupt the coding sequence or regulatory regions of the gene. As a result, the non-functional pseudogene loses its ability to produce a functional protein and is often not transcribed or expressed.
Processing pseudogenes, also referred to as retroprocessed pseudogenes, originate from the reverse transcription of mRNA molecules and subsequent insertion into the genome. Retrotransposons, a type of mobile genetic element, play a key role in the formation of processing pseudogenes. Retrotransposons are transcribed into RNA molecules, which are then reverse transcribed back into DNA by the enzyme reverse transcriptase. This DNA copy is then inserted into a new location in the genome, potentially giving rise to a processing pseudogene. However, these pseudogenes also experience mutations and structural changes over time, leading to their loss of function.
Pseudogenes. (Cheetham et al., 2020)
Within the vast expanse of the human genome, there exists a fascinating collection of approximately 15,000 to 18,000 pseudogenes. However, it would be grossly inaccurate to dismiss these pseudogenes as mere genetic remnants or insignificant "junk" genes. In light of recent advancements in research, the study of pseudogenes has gained substantial momentum, revealing that many of these genetic entities possess meaningful biological functions, particularly those categorized as processing pseudogenes. These remarkable pseudogenes carry out their functions through a variety of intricate mechanisms.
Firstly, the transcripts of pseudogenes assume the role of antisense RNAs, effectively dampening the expression of functional genes. Through this mechanism, pseudogene transcripts act as molecular antagonists, engaging in an intricate dance of gene regulation.
Secondly, pseudogene transcripts have the remarkable capacity to generate endogenous small interfering RNAs. These minuscule RNA molecules serve as key regulators, intricately modulating the expression of functional genes. The pseudogenes, in this scenario, play an active role in orchestrating the genetic symphony within the intricate tapestry of human biology.
Several studies have shown that pseudogenes can regulate gene expression by various mechanisms, including acting as competing endogenous RNAs (ceRNAs) that sponge microRNAs, thereby modulating the expression of target genes. Pseudogenes can also serve as precursors for small interfering RNAs (siRNAs) or long non-coding RNAs (lncRNAs), which are involved in gene regulation and other cellular processes. They can function as oncogenes or tumor suppressors, impacting key cellular pathways involved in cancer biology. Furthermore, pseudogene expression patterns have been associated with patient prognosis and different cancer subtypes, suggesting their potential as biomarkers for diagnosis, prognosis, and therapeutic stratification.
Furthermore, the transcripts of pseudogenes engage in a fascinating phenomenon where they compete for binding to miRNA loci. As a striking example, let us consider the oncogene PTEN and its counterpart pseudogene, PTENP1. Remarkably, both PTEN and PTENP1 share a common miRNA binding site. In this intricate interplay, PTENP1 binds competitively to miRNA sites, ensuring the unhampered expression of the PTEN gene. This delicate equilibrium further underscores the intricate relationship between pseudogenes and functional genes.
Lastly, it is important to note that certain pseudogenes produce functional proteins, defying the conventional notion that they are mere genetic relics. This revelation challenges the previously held belief that pseudogenes lack biological significance, further highlighting the complexity of the human genome and the multifaceted nature of pseudogene functionality.
RNA sequencing (RNA-seq) is indeed a valuable choice for identifying pseudogenes and studying their functions. RNA-seq is a high-throughput sequencing technique that allows researchers to obtain a comprehensive snapshot of the transcriptome, including both coding and non-coding RNA molecules, in a given sample. Here's how RNA-seq can be used to study pseudogenes.
Pseudogene expression analysis. (Kalyana-Sundaram et al., 2012)
By integrating RNA-seq data with complementary techniques and datasets, researchers can comprehensively understand pseudogenes, encompassing their expression patterns, functional roles in normal development, and their implications in disease, including cancer.
References: