Eukaryotic genes commonly generate multiple transcript isoforms which are important for understanding many biological processes and human diseases. Identification of isoforms has traditionally been completed using short-read technologies that limit read length and hinder isoform discovery.
The increased utilization of long-read sequencing has allowed researchers to sequence thousands of bases in a single read and has improved the quantification and discovery of spice junctions. However, deficiencies with current computational tools motivated researchers from the Children’s Hospital of Philadelphia (CHOP) to develop their own tool, ESPRESSO (Error Statistics PRomoted Evaluator of Splice Site Options).
This new tool uses long-read sequencing data and processes the resulting alignments to improve splice junction accuracy and isoform quantification. ESPRESSO was designed to compare the long reads for each gene to its corresponding genomic DNA. Then the error patterns of the reads are analyzed to locate splice junctions and complete isoforms.
The CHOP researchers tested ESPRESSO using data generated from native DNA and RNA sequenced on Oxford Nanopore Technologies devices, along with simulated sequencing data. Over 1 billion long reads were analyzed, covering 30 human tissue types and three human cell lines.
Using long reads to cross-reference transcripts to the genomic DNA allowed them to identify undocumented isoforms and splice junctions. In addition, ESPRESSO was able to accurately discover RNA isoforms and quantify them better than several other contemporary tools designed for transcript isoform analysis
This computational tool has demonstrated that it can be a useful resource for investigating RNA from eukaryotic transcriptomes. Researchers from CHOP believe using ESPRESSO with long-read RNA sequencing will aid in our understanding of RNA variation and its role in genetic diseases.
More information about ESPRESSO can be found on its corresponding GitHub page or by reading the published manuscript.
Header Leaderboard Ad
Collapse
ESPRESSO: Quantifying transcript isoforms from long-read sequencing
Collapse
Announcement
Collapse
No announcement yet.
Latest Articles
Collapse
-
by seqadmin
Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...-
Channel: Articles
03-10-2023, 05:31 AM -
-
by seqadmin
Using automation to prepare sequencing libraries isn’t a new concept, and most researchers are aware that there are numerous benefits to automating this process. However, many labs are still hesitant to switch to automation and often believe that it’s not suitable for their lab. To combat these concerns, we’ll cover some of the key advantages, review the most important considerations, and get real-world advice from automation experts to remove any lingering anxieties....-
Channel: Articles
02-21-2023, 02:14 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 03-17-2023, 12:32 PM
|
0 responses
7 views
0 likes
|
Last Post
by seqadmin
03-17-2023, 12:32 PM
|
||
Started by seqadmin, 03-15-2023, 12:42 PM
|
0 responses
17 views
0 likes
|
Last Post
by seqadmin
03-15-2023, 12:42 PM
|
||
Started by seqadmin, 03-09-2023, 10:17 AM
|
0 responses
66 views
1 like
|
Last Post
by seqadmin
03-09-2023, 10:17 AM
|
||
Started by seqadmin, 03-03-2023, 12:03 PM
|
0 responses
64 views
0 likes
|
Last Post
by seqadmin
03-03-2023, 12:03 PM
|