Hej there,
I'm trying to detect gene fusion with rna-seq. The draft below is showing
an example of how reads spanning read and split reads can be used to detect
fusions. Counts of spanning reads and split reads are given by the tool that I
use, that works fine. Additionally, I would like to count the reads/fragments that
contradict the fusion.
That would be:
sum(R2/1 (only if R2/2 maps to GENEA but on the other side of the breakpoint) + R3/1(this reads contradict the fusion, no need that R3/2 maps to GENEA)
same could be calculated for GENEB, note that I'm dealing with rna-seq data and the breakpoint I'm showing is on rna level.
GENEA R1/1 -----``````````````````GENEB ------R1/2
```````R2/1 ----- ``R2/2-----
++++++++++++++|----------------````` -----------------|+++++++++++++++
``````````R3/1 ------ ```R3/2-----
``````````R4/1---`````````````````````````` ---R4/1 -------R4/2
legend: ++++ Exon, | breakpoint, - intron, ----R1/1 read1 pair1, R1/2 read 1 pair 2; R4/1 is a split read, half the read maps on GENEA the other half on GENEB, R1/1 and R1/2 are spanning reads whereas 1 pair maps to GENEA the other to GENEB,```this are just placeholder to show correct "alignment" of the reads
I was thinking about using bedtools or samtools view but I can't rely only on intervals, I would really need to check wether eg R2/1 maps "left" to the breakpoint of GENEA and R2/2 has to map "right" to the breakpoint which would be the next exon of GENEA.
Is there any tools out there that does that for me, or do I need to write something custom. Hope the draft is not too confusing, but I thought I give it a try. Thx for any help.
Best,
###############################
###############################
Edit:
I figured I explained this far too complicated. In fact all I need is a way to count read across splice-junctions. Let's say I have exon1 exon2 of GENE A. I want to count all reads that map across exon1 and exon2 + all reads were pair 1 maps on exon1 and pair2 maps on exon 2. That's it.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
Latest Articles
Collapse
-
by seqadmin
The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...-
Channel: Articles
11-06-2024, 07:24 PM -
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 11:09 AM
|
0 responses
23 views
0 likes
|
Last Post
by seqadmin
Today, 11:09 AM
|
||
Started by seqadmin, Today, 06:13 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
Today, 06:13 AM
|
||
Started by seqadmin, 11-01-2024, 06:09 AM
|
0 responses
30 views
0 likes
|
Last Post
by seqadmin
11-01-2024, 06:09 AM
|
||
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks
by seqadmin
Started by seqadmin, 10-30-2024, 05:31 AM
|
0 responses
21 views
0 likes
|
Last Post
by seqadmin
10-30-2024, 05:31 AM
|