Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • aligning nucleoplasm fraction RNA

    Hi, this is my first post, so I hope I don't miss any details.

    I'm trying to align RNA sequenced from the subcellular nucleoplasm fraction using TopHat2. The command I'm using is something like this:

    Code:
    tophat2 --num-threads 5 -g 1 --library-type fr-secondstrand -a 5 -r 3000 -o ./Nucleop_untreated_rep1_topHat /GenoStorage/Genomas/hg19/Genome_indexFiles/Bowtie2/hg19 NucleopRNA_1_1.fastq,NucleopRNA_2_1.fastq NucleopRNA_1_2.fastq,NucleopRNA_2_2.fastq
    I used -r 3000 because previously I had to align chromatin fraction RNA and that was the only way to get the reads properly paired.

    After aligning, my align_summary.txt file looks like this:
    Left reads:
    Input: 23788444
    Mapped: 22989305 (96.6% of input)
    of these: 148191 ( 0.6%) have multiple alignments (148191 have >1)
    Right reads:
    Input: 23788444
    Mapped: 22863972 (96.1% of input)
    of these: 151161 ( 0.7%) have multiple alignments (151161 have >1)
    96.4% overall read alignment rate.

    Aligned pairs: 22400527
    of these: 4825 ( 0.0%) have multiple alignments
    and: 8473407 (37.8%) are discordant alignments
    58.5% concordant pair alignment rate.

    Because of the high % of discordant alignments, I used samtools flagstat to check for any issues:
    46399134 + 0 in total (QC-passed reads + QC-failed reads)
    0 + 0 duplicates
    46399134 + 0 mapped (100.00%:-nan%)
    46399134 + 0 paired in sequencing
    23308173 + 0 read1
    23090961 + 0 read2
    19935278 + 0 properly paired (42.96%:-nan%)
    45487286 + 0 with itself and mate mapped
    911848 + 0 singletons (1.97%:-nan%)
    11390956 + 0 with mate mapped to a different chr
    11390956 + 0 with mate mapped to a different chr (mapQ>=5)

    So it seems I get many read pairs with reads in different chromosomes.
    I have tried varying some paremeters (library type, -r) but had no success.
    I should also mention that this happens for 2 different experiments (with 2 different cell types), in a total of 4 datasets (one fore one, three for the other).
    I also looked at other (rare) cases where subcellular fraction RNA was aligned, but they didn't report anything like this.

    Thanks in advance to anyone who can help

Latest Articles

Collapse

  • seqadmin
    Latest Developments in Precision Medicine
    by seqadmin



    Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

    Somatic Genomics
    “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
    05-24-2024, 01:16 PM
  • seqadmin
    Recent Advances in Sequencing Analysis Tools
    by seqadmin


    The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
    05-06-2024, 07:48 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 05-24-2024, 07:15 AM
0 responses
13 views
0 likes
Last Post seqadmin  
Started by seqadmin, 05-23-2024, 10:28 AM
0 responses
17 views
0 likes
Last Post seqadmin  
Started by seqadmin, 05-23-2024, 07:35 AM
0 responses
21 views
0 likes
Last Post seqadmin  
Started by seqadmin, 05-22-2024, 02:06 PM
0 responses
10 views
0 likes
Last Post seqadmin  
Working...
X