  • More problems with Tophat (v1.3.2)

    I ran TopHat version 1.3.2 to align my deep sequencing data to my reference genome with the following command:

    tophat --solexa1.3-quals -p 6 -o /control/8905X1/tophat /rice_index/rice_index /8905X1/8905X1.txt

    It generated a file called accepted_hits.bam.
    I converted that file to BED format with the bamToBed tool,
    then split the BED file by chromosome. Here are the resulting file sizes (in bytes):
    331723028 May 11 09:26 accepted_hits_bed_Chr1
    119325730 May 11 09:26 accepted_hits_bed_Chr10
    123994839 May 11 09:26 accepted_hits_bed_Chr11
    121898474 May 11 09:27 accepted_hits_bed_Chr12
    416137943 May 11 09:27 accepted_hits_bed_Chr2
    334052893 May 11 09:27 accepted_hits_bed_Chr3
    161529836 May 11 09:27 accepted_hits_bed_Chr4
    347228298 May 11 09:28 accepted_hits_bed_Chr5
    189465060 May 11 09:28 accepted_hits_bed_Chr6
    200695493 May 11 09:28 accepted_hits_bed_Chr7
    171194241 May 11 09:28 accepted_hits_bed_Chr8
    759976461 May 11 09:29 accepted_hits_bed_Chr9
    2184791 May 11 09:29 accepted_hits_bed_ChrSy
    539606 May 11 09:29 accepted_hits_bed_ChrUn
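
File sizes are only a rough proxy for read counts. A minimal sketch of counting BED records per chromosome directly, assuming tab-separated BED lines such as bamToBed produces; the function names and the toy records are made up for illustration and are not from the original post:

```python
from collections import Counter


def reads_per_chromosome(bed_lines):
    """Count BED records per chromosome (first tab-separated column)."""
    counts = Counter()
    for line in bed_lines:
        line = line.strip()
        # Skip blank lines and optional BED header lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        counts[line.split("\t", 1)[0]] += 1
    return counts


def chromosome_shares(counts):
    """Convert per-chromosome counts into fractions of all records."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {chrom: n / total for chrom, n in counts.items()}


# Toy records (hypothetical coordinates and read names).
example = [
    "Chr9\t100\t136\tread1\t255\t+",
    "Chr9\t150\t186\tread2\t255\t-",
    "Chr9\t900\t936\tread3\t255\t+",
    "Chr1\t500\t536\tread4\t255\t+",
]

counts = reads_per_chromosome(example)
shares = chromosome_shares(counts)
for chrom, share in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{chrom}\t{counts[chrom]}\t{share:.2%}")
```

On a real file, the same function can be fed an open file handle instead of a list. Dividing each count by the chromosome length would make a small chromosome with too many reads stand out more clearly.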

    TopHat placed far more reads on chromosome 9 than on any other chromosome, even though it is one of the smaller chromosomes.

    Unless this can be resolved, I recommend not using Tophat.

  • #2
    To all,
    I found the problem with my RNA-seq data. It looks like there was some rRNA contamination in my sample, which accounted for 5% of the total reads. The rRNA genes are located on chromosome 9. There are also some regions on chromosome 2 that show high homology to the rRNA genes on chromosome 9. This is why so many reads pile up on chromosomes 9 and 2.

    The latest version of Tophat works fine.

    If other people are having problems with reads overloading a chromosome, check to see if there is rRNA contamination.
    Thanks all.
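
One way to run the check suggested above is to measure what fraction of aligned reads overlaps annotated rRNA loci. This is only a sketch: it assumes BED-format alignments and a list of rRNA regions taken from your own annotation. The function name `rrna_fraction` and all coordinates below are hypothetical placeholders, not real rice rRNA positions.

```python
def rrna_fraction(bed_lines, rrna_regions):
    """Return the fraction of BED records overlapping any rRNA region.

    rrna_regions is a list of (chrom, start, end) tuples using BED-style
    0-based, half-open coordinates.
    """
    total = 0
    hits = 0
    for line in bed_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # header or malformed line
        chrom, start, end = fields[0], int(fields[1]), int(fields[2])
        total += 1
        # Half-open intervals overlap when each starts before the other ends.
        if any(chrom == c and start < e and s < end for c, s, e in rrna_regions):
            hits += 1
    return hits / total if total else 0.0


# Hypothetical rRNA locus and toy alignments, for illustration only.
regions = [("Chr9", 1000, 2000)]
alignments = [
    "Chr9\t1500\t1536\tread1\t255\t+",  # inside the region
    "Chr9\t5000\t5036\tread2\t255\t+",  # same chromosome, outside
    "Chr1\t1500\t1536\tread3\t255\t-",  # different chromosome
    "Chr2\t100\t136\tread4\t255\t+",
]

print(f"rRNA fraction: {rrna_fraction(alignments, regions):.2%}")
```

The linear scan over regions is fine for a handful of rRNA loci. For a large annotation, an interval index or a dedicated tool would be a better fit.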


    • #3
      With any RNA-seq data, you're almost always going to get rRNA contamination. It makes subsequent mapping quicker if you filter out the rRNA reads first (i.e. prior to any other mapping that is done).
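
A rough sketch of that pre-filtering step, assuming you have already aligned the reads against an rRNA reference with a tool of your choice and collected the IDs of the reads that hit it. The function name `filter_fastq` and the toy reads are hypothetical, and it assumes plain four-line FASTQ records:

```python
from itertools import islice


def filter_fastq(fastq_lines, rrna_read_ids):
    """Yield the lines of FASTQ records whose read ID is not in rrna_read_ids."""
    it = iter(fastq_lines)
    while True:
        record = list(islice(it, 4))  # header, sequence, '+', qualities
        if len(record) < 4:
            break  # end of input (or a truncated final record)
        header = record[0][1:].split()  # drop the leading '@'
        read_id = header[0] if header else ""
        if read_id not in rrna_read_ids:
            yield from record


# Toy input: read2 is assumed to have matched the rRNA reference.
fastq = [
    "@read1\n", "ACGT\n", "+\n", "IIII\n",
    "@read2\n", "GGCC\n", "+\n", "IIII\n",
    "@read3 extra info\n", "TTAA\n", "+\n", "IIII\n",
]
kept = list(filter_fastq(fastq, {"read2"}))
print("".join(kept), end="")
```

The filtered reads can then be written to a new file and passed to the aligner in place of the original input.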
