Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • strange SAM output

    Hi,

    I use tophat to run RNAseq project, the sequence is from solexa pair end .
    the below is part of one line in the result.

    ILLUMINA-7A0261_0001:3:77:8496:8665#0 161 chr1 554335 3 64M = 1395011 0

    from the SAM1 manual, I think the insert size is 840676(1395011-554335). but ISIZE(inferred Insert SIZE) is 0. how can explain it.




    zhidong

  • #2
    Can you try using other short read mapper such as bwa to see the insert size distribution? Or you can use the Bio:B::Sam perl modules to access the entire sam/bam, and infer the insert size distribution.

    Comment


    • #3
      thank you , macrowave.

      but I run RNAseq project. bwa seems not fit for mapping RNA sequence to Genome

      Comment


      • #4
        Can you provide more specific information, such as the fragment size, read length, the expected insert sizes, reference type (genome or transcriptome)? Form my experience, paired-end mapping with bowtie to predicted transcriptome yielded expected insert size distribution. By the way, BWA is perfectly fine for mRNA-Seq mapping to the genome, it's just harder to estimate insert size because of the introns, and you'll get weird inferred size distribution as the variable intron length. The Bio:B::Sam perl module has functions to access all proper mapped paired reads from sam/bam , from no matter which mapper you use. So it's a good idea to get all pairs in a region and see the real distribution.

        Comment


        • #5
          Just realized that your problem might be a bug in TopHat. In the newest TopHat release notes, they say 'TLEN field in SAM format is correctly output', which means you may be using an older release that doesn't output the isize correctly.

          Comment


          • #6
            and the sam flag 161 (1+32+128) means the paired reads mapped one forward, one reverse, but for some reason, the aligner thinks the pair isn't right (not properly aligned?), so that might be also a reason it returns a zero isize.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Exploring the Dynamics of the Tumor Microenvironment
              by seqadmin




              The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
              07-08-2024, 03:19 PM
            • seqadmin
              Exploring Human Diversity Through Large-Scale Omics
              by seqadmin


              In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
              06-25-2024, 06:43 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 05:49 AM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 07-15-2024, 06:53 AM
            0 responses
            23 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 07-10-2024, 07:30 AM
            0 responses
            36 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 07-03-2024, 09:45 AM
            0 responses
            204 views
            0 likes
            Last Post seqadmin  
            Working...
            X