Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • mate alignment error when running ValidateSamFile in Picard

    Dear All
    I am trying to identify snps using GATK tool. Their manual suggested that I shall try to validate my bam files before entering the analytic pipeline.

    My samples are 100 nucleotides long (first 3 sites trimmed) paired reads.

    I generated bam files for each sample using tophat 1.33 and I sorted each bam file (one file per sample) using picard ReorderSam.jar,

    After that I added read group information using Picard AddOrReplaceReadGroups.jar.

    Then I ran a ValidateSamFile.jar to check whether these bam files are suitable in GATK pipeline. However, I got a lot of errors like this.

    ERROR: Record 21368, Read name HWI-ST978:1370AHMACXX:5:1308:13890:200017, Mate alignment does not match alignment start of mate
    ERROR: Record 21368, Read name HWI-ST978:1370AHMACXX:5:1308:13890:200017, Mate negative strand flag does not match read negative strand flag of mate
    ERROR: Record 21377, Read name HWI-ST978:1370AHMACXX:5:1308:13890:200017, Mate alignment does not match alignment start of mate
    ERROR: Record 21377, Read name HWI-ST978:1370AHMACXX:5:1308:13890:200017, Mate negative strand flag does not match read negative strand flag of mate
    ERROR: Record 21477, Read name HWI-ST978:1370AHMACXX:5:1306:2629:199972, Mate alignment does not match alignment start of mate
    ERROR: Record 21477, Read name HWI-ST978:1370AHMACXX:5:1306:2629:199972, Mate negative strand flag does not match read negative strand flag of mate
    ERROR: Record 21481, Read name HWI-ST978:1370AHMACXX:5:1306:2629:199972, Mate alignment does not match alignment start of mate
    ERROR: Record 21481, Read name HWI-ST978:1370AHMACXX:5:1306:2629:199972, Mate reference index (MRNM) does not match reference index of mate
    ERROR: Record 21490, Read name HWI-ST978:1370AHMACXX:5:2108:3190:30134, Mate alignment does not match alignment start of mate
    ERROR: Record 21490, Read name HWI-ST978:1370AHMACXX:5:2108:3190:30134, Mate negative strand flag does not match read negative strand flag of mate
    Can anyone tell me where the problem is? Why do I have so many unmatched pairs identified at this step. Should I delete them before the analysis?

    Thanks a lot.

  • #2
    I am seeing the same errors. Does anyone have any insight into this?

    Comment


    • #3
      I also encounter this problem, and now I am using the picard/FixMateInformation.jar to fix it, maybe you can try

      Comment


      • #4
        I am also seeing these errors. I tried using Picard's FixMateInformation, but it did not resolve the errors. They seem to be appearing after I use Picard's MarkDuplicates. Is anyone else having this problem/know how to fix it?

        Comment


        • #5
          Originally posted by tatumdmortimer View Post
          I am also seeing these errors. I tried using Picard's FixMateInformation, but it did not resolve the errors. They seem to be appearing after I use Picard's MarkDuplicates. Is anyone else having this problem/know how to fix it?
          I ValidateSamFile before MarkDuplicates, these errors still occur.

          Has anyone solved this problem?

          Comment


          • #6
            Same

            I have the exact same problem tat I can't solve ...
            Maybe it have a link but I also have an exception when trying the CollectGcBiasMetrics command

            Code:
            Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 142549452
            	at net.sf.picard.analysis.CollectGcBiasMetrics.doWork(CollectGcBiasMetrics.java:152)
            	at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:179)
            	at net.sf.picard.analysis.CollectGcBiasMetrics.main(CollectGcBiasMetrics.java:95)

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Recent Advances in Sequencing Analysis Tools
              by seqadmin


              The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
              05-06-2024, 07:48 AM
            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 06:57 AM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-06-2024, 07:17 AM
            0 responses
            16 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-02-2024, 08:06 AM
            0 responses
            19 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-30-2024, 12:17 PM
            0 responses
            24 views
            0 likes
            Last Post seqadmin  
            Working...
            X