Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • david.tamborero
    Member
    • Feb 2011
    • 60

    Error during PicardMarkDuplicates (Illumina paired-ends mapped by Bfast)

    I mapped one run of Illumina paired end reads by using Bfast (bwaaln + localalign + postprocess), then converted it to bam, performed the Picard AddOrReplaceReadGroups command plus Samtools sort/index and therefore, when executing Picard MarkDuplicates, I have the following error:

    Code:
    Exception in thread "main" net.sf.picard.PicardException: Value was put into PairInfoMap more than once.  3: grouped_reads:HWUSI-EAS1692_0001:1:1:1841:8839#0
        at net.sf.picard.sam.CoordinateSortedPairInfoMap.ensureSequenceLoaded(CoordinateSortedPairInfoMap.java:124)
        at net.sf.picard.sam.CoordinateSortedPairInfoMap.remove(CoordinateSortedPairInfoMap.java:78)
        at net.sf.picard.sam.DiskReadEndsMap.remove(DiskReadEndsMap.java:61)
        at net.sf.picard.sam.MarkDuplicates.buildSortedReadEndLists(MarkDuplicates.java:285)
        at net.sf.picard.sam.MarkDuplicates.doWork(MarkDuplicates.java:113)
        at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:157)
        at net.sf.picard.sam.MarkDuplicates.main(MarkDuplicates.java:97)
    I've noticed that the reads outputted by the bfast-postprocess looks like HWUSI-EAS1692_0001:2:1:1025:14826#0, whereas the original reads indicates the pair-end by either HWUSI-EAS1692_0001:2:1:1025:14826#0/1 or HWUSI-EAS1692_0001:2:1:1025:14826#0/2, I guess it is related, but I do not know how to solve this.

    Any help will be appreciated, many thanks in advance!

    David
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #2
    Could you create a test case with only the offending reads (in SAM format)?

    Comment

    • david.tamborero
      Member
      • Feb 2011
      • 60

      #3
      I've created a file just with the reads pointed out in the MarkDuplicates error, and the same message error appears when running the Picard command. Also, I've created a SAM file with only these reads removed and the same error appears but with another read ID. I would say that MarkDuplicates does not allow more than one mapped position per read in the input file, therefore, I've forced just an unique alignement per read in the bfast postprocess command by changing the value of the "-a" argument:

      previously, the command was:

      Code:
      bfast postprocess -i  input.baf -f hg19.fa -a 0 -A 0 -R -O 1 -n 4 -U >  output.sam
      I changed it to:

      Code:
      bfast postprocess -i  input.baf -f hg19.fa -a 3 -z -A 0 -R -O 1 -n 4 -U >  output.sam
      Now the Picard MarkDuplicates command works fine. Hope it makes sense.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        New Genomics Tools and Methods Shared at AGBT 2025
        by seqadmin


        This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

        The Headliner
        The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
        03-03-2025, 01:39 PM
      • seqadmin
        Investigating the Gut Microbiome Through Diet and Spatial Biology
        by seqadmin




        The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
        02-24-2025, 06:31 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 03-20-2025, 05:03 AM
      0 responses
      17 views
      0 reactions
      Last Post seqadmin  
      Started by seqadmin, 03-19-2025, 07:27 AM
      0 responses
      18 views
      0 reactions
      Last Post seqadmin  
      Started by seqadmin, 03-18-2025, 12:50 PM
      0 responses
      19 views
      0 reactions
      Last Post seqadmin  
      Started by seqadmin, 03-03-2025, 01:15 PM
      0 responses
      186 views
      0 reactions
      Last Post seqadmin  
      Working...