Header Leaderboard Ad

Collapse

Error "RG ID on SAMRecord not found in header" from Picard's MarkDuplicates.jar‏

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Error "RG ID on SAMRecord not found in header" from Picard's MarkDuplicates.jar‏

    I am trying the following command:

    java -Xmx4g -jar /home/picard-tools-1.35/MarkDuplicates.jar INPUT=$dir/input.bam OUTPUT=$dir/output.bam TMP_DIR=/home/temp METRICS_FILE=PCR_duplicates REMOVE_DUPLICATES=true ASSUME_SORTED=true VALIDATION_STRINGENCY=LENIENT

    And getting lots of errors like "RG ID on SAMRecord not found in header".

    The command is still running and I haven't seen output.bam coming out... From what I saw previously, it is most likely that I will end up getting no output when the command stops.

    I used BWA for reads mapping, then SAMtools for sorting reads and merge reads from different lanes or different Illumina flowcells. For each bam file, I used the corresponding flowcell name as the prefix, e.g., FC124_1_sorted.bam.

    Since I merged different flowcells from the same Illumina machine, I used "samtools merge -r". Based on SAMtools manual, "merge -r" can attach an RG tag to each alignment. The tag value is inferred from file names.

    I am running commands in a cluster node which has 4GB RAM and 10Tb space and I assume the memory and space should be enough for picard.

    Anyone knows how to fix the error "RG ID on SAMRecord not found in header"?


    Thanks

    -C

  • #2
    Is there a line describing the read groups in your SAM header? (you can view the header on a bam file using e.g. samtools view -H)

    Comment


    • #3
      I used picard AddOrReplaceReadGroups.jar on this problem.

      I think samtools also can add (actually, replace: by samtools reheader) read groups to your bam file.

      For example:
      java -Xmx4g -jar /home/lab/bin/picard/AddOrReplaceReadGroups.jar VALIDATION_STRINGENCY=LENIENT VERBOSITY=ERROR SO=unsorted INPUT=sample.cat.pic OUTPUT=sample.rga.bam RGPL=illumina RGLB=sm_1110 RGPU=barcode_1110 RGSM=sample_1110
      Any other ideas?
      Last edited by alexbmp; 11-10-2011, 02:20 AM.

      Comment


      • #4
        Use picard's MergeSam rather than samtools merge if the files have different headers.

        Comment


        • #5
          @ Chipper

          Yes, I also agree using picard MergeSam instead of samtools merge.

          However I found a case that picard MergeSam doesn't take many inputs (500 inputs) when options are default.

          So in that case I used samtools merge (the files were split; they ought to have the same header).

          Comment

          Latest Articles

          Collapse

          • seqadmin
            How RNA-Seq is Transforming Cancer Studies
            by seqadmin



            Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
            09-07-2023, 11:15 PM
          • seqadmin
            Methods for Investigating the Transcriptome
            by seqadmin




            Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

            Whole Transcriptome RNA-seq
            Whole transcriptome sequencing...
            08-31-2023, 11:07 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 09-22-2023, 09:05 AM
          0 responses
          21 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-21-2023, 06:18 AM
          0 responses
          14 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-20-2023, 09:17 AM
          0 responses
          14 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-19-2023, 09:23 AM
          0 responses
          29 views
          0 likes
          Last Post seqadmin  
          Working...
          X