Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • tahamasoodi
    Success
    • May 2012
    • 130

    samtools variant calling for separate chromosomes

    Is there anyway to call variants separately for each chromosome (separate output files) using a single command in samtools. I am having whole genome data and when I annotate it using annovar, I'm getting a very big file which can not be loaded in excel, so I want a separate vcf file for each chromosome so that i can annotate them separately using annovar.

    Thanks,
    Thanks,
  • tahamasoodi
    Success
    • May 2012
    • 130

    #2
    There seems to be no response for my query.
    Thanks,

    Comment

    • tahamasoodi
      Success
      • May 2012
      • 130

      #3
      Can anybody help?
      Thanks,

      Comment

      • Khen
        Member
        • Mar 2012
        • 11

        #4
        There is an example in the manual.


        samtools mpileup -C50 -gf ref.fasta -r chr3:1,000-2,000 in1.bam in2.bam

        The '-r' argument allows you to specify the region.

        Comment

        • tahamasoodi
          Success
          • May 2012
          • 130

          #5
          This command will call variants separately for each chromosome thus we have to use 24 commands for 24 chromosomes. But, I want a single command to call variants for all the 24 chromosomes, each chromosome having a separate vcf file.
          Thanks,

          Comment

          • tahamasoodi
            Success
            • May 2012
            • 130

            #6
            Can this be done in samtools?
            Thanks,

            Comment

            • westerman
              Rick Westerman
              • Jun 2008
              • 1104

              #7
              No. Samtools will not output 24 individual files with a single command. Learn some shell scripting. It is easy enough to run 24 commands with a single shell command.

              Comment

              • tahamasoodi
                Success
                • May 2012
                • 130

                #8
                Then how can we view the WGS data as the single output vcf file of samtools after annotation in annovar is equal to around 900 MBs which cannot be viewed in excel?
                Thanks,

                Comment

                • westerman
                  Rick Westerman
                  • Jun 2008
                  • 1104

                  #9
                  Originally posted by tahamasoodi View Post
                  Then how can we view the WGS data as the single output vcf file of samtools after annotation in annovar is equal to around 900 MBs which cannot be viewed in excel?
                  That question does not make sense when looking at your previous comments in this threads. Basically, if you need a single file with all chromosome variants in it then samtools/bcftools/vcfutils will provide this. Or if you need 24 individual files one per chromosome then samtools will provide this ... however you will either need to type in the samtools command 24 times (with slightly different parameters) or you will need to create a shell script that will run samtools 24 times.

                  Your very first question was "... so I want a separate vcf file for each chromosome ..." to which Khen provided the answer. You then asked, somewhat confusingly, "... Can this be done in samtools?" to which I took the word "this" to mean "generate 24 VCF files" which samtools can not do in a single command.

                  Comment

                  • swbarnes2
                    Senior Member
                    • May 2008
                    • 910

                    #10
                    NGS simply makes too much data for a person to deal with if all the tools they have are mindlessly copied command lines and Excel. You have to learn shell scripting, or Perl, or Python, or something to handle large text files.

                    Comment

                    Latest Articles

                    Collapse

                    • GATTACAT
                      Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by GATTACAT
                      Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                      Yesterday, 11:43 AM
                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM
                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      06-02-2026, 10:05 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 06-30-2026, 05:37 AM
                    0 responses
                    11 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-26-2026, 11:10 AM
                    0 responses
                    18 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    52 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-09-2026, 11:58 AM
                    0 responses
                    111 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...