Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • samtools variant calling for separate chromosomes

    Is there anyway to call variants separately for each chromosome (separate output files) using a single command in samtools. I am having whole genome data and when I annotate it using annovar, I'm getting a very big file which can not be loaded in excel, so I want a separate vcf file for each chromosome so that i can annotate them separately using annovar.

    Thanks,
    Thanks,

  • #2
    There seems to be no response for my query.
    Thanks,

    Comment


    • #3
      Can anybody help?
      Thanks,

      Comment


      • #4
        There is an example in the manual.

        http://samtools.sourceforge.net/samtools.shtml
        samtools mpileup -C50 -gf ref.fasta -r chr3:1,000-2,000 in1.bam in2.bam

        The '-r' argument allows you to specify the region.

        Comment


        • #5
          This command will call variants separately for each chromosome thus we have to use 24 commands for 24 chromosomes. But, I want a single command to call variants for all the 24 chromosomes, each chromosome having a separate vcf file.
          Thanks,

          Comment


          • #6
            Can this be done in samtools?
            Thanks,

            Comment


            • #7
              No. Samtools will not output 24 individual files with a single command. Learn some shell scripting. It is easy enough to run 24 commands with a single shell command.

              Comment


              • #8
                Then how can we view the WGS data as the single output vcf file of samtools after annotation in annovar is equal to around 900 MBs which cannot be viewed in excel?
                Thanks,

                Comment


                • #9
                  Originally posted by tahamasoodi View Post
                  Then how can we view the WGS data as the single output vcf file of samtools after annotation in annovar is equal to around 900 MBs which cannot be viewed in excel?
                  That question does not make sense when looking at your previous comments in this threads. Basically, if you need a single file with all chromosome variants in it then samtools/bcftools/vcfutils will provide this. Or if you need 24 individual files one per chromosome then samtools will provide this ... however you will either need to type in the samtools command 24 times (with slightly different parameters) or you will need to create a shell script that will run samtools 24 times.

                  Your very first question was "... so I want a separate vcf file for each chromosome ..." to which Khen provided the answer. You then asked, somewhat confusingly, "... Can this be done in samtools?" to which I took the word "this" to mean "generate 24 VCF files" which samtools can not do in a single command.

                  Comment


                  • #10
                    NGS simply makes too much data for a person to deal with if all the tools they have are mindlessly copied command lines and Excel. You have to learn shell scripting, or Perl, or Python, or something to handle large text files.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Genetic Variation in Immunogenetics and Antibody Diversity
                      by seqadmin



                      The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
                      11-06-2024, 07:24 PM
                    • seqadmin
                      Choosing Between NGS and qPCR
                      by seqadmin



                      Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
                      10-18-2024, 07:11 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 11-08-2024, 11:09 AM
                    0 responses
                    140 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 11-08-2024, 06:13 AM
                    0 responses
                    116 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 11-01-2024, 06:09 AM
                    0 responses
                    70 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 10-30-2024, 05:31 AM
                    0 responses
                    25 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X