vcftools can do this, here is a command
vcftools --chr chr1 --vcf myfile.vcf --recode --recode -INFO-all --out myfile.chr1.vcf
check your vcf file first to assure how chromosome number has been marked. (chr1 or 1) use that in command line.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
You can also use BBMap's filtervcf.sh like this:
Code:filtervcf.sh in=original.vcf out=varCHR.vcf contigs=Chr01,Chr02,Chr03,Chr04,Chr05,Chr06,Chr07,Chr08,Chr09,Chr10
Leave a comment:
-
Keep all chromosomes and remove scaffolds
Hello,
I am trying to do something similar, I want to keep information for 10 chromosomes (Chr01-10) and remove the scaffold SNP information in my vcf file.
This is the code I have run
vcftools --vcf original.vcf --out varCHR.vcf --chr Chr[01-10]
And I receive the following
After filtering, kept 2 out of 2 Individuals
After filtering, kept 0 out of a possible 166402 Sites
No data left for analysis!
Run Time = 0.00 seconds
Can anyone help me with my problem?
Thank you
Leave a comment:
-
yes you right,
but vcf tool was not giving output.
so i tried with tabix
tabix -h file_name 21
it vcf file of chr 21 but i am not sure about its ending statement.
is it right to extract by this command?
Leave a comment:
-
I think you have the opposite problem as the original poster. They used --chr 1 but the vcf used chr1 instead of 1 to describe the chromosome number. You are using --chr chr21 but it looks like ExAC uses 21 instead of chr21 to describe the chromosome number.
#CHROM POS ID REF ALT QUAL FILTER INFO
1 13372 . G C 608.91 PASS
Leave a comment:
-
i have applied this vcftools but it showing following program
My input command :
vcftools --gzvcf ExAC.r0.3.1.sites.vep.vcf.gz --chr chr21 --out chr21 --recode
And error is showing like this:
VCFtools - UNKNOWN
(C) Adam Auton and Anthony Marcketta 2009
Parameters as interpreted:
--gzvcf ExAC.r0.3.1.sites.vep.vcf.gz
--chr chr21
--out chr21
--recode
Using zlib version: 1.2.8
After filtering, kept 0 out of 0 Individuals
Outputting VCF file...
After filtering, kept 0 out of a possible 0 Sites
File does not contain any sites
Run Time = 0.00 seconds
I am getting all these things as log file as output?
Could anyone please help me to split the chromosome21.vcf from my input file with proper command line discription?
I have looked and tried all vcf command and and even gatk too?
Leave a comment:
-
You may have to put '--chr chr1' instead of '--chr 1'. Your vcf file has chromosoms marked as 'chr$number'.
Good luck
Leave a comment:
-
extract a chromosome with vcftools
I tried to extract Chr 1 data and recode a new vcf using the example in the basic documentation and got no data out. All I want to do is break the large input file into small files of one chromosome each.
this is the log file:
VCFtools - v0.1.10
(C) Adam Auton 2009
Parameters as interpreted:
--gzvcf /Users/williamcassano/Desktop/Variations/PG0000652-BLD.genome.block.anno.vcf.gz
--chr 1
--out chr1
--recode
Using zlib version: 1.2.5
Reading Index file.
File contains 128054613 entries and 1 individuals.
Filtering by chromosome.
Chromosome: chr18
Chromosome: chr3
Chromosome: chr20
Chromosome: chr17
Chromosome: chr8
Chromosome: chr19
Chromosome: chr15
Chromosome: chr10
Chromosome: chr12
Chromosome: chr16
Chromosome: chrX
Chromosome: chr14
Chromosome: chr5
Chromosome: chr22
Chromosome: chr2
Chromosome: chr6
Chromosome: chr7
Chromosome: chr1
Chromosome: chr9
Chromosome: chr13
Chromosome: chr21
Chromosome: chrM
Chromosome: chrY
Chromosome: chr4
Chromosome: chr11
Keeping 0 entries on specified chromosomes.
Applying Required Filters.
After filtering, kept 1 out of 1 Individuals
After filtering, kept 0 out of a possible 0 Sites
Error:No data left for analysis!Tags: None
Latest Articles
Collapse
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
Nobel Prize for MicroRNA Discovery
This week,...-
Channel: Articles
10-07-2024, 08:07 AM -
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 10-11-2024, 06:55 AM
|
0 responses
12 views
0 likes
|
Last Post
by seqadmin
10-11-2024, 06:55 AM
|
||
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
110 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
114 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
1 response
121 views
0 likes
|
Last Post
by EmiTom
10-07-2024, 06:46 AM
|
Leave a comment: