Hi All,
I huave a big list of SRA accession numbers (e.g. SRR651693, ..etc). I used fastq-dump to download all these accessions and convert to directly to fastq (without the need to store sra in the disk).
However, I Noticed that for some of the files, the number of reads is different than the number of reads for that file in the NCBI website, which means that the download operation for this dataset was incomplete.
My question is:
is there any validation tool within sra toolkit that I can call (within a script to iterate through fastq files) so it compares the downloaded fastq file with the metadata in the website?
(I know there vdb-validate, but it works only for sra files not for fastq files).
I huave a big list of SRA accession numbers (e.g. SRR651693, ..etc). I used fastq-dump to download all these accessions and convert to directly to fastq (without the need to store sra in the disk).
However, I Noticed that for some of the files, the number of reads is different than the number of reads for that file in the NCBI website, which means that the download operation for this dataset was incomplete.
My question is:
is there any validation tool within sra toolkit that I can call (within a script to iterate through fastq files) so it compares the downloaded fastq file with the metadata in the website?
(I know there vdb-validate, but it works only for sra files not for fastq files).
Comment