Header Leaderboard Ad

Collapse

Truncated File: Trouble with SRA > Fastq > Sam

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • DPCook
    started a topic Truncated File: Trouble with SRA > Fastq > Sam

    Truncated File: Trouble with SRA > Fastq > Sam

    Hi everyone,

    I know there have been several posts regarding truncated SAM files, but I'm having a difficult time finding a solution.

    I've converted an SRA file to FASTQ with fastq-dump:
    Code:
    fastq-dump [SRA] [Output-prefix]
    QC/Trimmed the FASTQ,

    Bowtie to align (SAM output):
    Code:
    bowtie -n 2 -k 1 -l 40 --best -m 1 -S [genome] [FASTQ] [SAM output]
    But when I try to convert sam to bam
    Code:
    $ samtools view -Sb [.SAM] > [.BAM]
    [W::sam_read1] parse error at line 26
    [main_samview] truncated file.
    So I looked at line 26 and it does appear odd (bottom line):
    Code:
    @HD	VN:1.0	SO:unsorted
    @SQ	SN:chr1	LN:197195432
    @SQ	SN:chr2	LN:181748087
    @SQ	SN:chr3	LN:159599783
    @SQ	SN:chr4	LN:155630120
    @SQ	SN:chr5	LN:152537259
    @SQ	SN:chr6	LN:149517037
    @SQ	SN:chr7	LN:152524553
    @SQ	SN:chr8	LN:131738871
    @SQ	SN:chr9	LN:124076172
    @SQ	SN:chr10	LN:129993255
    @SQ	SN:chr11	LN:121843856
    @SQ	SN:chr12	LN:121257530
    @SQ	SN:chr13	LN:120284312
    @SQ	SN:chr14	LN:125194864
    @SQ	SN:chr15	LN:103494974
    @SQ	SN:chr16	LN:98319150
    @SQ	SN:chr17	LN:95272651
    @SQ	SN:chr18	LN:90772031
    @SQ	SN:chr19	LN:61342430
    @SQ	SN:chrX	LN:166650296
    @SQ	SN:chrY	LN:15902555
    @SQ	SN:chrM	LN:16299
    @PG	ID:Bowtie	VN:1.1.1	CL:"bowtie --wrapper basic-0 -n 2 -k 1 -l 40 --best -m 1 -S /Users/DPC/Documents/Genome_Indexes/mm9.ebwt/mm9 /Volumes/My Passport/Hi-C/ChIP-seq-files/Ezh2/EZH2-trimmed.fastq /Volumes/My Passport/Hi-C/ChIP-seq-files/Ezh2/EZH2.sam"
    EZH2-SRR579282.1	4	*	0	0	*	*	0	0	TAGTTTGATTCACCCAAGGGTGAATGGGGGAACAAGAAAG	F>BACFHAHIEHFGII@FHGEE;9CFAHH)8??FFH;BDH	XM:i:0
    X¢˚˚p¢˚˚D@X¢˚˚p¢˚˚D@X¢˚˚p¢˚˚D@EZH2-SRR579282.5	4	*	0	0	*	*	0	0	GCTAAAACCATCCAGTGGAAGAAAAACAGCATTTTCAACA	HHHHIIIIIIHIIIHHHIEGHEGIIFIIGAGHIIGFGHII	XM:i:1
    I'm not quite sure what's going on here. I've had it happen with a couple of SAM files generated from SRA > Fastq, so I'm thinking it may be coming from the fastq-dump step.

    Any insight or help would be greatly appreciated,

    David

Latest Articles

Collapse

  • seqadmin
    How RNA-Seq is Transforming Cancer Studies
    by seqadmin



    Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
    09-07-2023, 11:15 PM
  • seqadmin
    Methods for Investigating the Transcriptome
    by seqadmin




    Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

    Whole Transcriptome RNA-seq
    Whole transcriptome sequencing...
    08-31-2023, 11:07 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 09:05 AM
0 responses
13 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-21-2023, 06:18 AM
0 responses
10 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-20-2023, 09:17 AM
0 responses
12 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-19-2023, 09:23 AM
0 responses
27 views
0 likes
Last Post seqadmin  
Working...
X