
**lh3** · 07-25-2013, 05:20 AM

Try "gi|358485511|ref|NC_006088.3|:100-1000", though I don't know if that works.

The best solution is to replace the sequence names with something easier.
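A minimal sketch of the quoting suggested above. It assumes an indexed BAM named `file.bam`, which is a placeholder; only the region string comes from the thread. Without the quotes, the shell would read each `|` as a pipe and split the command.

```shell
# Keep the region in single quotes so the shell passes the pipes through literally.
region='gi|358485511|ref|NC_006088.3|:100-1000'

# file.bam is a placeholder name; the query only runs if samtools and the file exist.
if command -v samtools >/dev/null 2>&1 && [ -f file.bam ]; then
  # Double quotes around the variable keep the whole region as one argument.
  samtools view file.bam "$region"
fi
```

Single quotes are the safer choice here because nothing inside them is interpreted by the shell.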

**rkizen** · 07-25-2013, 05:29 AM

Thanks lh3! That works. I should change the sequence names, but I needed this for something urgent. There are a lot of sequence names, and it would take some time to make sure a renaming script was not corrupting the file in any way. Thank you again; I am kicking myself for not using quotation marks to get past the pipes in the sequence names.

**dpryan** · 07-25-2013, 05:45 AM

The reheader script can simply be "samtools reheader"

Do a "samtools view -H file.bam > header.sam", edit the header.sam file by replacing the chromosome names (they need to be in the same order as the original, so don't reorder anything!), then use samtools reheader. Unless your alignments are to a large number of contigs, this should be quite quick.
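The steps described above could look roughly like this. The file names and the `simplify_names` helper are hypothetical, and the `sed` pattern assumes every name follows the `gi|<number>|ref|<accession>|` form seen earlier in the thread; adjust it if your names differ.

```shell
# Hypothetical helper: rewrite "SN:gi|<number>|ref|<accession>|" to "SN:<accession>"
# on @SQ lines only; every other header line passes through unchanged.
simplify_names() {
  sed -E '/^@SQ/ s/SN:gi\|[0-9]+\|ref\|([^|[:space:]]+)\|/SN:\1/'
}

# file.bam is a placeholder; the commands only run if samtools and the file exist.
if command -v samtools >/dev/null 2>&1 && [ -f file.bam ]; then
  samtools view -H file.bam > header.sam              # dump the header
  simplify_names < header.sam > header.renamed.sam    # rename in place, order untouched
  samtools reheader header.renamed.sam file.bam > file.renamed.bam
fi
```

Because the substitution edits each `@SQ` line where it stands, the sequence order is preserved. The new BAM needs its own index (`samtools index`) before region queries will work on it.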

**rkizen** · 07-25-2013, 05:59 AM

Thanks dpryan!

I have not used reheader before. Will it replace all the sequence names, or will it just replace the header info?

There are a lot of contigs, and there are many mappings which may not have the same contigs represented. I will have to write a script to run "samtools view -H file.bam > header.sam" for each mapping, replace the sequence names in each header SAM file, and apply those back onto the original BAM file using samtools reheader.

For now I just need some intervals quickly, so I will use lh3's method to get these intervals and then work on converting the headers later.
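A rough sketch of the per-mapping script described above, assuming all BAM files sit in one directory and use the `gi|<number>|ref|<accession>|` naming seen in this thread. The function name `reheader_all` and the `.renamed.bam` suffix are made up for illustration.

```shell
# Hypothetical batch helper: write a renamed copy of every BAM in the given directory.
reheader_all() {
  dir=$1
  for bam in "$dir"/*.bam; do
    [ -f "$bam" ] || continue                          # glob matched nothing
    case $bam in *.renamed.bam) continue ;; esac       # skip earlier outputs
    out="${bam%.bam}.renamed.bam"
    hdr=$(mktemp)
    # Each file gets its own header, so mappings with different contigs are fine.
    samtools view -H "$bam" \
      | sed -E '/^@SQ/ s/SN:gi\|[0-9]+\|ref\|([^|[:space:]]+)\|/SN:\1/' > "$hdr"
    samtools reheader "$hdr" "$bam" > "$out"
    rm -f "$hdr"
  done
}

# Example call (the directory name is a placeholder):
# reheader_all ./mappings
```

Writing to a new file leaves the originals untouched, which makes it easy to compare `samtools view -H` output before and after.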

**dpryan** · 07-25-2013, 06:03 AM

If you have a BAM file, the chromosome/contig names are only stored in the header (there's an ordered list, and each read just has a numeric value saying where in that list its chromosome name can be found). Yeah, if you have a bunch of different mappings, then following what Heng Li suggested might prove faster.

**lh3** · 07-25-2013, 02:51 PM

@dpryan your reheader is better in the long run. Hadn't thought about that...

Samtools view bam parse region issues
