**gringer** · 10-27-2011, 07:39 AM

If you have a BAM file (which bowtie can produce), this tutorial suggests that GBrowse can handle BAM directly:

GBrowse NGS Tutorial - GMOD

http://gmod.org/wiki/GBrowse_NGS_Tutorial#Tell_GBrowse_About_the_SAM_Files

You could also have a look at the admin tutorial:

GBrowse Administration Tutorial

http://gmod.org/gbrowse2/tutorial/tutorial.html#graph

**gringer** · 10-27-2011, 09:33 AM

FWIW, I've just today ended up writing a Python script to convert SAM/BAM files to GFF3 files using pysam. The code may be useful for you if you can't find anything else suitable for your conversion.

For a bit of context, I broke up genomic contigs into 100bp fragments, naming the sequences <contig>#<start>-<end>, then used bowtie2 to map them to the genome -- that's why I've got the 'read.qname.find' bits in the code. My contigs started with 'v', which bowtie replaced with 'N' for some odd reason, so I had to do a bit of extra fiddling to add the 'v' back in.
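As a rough illustration of that naming scheme (this is not the original fragmenting script; the function names `fragment_contig` and `parse_fragment_name` and the 1-based inclusive coordinates are assumptions made for the example):

```python
def fragment_contig(name, seq, size=100):
    """Yield (fragment_name, fragment_seq) pairs named <contig>#<start>-<end>.

    Coordinates are assumed to be 1-based and inclusive.
    """
    for offset in range(0, len(seq), size):
        piece = seq[offset:offset + size]
        start = offset + 1
        end = offset + len(piece)  # the last fragment may be shorter than size
        yield "%s#%d-%d" % (name, start, end), piece


def parse_fragment_name(qname):
    """Split '<contig>#<start>-<end>' back into (contig, start, end)."""
    # Split on the last '#' first so a '-' in the contig name is harmless.
    contig, _, span = qname.rpartition("#")
    start, _, end = span.partition("-")
    return contig, int(start), int(end)
```

Splitting on `#` before looking for `-` avoids misparsing contig names that themselves contain a hyphen, which a plain `find("-")` on the whole name would trip over.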

Here's the relevant part of my code which does the SAM->GFF conversion:

Code:

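import csv
import os
import sys

import pysam

# samFileName is assumed to be set earlier in the script (e.g. from sys.argv)
samFile = pysam.Samfile(samFileName, "r")
totalCount = 0
sys.stderr.write("Getting reads from SAM file...")
sys.stdout.write("##gff-version 3\n")
gffWriter = csv.writer(sys.stdout, delimiter='\t', lineterminator='\n')
for read in samFile:
    totalCount += 1
    # tid is -1 for unmapped reads; 0 is a valid index (the first reference)
    if read.tid >= 0:
        # query names look like <contig>#<start>-<end>; restore the leading 'v'
        qContig = 'v' + read.qname[1:read.qname.find("#")]
        qContigStart = read.qname[read.qname.find("#")+1:read.qname.find("-")]
        qContigEnd = read.qname[read.qname.find("-")+1:]
        tContig = samFile.getrname(read.tid)
        strand = "-" if read.is_reverse else "+"
        score = "."
        for tag in read.tags:
            if tag[0] == "XS":
                score = str(tag[1]+1000)
        # GFF3 attributes are separated by ';' and Target coordinates are
        # 1-based, so the 0-based read.pos is shifted by one (read.aend is
        # already one past the last aligned base, i.e. the 1-based end)
        gffWriter.writerow((qContig,
                            "sam2gff3-"+os.path.basename(samFileName),
                            "nucleotide_match", qContigStart, qContigEnd,
                            score, strand, ".",
                            "Name=SAM_%s;ID=%s-%s;Target=%s %d %d" %
                            (tContig, qContig, tContig, tContig,
                             read.pos + 1, read.aend)))
    if totalCount % 100000 == 0:
        sys.stderr.write(".")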
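For anyone adapting this, each output row is just the nine tab-separated GFF3 columns (seqid, source, type, start, end, score, strand, phase, attributes). Here is a pysam-free sketch of formatting one such line; the helper name `gff3_line` and all the example values are made up for illustration, and it makes no attempt to escape reserved characters:

```python
def gff3_line(seqid, source, ftype, start, end, score=".", strand=".",
              phase=".", attributes=None):
    """Format one GFF3 feature line from its nine columns.

    `attributes` is a list of (key, value) pairs, written as 'key=value'
    and joined with ';'. Reserved characters are not escaped.
    """
    attr_text = ";".join("%s=%s" % (k, v) for k, v in (attributes or []))
    columns = [seqid, source, ftype, str(start), str(end),
               str(score), strand, phase, attr_text or "."]
    return "\t".join(columns)


# Hypothetical example values, not taken from real data
line = gff3_line("v12", "sam2gff3-example.sam", "nucleotide_match",
                 101, 200, strand="+",
                 attributes=[("ID", "v12-chr1"), ("Name", "SAM_chr1"),
                             ("Target", "chr1 5001 5100")])
print(line)
```

Building the attribute column from key/value pairs makes it harder to mix up the `;` separator between attributes with the `,` that GFF3 reserves for multiple values of a single attribute.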

**ykdang** · 11-02-2011, 12:14 PM

Thanks a lot, that could be very useful.

How to generate GFF?
