Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • repinementer
    Member
    • Dec 2009
    • 80

    Weird TopHat - junction.bed output

    Does any one know about the (snapshot attched)

    blue-refseq
    green- cuff transcripts
    red- junctions

    1. exon junctions (junction.bed) overlapping none of the refseq exons or Cufflinks transcripts.

    2. exon junctions overlapping introns but not any cuff transcripts ?
    Attached Files
    Last edited by repinementer; 08-18-2010, 12:06 AM.
  • repinementer
    Member
    • Dec 2009
    • 80

    #2
    another snapshot

    another snapshot
    Attached Files

    Comment

    • Dethecor
      Member
      • May 2010
      • 24

      #3
      better resolution plz . . . can't see a damn thing on those snapshots :/

      "You are only young once, but you can stay immature indefinitely."

      Comment

      • repinementer
        Member
        • Dec 2009
        • 80

        #4
        hey

        I submitted a good resolution file but this website is showing them that. Anyways here are zipped pictures.
        Attached Files

        Comment

        • ECO
          --Site Admin--
          • Oct 2007
          • 1360

          #5
          Sorry about that resizing. I've turned it off, and am testing it now.

          Comment

          • john_mu
            Member
            • May 2010
            • 88

            #6
            I may help to also display the SAM file in IGV so you can see where the reads are aligning.
            SpliceMap: De novo detection of splice junctions from RNA-seq
            Download SpliceMap Comment here

            Comment

            • repinementer
              Member
              • Dec 2009
              • 80

              #7
              Thank you

              IGV is freezing just after uploading sam file. It would be great if you suggest any other browser so that I can put them along. So far as per discussion in seq answers IGV has a reputation going on.
              Last edited by repinementer; 08-18-2010, 06:21 PM.

              Comment

              • john_mu
                Member
                • May 2010
                • 88

                #8
                You should convert the SAM file to a BAM file for input into IGV.

                It does require a lot of memory if you have a large number of reads. That could be why it is freezing.
                SpliceMap: De novo detection of splice junctions from RNA-seq
                Download SpliceMap Comment here

                Comment

                • repinementer
                  Member
                  • Dec 2009
                  • 80

                  #9
                  still the same

                  I used coverage.wiggle files instaed of SAM/BAMfile to notify read density or coverage.

                  I came to a conlclusion TopHat producing many false positive because of the 32 bp read length?

                  I'm including other snapshots where junctions mapped to introns instead of exons and with out any read coverage.
                  Attached Files

                  Comment

                  • bgibb
                    Junior Member
                    • Jul 2010
                    • 7

                    #10
                    similar, even with --no-novel-juncs option

                    I've found a few unexpected junctions from TopHat, even when using the --no-novel-juncs option. Attached is an example. UCSC gene, mRNA and exon records were specified in a gff3 file (with the -G option). And I checked the gff3 file carefully to ensure there were no boundaries specified near the bottom two unexpected junctions.

                    blue -- RefSeq
                    red -- junctions reported by TopHat with --no-novel-junctions option

                    TopHat-1.0.14
                    Bowtie-0.12.5
                    76bp reads
                    Attached Files

                    Comment

                    • repinementer
                      Member
                      • Dec 2009
                      • 80

                      #11
                      Yes same observed in my analysis.

                      After supplying known ensemblegenes.gtf and ensemble_junctions and --no-novel-juncs I observed few cases where I have seen exactly you mentioned.
                      Seems it is still giving false postives by taking the reads into consideration.

                      BTW how did you directly uploaded sam file into IGV?

                      Comment

                      • bgibb
                        Junior Member
                        • Jul 2010
                        • 7

                        #12
                        SAM files and IGV

                        I had some problems with the Linux version of IGV, but the Windows version seems to work OK. I'm using the binary distribution, version 1.5.18.

                        After filtering, my accepted_hits.sam file size is 2.7 GB. I leave the SAM file header in place.

                        Comment

                        • Jim Robinson
                          Member
                          • May 2009
                          • 75

                          #13
                          Samfiles and IGV

                          Hi, I just found this thread while looking for something else. I can answer a few questions.

                          You can load a SAM file into IGV, but it needs to create an index first. It will do this automatically after a warning, or you can do it in advance with "igvtools". BAM is better, but SAM works.

                          I hate to hear about freezes. Recently I added a coverage depth limit, user settable, which has solved most freezing issues. Also, for large wig files please check out the "tdf" format, easily created from igvtools (command line or File > run igvtools...). Its basically a bigwig, developed before bigwig was available or I would have just used bigwig. Also, for bed files exceeding ~ 100,000 rows or so please consider indexing them, you will be happier. Again use igvtools for this (or tabix). If you use the binary and indexed formats you can load an really large dataset quickly, and without freezing.

                          Finally send bug reports and complaints to igv-help at broadinstitute.org. I'll answer them, but I don't always have time to monitor these forums unfortunately.

                          Comment

                          • repinementer
                            Member
                            • Dec 2009
                            • 80

                            #14
                            I tried index format but ntworking

                            I converted file.bam to to file.bam.bai by using samtools.
                            And i tried to open file.bam.bi index file using IGV. It throwing by saying SAM/BAM index files are not supported ?

                            Comment

                            • Jim Robinson
                              Member
                              • May 2009
                              • 75

                              #15
                              BAM files in IGV

                              Open the BAM file, not the index. Sorry if the message is confusing, I'll work on that.

                              To elaborate a little, samtools does not convert the bam file, it just creates an index in a separate file. The alignment data is still in the original file. IGV (and other tools like samtools tview) look for the index by naming convention, but you don't explicitly load it.
                              Last edited by Jim Robinson; 12-02-2010, 04:51 PM. Reason: clarification

                              Comment

                              Latest Articles

                              Collapse

                              • GATTACAT
                                Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                                by GATTACAT
                                Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                                07-01-2026, 11:43 AM
                              • SEQadmin2
                                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                                by SEQadmin2


                                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                                Here are nine questions we think about, in roughly the order they matter, before...
                                06-18-2026, 07:11 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by SEQadmin2, 07-02-2026, 11:08 AM
                              0 responses
                              7 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-30-2026, 05:37 AM
                              0 responses
                              12 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-26-2026, 11:10 AM
                              0 responses
                              20 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-17-2026, 06:09 AM
                              0 responses
                              54 views
                              0 reactions
                              Last Post SEQadmin2  
                              Working...