Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • TopHat output and USCS genome browser question

    Hi folk! This is my first post and I'm quite new to next-gen sequencing analysis, so please be kind

    Here is some background on my data:
    mRNA from rat heart tissue was isolated and sequenced using a Heliscope from Helicos. I then converted the sms file generated by Helicos to a fasta file using a script in the Helisphere software. From there I used TopHat to look for splice junctions.

    From what I can see, everything went fine. The TopHat output includes all the files that I would expect it to include. Now, when I tried to view the .BED and .WIG files using the UCSC genome browser I have problems.

    Using the input found here --> http://genome.ucsc.edu/cgi-bin/hgGateway I tried to view my files after selecting genome->rat and the most recent assembly. I try to upload my .BED file and received the following error:
    "Error File 'junctions.bed' - Unrecognized format line 2 of custom track: gi|34868215|ref|NW_047355.1|Rn11_WGA1875_4 1124643 1138858 JUNC00000001 2 - 1124643 1138858 255,0,0 2 11,10 0,14205 (note: chrom names are case sensitive)"

    I am wondering if I'm even attempting to use the right tool to view the file or if there is some upstream problem?

    Any help would be appreciated.

    Thanks,
    Sam

  • #2
    The pre-built index for rat is built from NCBI contigs. In the first column of the bed file where it should be giving you a chromosome name you're getting the contig instead. You can build your own index from the UCSC fasta files with bowtie-build and it won't have this problem.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Best Practices for Single-Cell Sequencing Analysis
      by seqadmin



      While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
      06-06-2024, 07:15 AM
    • seqadmin
      Latest Developments in Precision Medicine
      by seqadmin



      Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

      Somatic Genomics
      “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
      05-24-2024, 01:16 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 06:58 AM
    0 responses
    13 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 06-06-2024, 08:18 AM
    0 responses
    20 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 06-06-2024, 08:04 AM
    0 responses
    18 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 06-03-2024, 06:55 AM
    0 responses
    13 views
    0 likes
    Last Post seqadmin  
    Working...
    X