Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • TopHat2 command

    Hello, I am not a bioinformatician I am more of a bench scientist. I am trying to align illumina bodymap 2.0 RNAseq files from ENA to the hg19 .fa file. I cannot find the command anywhere to do this with tophat. Is it similar to the bowtie2 command? I just started using linux to get this data and I really need help .

  • #2
    First, you want to be sure you have a powerful enough computer, and enough disk space.

    Second, you want to build an index for the hg19.fa file or download one, from the iGenomes for example

    Third, you'll want to download the annotation file, in the format of a GTF file, from UCSC or the iGenomes.

    Fourth, you need to understand the Linux command line.
    To get the arguments for TopHat, just run the following command.

    tophat --help
    Here is a sample command for TopHat
    tophat \
    --library-type fr-firststrand \
    -G Mus_musculus.GRCm38.77.gtf \
    -o results/tophat/sample_1 \
    Bowtie2Index/Mus_musculus.GRCm38.dna.primary_assembly \
    sample_1_R1.fastq.gz \
    sample_2_R2.fastq.gz
    There is a classic article in Nature Protocols for beginners.
    They make it sound a bit too simple, though, and don't cover any of the subtleties of RNA-Seq.
    Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks
    Last edited by blancha; 11-21-2015, 02:22 PM.

    Comment


    • #3
      @fk566938: If you want to avoid the time consuming step of aligning the files (sounds like you are a new user and may not have adequate software/hardware available locally) you could use the aligned BAM files that were made available by Ensembl for this data.

      Original announcement was here : http://www.ensembl.info/blog/2011/05...from-illumina/
      BAM files first appeared in ensembl release 70 (were available till Ensembl release 78 at least): ftp://ftp.ensembl.org/pub/release-78...ens/genebuild/

      If you just want to query/browse specific genes for tissues you can do that here: https://www.ebi.ac.uk/gxa/experiments/E-MTAB-513

      This link has the tissue specific gene expression analysis from the bodymap data: http://www.cureffi.org/2013/07/11/ti...n-bodymap-2-0/

      Comment


      • #4
        thank you, I've been looking for a way to get already aligned bam files. I have access to the university's linux cluster but I don't have the time anymore to map. My PI wants this data asap. I've tried using these expression sites to find expression of my gene, but I am looking for a specific Isoform and there isn't a lot of information on it in the databases. Thank you again for a link to the already aligned bam files. Would you happen to know if these alignments are already "sorted"? Also they say they aligned them to the genome using the epstein barr virus as a decoy, do you know what that means?
        Last edited by fk566938; 11-21-2015, 07:39 AM.

        Comment


        • #5
          Originally posted by fk566938 View Post
          I have access to the university's linux cluster but I don't have the time anymore to map. My PI wants this data asap.
          Your PI is underestimating the amount of training and effort it takes to get correct and useful results using bioinformatics. It's not just something you can toss in at the end on a whim.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Recent Advances in Sequencing Analysis Tools
            by seqadmin


            The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
            Yesterday, 07:48 AM
          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 07:17 AM
          0 responses
          11 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-02-2024, 08:06 AM
          0 responses
          19 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-30-2024, 12:17 PM
          0 responses
          20 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-29-2024, 10:49 AM
          0 responses
          29 views
          0 likes
          Last Post seqadmin  
          Working...
          X