Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Creating Transcriptome File for Use With BWA from GTF file and genomic fasta file

    I downloaded the Mus musculus mm10 pre-built tophat indices from the Tophat website (http://ccb.jhu.edu/software/tophat/igenomes.shtml). This package comes with a .gtf file that specifies junction boundaries and start/stop codon positions. It also contains a .fa file for the mouse genome.

    I would like to produce a .fa file that contains all of the mouse RNA transcript sequences from the above two files. Is there any software that does this? I have read of de novo transcriptome assembly software like Trinity, but I do not think it would be ideal for what I would like to do here. I merely need some kind of script/tool to generate the annotated mRNA (and ideally, rRNA, mtRNA, and ncRNA) sequences that will match the annotation in the aforementioned two files (.gtf and .fa from the tophat pre-built index).

    Thank you kindly for your time and help. Please let me know if there is any further information you would like to help evaluate my query. If I do find an appropriate tool for this problem, I plan on writing a script to extract and splice necessary sequences from the genomic .fa file, and will upload this script for others to use (but would rather avoid re-inventing the wheel if possible!).

  • #2
    Hi
    Any solution of your problem ?
    I am confused with too

    Comment


    • #3
      Whenever I heard the word "gtf" or "vcf" then my mind goes to BEDtools. And indeed one of the programs is "fastaFromBed" which "Creates FASTA sequences based on intervals in a BED/GFF/VCF file." GTF is just a GFF in disguise.

      Comment


      • #4
        Thanks westerman, however, I like to ask you something different but relative.
        After running Denovo Trinity reports Trinity.fasta and then I did blast got blastx.outfmt6. I also have TransDecoder output i.e. Trinity.fasta.transdecoder.cds , Trinity.fasta.transdecoder.gff3 , Trinity.fasta.transdecoder.mRNA , Trinity.fasta.transdecoder.pep

        I am confused how to add real gene names in to Trinity.fasta So that Trinity numbered genes (TR1...n) will be replaced by real gene names which are available in Trinity.fasta.transdecoder.gff3 or blastx.outfmt6 ?
        Any idea ?

        Comment


        • #5
          Originally posted by jp. View Post
          Thanks westerman, however, I like to ask you something different but relative.
          After running Denovo Trinity reports Trinity.fasta and then I did blast got blastx.outfmt6. I also have TransDecoder output i.e. Trinity.fasta.transdecoder.cds , Trinity.fasta.transdecoder.gff3 , Trinity.fasta.transdecoder.mRNA , Trinity.fasta.transdecoder.pep

          I am confused how to add real gene names in to Trinity.fasta So that Trinity numbered genes (TR1...n) will be replaced by real gene names which are available in Trinity.fasta.transdecoder.gff3 or blastx.outfmt6 ?
          Any idea ?
          No idea. This is processing step that I do not use thus am unable to give advice.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Latest Developments in Precision Medicine
            by seqadmin



            Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

            Somatic Genomics
            “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
            Yesterday, 01:16 PM
          • seqadmin
            Recent Advances in Sequencing Analysis Tools
            by seqadmin


            The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
            05-06-2024, 07:48 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 07:15 AM
          0 responses
          13 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-23-2024, 10:28 AM
          0 responses
          17 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-23-2024, 07:35 AM
          0 responses
          17 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-22-2024, 02:06 PM
          0 responses
          10 views
          0 likes
          Last Post seqadmin  
          Working...
          X