  • Creating Transcriptome File for Use With BWA from GTF file and genomic fasta file

    I downloaded the Mus musculus mm10 pre-built TopHat indices from the TopHat website (http://ccb.jhu.edu/software/tophat/igenomes.shtml). This package comes with a .gtf file that specifies junction boundaries and start/stop codon positions. It also contains a .fa file for the mouse genome.

    I would like to produce a .fa file that contains all of the mouse RNA transcript sequences from the above two files. Is there any software that does this? I have read about de novo transcriptome assembly software like Trinity, but I do not think it would be ideal for what I want to do here. I merely need some kind of script/tool to generate the annotated mRNA (and ideally rRNA, mtRNA, and ncRNA) sequences that match the annotation in the aforementioned two files (the .gtf and .fa from the TopHat pre-built index).

    Thank you kindly for your time and help. Please let me know if you need any further information to evaluate my query. If I do not find an appropriate tool for this problem, I plan on writing a script to extract and splice the necessary sequences from the genomic .fa file, and will upload this script for others to use (but would rather avoid re-inventing the wheel if possible!).

  • #2
    Hi,
    Did you find a solution to your problem?
    I am confused about this too.


    • #3
      Whenever I hear the words "gtf" or "vcf", my mind goes to BEDtools. And indeed one of its programs is "fastaFromBed", which "Creates FASTA sequences based on intervals in a BED/GFF/VCF file." GTF is just a GFF in disguise.
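
For the splice step mentioned in the original question, here is a minimal Python sketch. As far as I know, extracting FASTA from individual exon records gives one sequence per exon, so the exons of each transcript still have to be joined (and reverse-complemented on the minus strand). The function names and the tiny in-memory genome and GTF records are hypothetical, made up for illustration. The sketch assumes the GTF has "exon" rows carrying a transcript_id attribute and that the chromosome names match the FASTA headers.

```python
import re
from collections import defaultdict

COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Reverse-complement a DNA string (A/C/G/T/N only)."""
    return seq.translate(COMPLEMENT)[::-1]


def read_fasta(lines):
    """Return {sequence_name: sequence} from an iterable of FASTA lines."""
    genome, name, chunks = {}, None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                genome[name] = "".join(chunks)
            # Keep only the first word of the header as the sequence name.
            name, chunks = line[1:].split()[0], []
        elif line:
            chunks.append(line)
    if name is not None:
        genome[name] = "".join(chunks)
    return genome


def read_exons(lines):
    """Return {transcript_id: (chrom, strand, sorted [(start, end), ...])}."""
    exons = defaultdict(list)
    info = {}
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "exon":
            continue
        match = re.search(r'transcript_id "([^"]+)"', fields[8])
        if match is None:
            continue
        tid = match.group(1)
        info[tid] = (fields[0], fields[6])
        exons[tid].append((int(fields[3]), int(fields[4])))
    return {tid: (*info[tid], sorted(exons[tid])) for tid in exons}


def spliced_transcripts(genome, transcripts):
    """Yield (transcript_id, sequence). GTF coordinates are 1-based, inclusive."""
    for tid, (chrom, strand, exons) in transcripts.items():
        # Join exons in genomic order, then flip minus-strand transcripts.
        seq = "".join(genome[chrom][start - 1:end] for start, end in exons)
        yield tid, reverse_complement(seq) if strand == "-" else seq


if __name__ == "__main__":
    # Hypothetical toy data; real input would be the genome .fa and the .gtf.
    genome = read_fasta([">chr1 demo", "AAACCCGGGTTT"])
    gtf = [
        'chr1\tdemo\texon\t1\t3\t.\t+\t.\tgene_id "g1"; transcript_id "t1";',
        'chr1\tdemo\texon\t7\t9\t.\t+\t.\tgene_id "g1"; transcript_id "t1";',
        'chr1\tdemo\texon\t1\t3\t.\t-\t.\tgene_id "g2"; transcript_id "t2";',
        'chr1\tdemo\texon\t4\t5\t.\t-\t.\tgene_id "g2"; transcript_id "t2";',
    ]
    for tid, seq in spliced_transcripts(genome, read_exons(gtf)):
        print(f">{tid}\n{seq}")
```

This loads the whole genome into memory, which is simple but heavy for a mammalian genome. For real data an indexed FASTA reader would likely be a better choice.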


      • #4
        Thanks westerman. However, I would like to ask something different but related.
        After running Trinity de novo, which reports Trinity.fasta, I ran BLAST and got blastx.outfmt6. I also have the TransDecoder output, i.e. Trinity.fasta.transdecoder.cds, Trinity.fasta.transdecoder.gff3, Trinity.fasta.transdecoder.mRNA, and Trinity.fasta.transdecoder.pep.

        I am confused about how to add real gene names to Trinity.fasta, so that the Trinity-numbered genes (TR1...n) are replaced by the real gene names available in Trinity.fasta.transdecoder.gff3 or blastx.outfmt6.
        Any ideas?
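
One possible approach, sketched in Python under stated assumptions: take the highest-bitscore hit per query from blastx.outfmt6 and attach its subject name to the matching FASTA header. This assumes the default 12-column tabular layout (qseqid and sseqid in columns 1 and 2, bitscore in column 12) and that the query IDs in the BLAST table equal the first word of each Trinity.fasta header. The function names and the example IDs are hypothetical.

```python
def best_hits(blast_lines):
    """Map each query ID to the subject ID of its highest-bitscore hit.

    Assumes the default 12-column tabular layout:
    qseqid, sseqid, pident, length, mismatch, gapopen,
    qstart, qend, sstart, send, evalue, bitscore.
    """
    best = {}
    for line in blast_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 12:
            continue  # skip blank or malformed rows
        query, subject, bitscore = fields[0], fields[1], float(fields[11])
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return {query: subject for query, (subject, _) in best.items()}


def annotate_headers(fasta_lines, hits):
    """Yield FASTA lines, appending the best-hit name to matching headers."""
    for line in fasta_lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            seq_id = line[1:].split()[0]
            if seq_id in hits:
                line = f"{line} best_hit={hits[seq_id]}"
        yield line


if __name__ == "__main__":
    # Hypothetical toy rows; real input would be blastx.outfmt6 and Trinity.fasta.
    blast = [
        "TR1|c0_g1_i1\tprotA\t90.0\t100\t10\t0\t1\t300\t1\t100\t1e-50\t200",
        "TR1|c0_g1_i1\tprotB\t95.0\t100\t5\t0\t1\t300\t1\t100\t1e-60\t350",
    ]
    fasta = [">TR1|c0_g1_i1 len=300", "ATGC", ">TR2|c0_g1_i1 len=200", "GGCC"]
    for out_line in annotate_headers(fasta, best_hits(blast)):
        print(out_line)
```

The sketch appends the name instead of replacing the Trinity ID, because several transcripts can hit the same protein and replaced headers would then no longer be unique. Sequences without a hit are left unchanged.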


        • #5
          Originally posted by jp.
          Thanks westerman. However, I would like to ask something different but related.
          After running Trinity de novo, which reports Trinity.fasta, I ran BLAST and got blastx.outfmt6. I also have the TransDecoder output, i.e. Trinity.fasta.transdecoder.cds, Trinity.fasta.transdecoder.gff3, Trinity.fasta.transdecoder.mRNA, and Trinity.fasta.transdecoder.pep.

          I am confused about how to add real gene names to Trinity.fasta, so that the Trinity-numbered genes (TR1...n) are replaced by the real gene names available in Trinity.fasta.transdecoder.gff3 or blastx.outfmt6.
          Any ideas?

          No idea. This is a processing step that I do not use, so I am unable to give advice.
