Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • barbarian
    Member
    • Feb 2015
    • 21

    Getting transcriptome sequence from RNA-seq reads in FASTA

    Hello,

    So, from Ensembl FTP, we can download a transcriptome file which is a FASTA file containing the header info, which is transcript name, chromosome position, etc. and the dna sequence itself. On the other hand, I have a FASTQ file from RNA-seq experiment.

    What I want to do is, generate the FASTA file like Ensembl transcriptome. I think I read this is called consensus FASTA.

    What I imagine the step to generate this is like this:

    1. Align the reads to the transcriptome reference, we get SAM/BAM
    2. Assemble the SAM/BAM according to coordinate
    3. Solve the occurence of SNP and indel
    4. Generate FASTA file with header information and sequence assembled from step 3

    For step 1, I know I can use bowtie2. For step 2, I don't know the tools but I think I can write my own program. The problem is step 3. I don't know how.
    In that case, probably you can suggest me well known pipeline to do this because I think this is a general things to do.

    What do you suggest for that? Thank you for your reply.
    Last edited by barbarian; 03-03-2016, 08:32 PM.
  • mastal
    Senior Member
    • Mar 2009
    • 666

    #2
    I think you can do most of what you want (following the bowtie alignment step) with samtools, using samtools sort, index, and then mpileup.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Pathogen Surveillance with Advanced Genomic Tools
      by seqadmin




      The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
      03-24-2025, 11:48 AM
    • seqadmin
      New Genomics Tools and Methods Shared at AGBT 2025
      by seqadmin


      This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

      The Headliner
      The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
      03-03-2025, 01:39 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 03-20-2025, 05:03 AM
    0 responses
    42 views
    0 reactions
    Last Post seqadmin  
    Started by seqadmin, 03-19-2025, 07:27 AM
    0 responses
    51 views
    0 reactions
    Last Post seqadmin  
    Started by seqadmin, 03-18-2025, 12:50 PM
    0 responses
    38 views
    0 reactions
    Last Post seqadmin  
    Started by seqadmin, 03-03-2025, 01:15 PM
    0 responses
    193 views
    0 reactions
    Last Post seqadmin  
    Working...