Getting transcriptome sequence from RNA-seq reads in FASTA

barbarian

Member

Join Date: Feb 2015

Posts: 21
- Share
- Tweet
#1

Getting transcriptome sequence from RNA-seq reads in FASTA

03-03-2016, 08:22 PM

Hello,

So, from Ensembl FTP, we can download a transcriptome file which is a FASTA file containing the header info, which is transcript name, chromosome position, etc. and the dna sequence itself. On the other hand, I have a FASTQ file from RNA-seq experiment.

What I want to do is, generate the FASTA file like Ensembl transcriptome. I think I read this is called consensus FASTA.

What I imagine the step to generate this is like this:

1. Align the reads to the transcriptome reference, we get SAM/BAM
2. Assemble the SAM/BAM according to coordinate
3. Solve the occurence of SNP and indel
4. Generate FASTA file with header information and sequence assembled from step 3

For step 1, I know I can use bowtie2. For step 2, I don't know the tools but I think I can write my own program. The problem is step 3. I don't know how.
In that case, probably you can suggest me well known pipeline to do this because I think this is a general things to do.

What do you suggest for that? Thank you for your reply.

Last edited by barbarian; 03-03-2016, 08:32 PM.
Tags: alignment, rna-seq, transcript mapping
mastal

Senior Member

Join Date: Mar 2009

Posts: 666
- Share
- Tweet
#2

03-04-2016, 02:33 AM

I think you can do most of what you want (following the bowtie alignment step) with samtools, using samtools sort, index, and then mpileup.

Multisample SNP Calling

http://samtools.sourceforge.net/mpileup.shtml
Comment

Previous template Next

Genetic Variation in Immunogenetics and Antibody Diversity

by seqadmin

The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
- Channel: Articles
11-06-2024, 07:24 PM
Choosing Between NGS and qPCR

by seqadmin

Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
- Channel: Articles
10-18-2024, 07:11 AM

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 24 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 20 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

Getting transcriptome sequence from RNA-seq reads in FASTA

Comment

Latest Articles

ad_right_rmr

News