Header Leaderboard Ad

Collapse

What is the Reference to be used for RNA-Seq assembly?

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What is the Reference to be used for RNA-Seq assembly?

    Hello all.

    I have been trying to do a Reference-based assembly for my RNA-seq data. It is Illumina paired-end data of 100bp reads.
    There are 8 different sets of data samples.

    But now I have a problem for aligned the reads to the reference. I have the fasta sequence of contigs, scaffolgs, genes and complete mRNA. The sequences are split in the form of chromosomes. This data was downloaded from an already available browser, with view in Chromosome-wise.

    I plan to use Bowtie for the alignment, and I need a new Index for my genome as its not readily available.
    Which is the reference for Transcriptome alignment - contigs, scaffolgs, genes, complete mRNA or something else?

    I tried using contigs, mRNA, scaffold fasta sequences but only 0.05% of the reads align to any of these.
    What exactly must be used as the reference? What should the Bowtie Index be built upon???

    My aim is to create the BAM files so that it can be viewed in the Genome Browser.
    Last edited by rohitngs; 02-20-2013, 01:27 AM.

  • #2
    You have to use the genome as reference in a reference-base assembly.

    Comment


    • #3
      The complete genome is not available; only the contigs, scaffolgs, genes and complete mRNA are available

      Comment


      • #4
        I am trying to use the scaffolds that are currently available, is that fine???
        Will that be ok to align my RNA-seq reads???

        Comment


        • #5
          Maybe you can try De-Novo assembly

          Comment


          • #6
            The problem with de-novo assembly was I could not generate the SAM files properly...

            Comment


            • #7
              In general a De-Novo assembly will output a list of transcript in fasta file

              Comment


              • #8
                Yes, I tried Rnnotator and got the final contigs.
                But I could not generate the BAM files from that as it always showed no header line error

                Comment


                • #9
                  Maybe try Trinity, it worked well. On which organism do you work ?

                  Comment


                  • #10
                    Silkworm, it has very high repetitive sequences... and the sex chromosomes have not yet been sequenced...

                    Comment


                    • #11
                      Hi rohitngs,

                      if your genomic assembly is fairly complete you could try to map the reads onto that (combine the contigs and scaffolds into a single FASTA file and index that for Bowtie).

                      In theory you could also map your RNASeq reads to, say, a set of full length cDNA sequences, but you would need to be confident that it is pretty complete (and you never really can be), or you will throw away a lot of data.

                      However, I am alarmed by the fact that only 0.05% of your reads mapped when you mapped them to the genomic assembly. That to me suggests that your data is not from silkworm at all, and I would first check that it hasn't been contaminated with something else or even swapped with another sample. We had a case like this here not so long ago.

                      Just take a few reads and BLAST them against the NCBI database - you will quickly get an idea of what is going on.

                      cheers

                      Micha

                      Comment


                      • #12
                        Yes, that was the reason...

                        I was working with miRNA pooled libraries thinking that they were actually mRNA sequences... I did not know of that miRNA library data existed till I asked them if there was any other RNA-seq done...
                        I got it last week only when I was looking through all the data I was given, forgot to update that...

                        It was pretty stupid of me, but I guess that's how I had to learn...
                        Thank you so much Micha....

                        Cheers,
                        Rohit
                        Last edited by rohitngs; 02-26-2013, 02:52 AM.

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
                          by seqadmin




                          Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
                          03-10-2023, 05:31 AM
                        • seqadmin
                          Expert Advice on Automating Your Library Preparations
                          by seqadmin



                          Using automation to prepare sequencing libraries isn’t a new concept, and most researchers are aware that there are numerous benefits to automating this process. However, many labs are still hesitant to switch to automation and often believe that it’s not suitable for their lab. To combat these concerns, we’ll cover some of the key advantages, review the most important considerations, and get real-world advice from automation experts to remove any lingering anxieties....
                          02-21-2023, 02:14 PM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, 03-17-2023, 12:32 PM
                        0 responses
                        7 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-15-2023, 12:42 PM
                        0 responses
                        17 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-09-2023, 10:17 AM
                        0 responses
                        66 views
                        1 like
                        Last Post seqadmin  
                        Started by seqadmin, 03-03-2023, 12:03 PM
                        0 responses
                        64 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X