Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Large RNA sequences ? Does it has any sense ?

    Hi!

    First, i'm a computer scientist recently exploring bioinformatics field, so please forgive me if i say something really stupid

    Basically i'm studying the possibility of implementing Nussinov-Jacobsen algorithm on GPU's, accelerating, if possible, time performance in orders of magnitude; but to accomplish that, the RNA sequence has to be very large. I was wondering if it has some sense since i've seen most RNA seqs are about 200 bases.

    Thanks!

  • #2
    How long do you mean by 'very large'? It depends on the sequencing technology used and the length ordered. Even 200 is somewhat in the 'long' range for NGS (I believe).

    Comment


    • #3
      What kind of RNA do you mean? There are entire RNA genomes of bacteria. And normal mRNAs are some 100s to 1000s nucleotides long.
      By quickly googling Nussinov-Jacobsen I learned that you can do RNA folding prediction with it. That only makes sense for small RNAs.

      Comment


      • #4
        454 has 400+ base reads & PacBio is promising reads that long or much longer.

        Folding of longer RNAs could be interesting, as secondary structure is sometimes involved in the stability, localization or utilization of an RNA.

        It's a niche, but that doesn't mean it isn't interesting.

        Comment


        • #5
          Many RNAs are long, but the current sequencing technologies fragment them prior to sequencing, since they perform better on shorter sequences. SOLiD works up to 50 bp, Illumina works up to about 100 bp, and 454 can get a few hundred bp. If you want something larger you'll have to piece together multiple reads into a longer consensus sequence.
          It seems to me, though, that if you need a longer consensus sequence you could just use the complement of the genomic sequence (which is the RNA sequence) for some interesting genes. If your goal is to demonstrate an algorithmic speedup using a GPU-based approach it seems that it wouldn't be important to have cutting-edge RNA data, but it would be better to use a well studied RNA (like ribosomal RNA or tRNA) for your comparison.

          Comment


          • #6
            Originally posted by mrawlins View Post
            Many RNAs are long, but the current sequencing technologies fragment them prior to sequencing, since they perform better on shorter sequences. SOLiD works up to 50 bp, Illumina works up to about 100 bp, and 454 can get a few hundred bp. If you want something larger you'll have to piece together multiple reads into a longer consensus sequence.
            It seems to me, though, that if you need a longer consensus sequence you could just use the complement of the genomic sequence (which is the RNA sequence) for some interesting genes. If your goal is to demonstrate an algorithmic speedup using a GPU-based approach it seems that it wouldn't be important to have cutting-edge RNA data, but it would be better to use a well studied RNA (like ribosomal RNA or tRNA) for your comparison.
            Ok.

            And how many nucleotides can have that ribosomal or tRNA ?

            Comment


            • #7
              In Shewanella the longest ribosomal sequence is about 2900 bases long. Human ribosome sequences may be a bit longer. The tRNAs (in Shewanella) are about 76 bases long. I seem to recall that tRNAs have some of the best documented secondary structure, though, so while they may not make a good test of your algorithm's speed, they might be a good test of accuracy.

              Comment


              • #8
                Originally posted by mrawlins View Post
                In Shewanella the longest ribosomal sequence is about 2900 bases long. Human ribosome sequences may be a bit longer. The tRNAs (in Shewanella) are about 76 bases long. I seem to recall that tRNAs have some of the best documented secondary structure, though, so while they may not make a good test of your algorithm's speed, they might be a good test of accuracy.
                Thanks!

                I wonder, what about the searching for structures on an entire genome ?

                Comment


                • #9
                  Originally posted by perencia View Post
                  I wonder, what about the searching for structures on an entire genome ?
                  Only some viruses have an RNA genome (most organisms use DNA), and their genomes tend not to be very big (not big enough to worry about GPU optimisations I would guess).

                  Comment


                  • #10
                    Have you had a chance to take a look at Rfam and Sean Eddy's Infernal?

                    Comment


                    • #11
                      Originally posted by Bruins View Post
                      Have you had a chance to take a look at Rfam and Sean Eddy's Infernal?
                      No, but i'll look for them now

                      I've searching a little more, and found that report



                      and that implementation



                      Former is a GPU implementation of the Unafold Algorithm
                      ( http://mfold.bioinfo.rpi.edu/ )

                      It seems that a GPU optimisation may take room on a multiple RNA structure prediction, as in the first report ( a set of 11 Picor-
                      naviral sequences (7124 to 8214 nucleotides)).
                      I'll post what i find

                      Anyway, i'd like to known which are the benefits from such procedures, where do they impact. Bioinformatics is a large field i guess .
                      Last edited by perencia; 07-29-2010, 07:12 AM.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Recent Advances in Sequencing Technologies
                        by seqadmin







                        Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                        Long-Read Sequencing
                        Long-read sequencing has...
                        12-02-2024, 01:49 PM
                      • seqadmin
                        Genetic Variation in Immunogenetics and Antibody Diversity
                        by seqadmin



                        The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
                        11-06-2024, 07:24 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 12-02-2024, 09:29 AM
                      0 responses
                      137 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 12-02-2024, 09:06 AM
                      0 responses
                      48 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 12-02-2024, 08:03 AM
                      0 responses
                      38 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 11-22-2024, 07:36 AM
                      0 responses
                      69 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X