  • Gene prediction with both RNA-seq mapping and genomic sequence based inputs?

    Hey everyone,
    I am working on assembling a new reptile genome. I will have some pretty high coverage mRNA-seq data as well. Are there any standard gene prediction techniques that utilize both mRNA-seq data and genome sequence level data to predict genes? I found a program called Conrad that looks like it could do this kind of thing utilizing a conditional random field, but it doesn't look like it has been widely used, or maintained since 2009.

    Would the best option be to call genes twice with separate programs, once from the genome sequence and once from the mRNA-seq data (Cufflinks or something like that, maybe?), and then somehow merge the output from the two techniques? Are there any standard methods of performing this kind of merging?

    Gene annotation is another thing I will want to do with the output. I am going to hand-annotate a few genes, but it would be useful if there were some kind of program that does a BLAST similarity-based annotation of the remaining genes.

    Thank you for your suggestions!
    -John
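A minimal sketch of the BLAST similarity-based annotation asked about above, assuming the predicted genes have already been searched against a protein database and the hits saved in BLAST's standard 12-column tabular format (`-outfmt 6`). The function name `best_hits` and the E-value cutoff of 1e-5 are illustrative assumptions, not part of any tool mentioned in this thread.

```python
def best_hits(lines, max_evalue=1e-5):
    """Return {query_id: (subject_id, evalue, bitscore)} keeping the top hit per query.

    Assumes standard BLAST tabular columns:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
    """
    best = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        if evalue > max_evalue:
            continue  # hit too weak to transfer an annotation (cutoff is an assumption)
        # Keep the hit with the highest bit score for each query gene.
        if query not in best or bitscore > best[query][2]:
            best[query] = (subject, evalue, bitscore)
    return best


# Made-up example rows, for illustration only.
example = [
    "gene1\tP1\t90.0\t100\t10\t0\t1\t100\t1\t100\t1e-50\t200",
    "gene1\tP2\t95.0\t100\t5\t0\t1\t100\t1\t100\t1e-60\t250",
    "gene2\tP3\t30.0\t50\t35\t0\t1\t50\t1\t50\t0.5\t20",
]
annotations = best_hits(example)
```

Genes without a hit under the cutoff (here `gene2`) are simply absent from the result and would remain candidates for hand annotation.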

  • #2
    AFAIK, there are not a lot of fully integrative methods out there. The situation looks like gene finding 15 years ago or so, when "ab initio" approaches (genomic sequence only) were distinct from "similarity" methods (transcript alignments only).
    I would use Cufflinks, or try Gmorse.


    • #3
      AUGUSTUS is able to use RNA-Seq data and did a good job at gene prediction (at least in one plant genome). The problem is that one has to map the reads using BLAT, which is not the weapon of choice for fast and accurate spliced RNA-Seq mapping.
      Problem: unspliced reads running over short introns


      • #4
        The approach you propose was used recently for assembly of a nematode genome. See http://www.ncbi.nlm.nih.gov/pubmed/20980554.
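The merging step discussed in this thread could be sketched roughly as follows. This is only an illustration under stated assumptions, not the method used in the paper linked above or in any of the tools named here: it assumes each predictor's output has been reduced to gene-level intervals `(scaffold, start, end, strand)` with 1-based inclusive coordinates, and the function name `merge_loci` and the source labels are hypothetical.

```python
def merge_loci(ab_initio, rnaseq):
    """Merge overlapping gene intervals from two predictors into loci.

    Each input is a list of (scaffold, start, end, strand) tuples.
    Returns a list of dicts recording which source(s) support each locus.
    """
    tagged = [(c, s, e, st, "ab_initio") for c, s, e, st in ab_initio]
    tagged += [(c, s, e, st, "rnaseq") for c, s, e, st in rnaseq]
    # Sort so intervals on the same scaffold and strand are adjacent, ordered by start.
    tagged.sort(key=lambda r: (r[0], r[3], r[1], r[2]))

    merged = []
    for chrom, start, end, strand, source in tagged:
        last = merged[-1] if merged else None
        same_track = last and last["chrom"] == chrom and last["strand"] == strand
        if same_track and start <= last["end"]:
            # Overlaps the current locus: extend it and record the extra support.
            last["end"] = max(last["end"], end)
            last["sources"].add(source)
        else:
            merged.append({"chrom": chrom, "start": start, "end": end,
                           "strand": strand, "sources": {source}})
    return merged


# Made-up coordinates, for illustration only.
ab = [("scaf1", 100, 500, "+"), ("scaf1", 1000, 1500, "-")]
rna = [("scaf1", 450, 800, "+"), ("scaf2", 10, 90, "+")]
loci = merge_loci(ab, rna)
```

Loci supported by both sources are the most trustworthy; loci seen by only one source can be flagged for manual review. A real merge would also have to reconcile exon structure, which this interval-level sketch ignores.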
