Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to examine paired reads

    Hello, a beginner question, any help appreciated

    We sequenced an organism (Illumina Hiseq) and would like now to examine how the reads are getting paired, just as a quality control look at how far the paired end are from each other, how many pairs there are.

    Unfortunately, this organism does not have a reference genome, so initially our approach was to get a single gene, such as the rRNA gene fasta sequence for this organism, align as many of our sequenced reads as possible to that gene, and examine the distance between the pairs.

    Does this approach make sense? What program can we use to align our reads to only a single gene.

    Any other suggestions appreciated!

    Thanks in advance,
    Ramiro

  • #2
    I presume you are talking about an transcriptome project instead of a whole genome project thus avoiding the whole issue of intron/exons.

    I wouldn't use rRNA -- too many duplicates -- but if that is all you have then, sure, use it. And it would be useful to have more than one gene to look at just for good statistics. In general you approach is valid for a first pass. As for the program to use, there are many. I'd try one of the short read aligners such as BWA, Bowtie2, SNAP, etc.

    Also I suggest running FastQC to get a feel for the quality of the data.

    Comment


    • #3
      You are correct, it is a transcriptome project.

      Thank you very much for your feedback. The reason I was asking about what program to use, is that BWA, Bowtie2, etc seem to be meant for long reference sequences (such as genomes, chromosomes) and not for a small gene of just a few hundred nucleotides in length. However, I will try.

      I ran FastQC and the reads are of high quality.

      The situation, on the bigger picture, is that the reads are not assembling, and one possibility is that the "pairing" has shifted for some reason (that what are being called "pairs" are not really pairs). I don't think this is true (for various reasons) but would like to verify it somehow before we rule it out and have that as a QC step for the future.

      Comment


      • #4
        Certainly BWA, Bowtie2, SNAP are good for long references but they can also be used on short references. I use said programs to scan reads for matches to the rRNA database that we use. Said database has hundreds of short rRNA references but it could just as easily have one short reference. I also use the programs to scan my phiX database which has a single fairly short reference in it.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Non-Coding RNA Research and Technologies
          by seqadmin




          Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

          Nobel Prize for MicroRNA Discovery
          This week,...
          10-07-2024, 08:07 AM
        • seqadmin
          Recent Developments in Metagenomics
          by seqadmin





          Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
          09-23-2024, 06:35 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 02:44 PM
        0 responses
        7 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-11-2024, 06:55 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-02-2024, 04:51 AM
        0 responses
        110 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-01-2024, 07:10 AM
        0 responses
        116 views
        0 likes
        Last Post seqadmin  
        Working...
        X