Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • swbarnes2
    replied
    Originally posted by carmeyeii View Post
    Shurjo,

    How do you configure the grep command so that it strips the sequence IDs that mapped to rRNA database, AND the following three lines that contain the sequence, the + and the quality string?

    Thanks!
    Carmen
    That's not how you do it. You have the .bam, which has the sequence and what it mapped to all on one line; you filter that. You could do that pretty easily with grep.

    Leave a comment:


  • carmeyeii
    replied
    Hi everyone!

    I am analyzing some Illumina libraries that appear to have a lot of ribosomal RNA contamination.

    I'm using Bowtie to align the reads only to a specific set of sequences, and because of the differing amount of rRNA contamination in each sample, each of them maps a different percentage of reads to the dataset (some half of what others map), ranging from 1% to 0.3%.

    I wonder if the amount of rRNA contamination in the preparation of a library can have an impact on the apparent expression level of a gene -- even though one normalizes its counts agains the total number of reads that mapped.

    What's your opinion in this subject?

    Carmen

    Leave a comment:


  • carmeyeii
    replied
    Originally posted by shurjo View Post
    I usually align the raw data against a set of ribosomal sequences (and the mitochondrial genome) using ELAND, and use grep with the -v option to remove any reads that find a match. If you are using the standard Illumina pipeline you can configure Gerald to use ANALYSIS:eland_rna to do this automatically without having to do it step-by-step.
    Shurjo,

    How do you configure the grep command so that it strips the sequence IDs that mapped to rRNA database, AND the following three lines that contain the sequence, the + and the quality string?

    Thanks!
    Carmen

    Leave a comment:


  • cliffbeall
    replied
    Reply to Ikim

    You want to use the closest species you can find. Try the SILVA database: http://www.arb-silva.de/ - they have an extensive collection of small and large subunit sequences and a taxonomic browser to find what you need.

    Leave a comment:


  • ikim
    replied
    Hello, was wondering how well conserved ribosomal RNA, and ribosomal proteins are? How relevant is it to use the Human ribosomal units for matching waterlily datasets for example?

    Leave a comment:


  • shurjo
    replied
    Here are the gi numbers for the ribosomal sequences, you can download them from Genbank.

    gi|555853|gb|U13369.1|HSU1336 Human ribosomal DNA complete repeating unit

    gi|23898|emb|X12811.1| Human 5S DNA

    To this I would add the mitochondrial genome which you can get from the UCSC Genome Browser site (unless you are interested in mitochondrial gene expression)

    HTH,

    Shurjo

    Leave a comment:


  • DNAjunk
    replied
    Thanks for your reply, shurjo.

    I am wondering where you get the set of ribosomal sequences (and the mitochondrial genome) from?

    Did you make it yourself or download from somewhere?

    Leave a comment:


  • shurjo
    replied
    I usually align the raw data against a set of ribosomal sequences (and the mitochondrial genome) using ELAND, and use grep with the -v option to remove any reads that find a match. If you are using the standard Illumina pipeline you can configure Gerald to use ANALYSIS:eland_rna to do this automatically without having to do it step-by-step.

    Leave a comment:


  • DNAjunk
    started a topic Filter ribosomal RNA

    Filter ribosomal RNA

    Hi

    Although ribosomal depletion is usually performed some sequences with ribosomal content remain and are sequenced.

    How do you filter out reads (e.g. from 454 Titanium runs) with ribosomal content?

    Many thanks for advice!

Latest Articles

Collapse

  • seqadmin
    Non-Coding RNA Research and Technologies
    by seqadmin




    Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

    Nobel Prize for MicroRNA Discovery
    This week,...
    10-07-2024, 08:07 AM
  • seqadmin
    Recent Developments in Metagenomics
    by seqadmin





    Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
    09-23-2024, 06:35 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 06:55 AM
0 responses
10 views
0 likes
Last Post seqadmin  
Started by seqadmin, 10-02-2024, 04:51 AM
0 responses
108 views
0 likes
Last Post seqadmin  
Started by seqadmin, 10-01-2024, 07:10 AM
0 responses
114 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-30-2024, 08:33 AM
1 response
118 views
0 likes
Last Post EmiTom
by EmiTom
 
Working...
X