  • Irina Pulyakhina
    Member
    • Sep 2010
    • 24

    mapping microRNA data -- less than 1% maps

    Hi guys,

    I am not new to NGS/RNA-Seq, but I am new to miRNA sequencing... I want to map human miRNA reads (HiSeq 2500, single end, 50bp) to the miRNA databases (mature and hairpin, I took both from here: http://www.mirbase.org/ftp.shtml; I also checked that the headers of the sequences contain "human", so I don't map against the wrong database). This is the procedure I follow:

    (1) using cutadapt, remove Illumina adapters, discard reads that are too short (<17bp) or too long (>35bp) after removing adapters -- works fine, no errors/warnings, removes 5-10% of reads from the initial fastq files.
    (2) index the miRNA databases.
    (3) map trimmed reads (1) to the indexed databases (2)

    However, when I do it using stampy, bwa, bowtie or bowtie2, I get less than 0.5% of reads mapped... I believe I'm doing something wrong at the indexing step or am missing something at the alignment step (duh...). Does anyone have an idea of what I could be doing wrong?

    You could find all my commands here:
    jknightlab/mirna_pipeline (bioinformatics pipeline to analyze micro RNA sequencing data)


    And I also copy them here:

    **Stampy**

    > stampy.py -g human_mature_mirna -H human_mature_mirna
    > stampy.py -g human_mature_mirna -h human_mature_mirna -M reads.fastq -o alignment.stampy.sam


    **bowtie**

    > bowtie-build mature_dna_human.fa mature_mirna
    > bowtie -l 8 mature_mirna reads.fastq > alignment.bowtie.sam

    **bowtie2**

    > bowtie2-build mature_dna_human.fa mature_mirna.bowtie2
    > bowtie2 -L 8 -x mature_mirna.bowtie2 reads.fastq > alignment.bowtie2.sam

    **BWA**

    > bwa index -a is mature_dna_human.fa
    > samtools faidx mature_dna_human.fa
    > java -jar CreateSequenceDictionary.jar REFERENCE=mature_dna_human.fa OUTPUT=mature_dna_human.dict
    > bwa aln -l 8 Database_for_mirna/mature_dna_human.fa reads.fastq > alignment.bwa.sai

    I hope someone could help!

    Cheers,
    Irina
  • natasha321
    Junior Member
    • Nov 2016
    • 2

    #2
    Hi Irina,

    I'm facing the same problem now. Did you find a solution? I've tried aligning my miRNA-seq data to the mature sequences from miRBase, but most of the reads did not align.

    Do you have any tips, please?

    best,


    • HESmith
      Senior Member
      • Oct 2009
      • 512

      #3
      For starters, try alignment to the whole genome to see 1) if the unmapped reads are from your species or contaminants, and 2) where they align (e.g., if the library is actually RNA rather than miRNA).


      • colindaven
        Senior Member
        • Oct 2008
        • 417

        #4
        It's unlikely the alignment or indexing is messed up across so many aligners. I would suggest the adapter trimming is not working perfectly.

        Try running FastQC on your fastq files before and after trimming. Is there a big difference? Is the expected size range hit? Are there plenty of 21 nt reads left?

        Also, additional adapters which your provider did not tell you about may be present.
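
        If FastQC is not at hand, the length check suggested here can be approximated with a few lines of Python. This is only a rough sketch: the function names are made up for illustration, and it assumes a standard 4-line-per-record FASTQ file. After correct trimming, a miRNA library would normally be expected to show a peak around 21 to 23 nt.

```python
from collections import Counter


def read_length_histogram(lines):
    """Count reads per length from an iterable of FASTQ lines."""
    lengths = Counter()
    for line_number, line in enumerate(lines):
        # In a 4-line FASTQ record the sequence is the second line.
        if line_number % 4 == 1:
            lengths[len(line.strip())] += 1
    return lengths


def format_histogram(lengths):
    """Return one text row per read length: length, count, percent of reads."""
    total = sum(lengths.values())
    rows = []
    for length in sorted(lengths):
        share = 100.0 * lengths[length] / total
        rows.append(f"{length:>3} nt  {lengths[length]:>10}  {share:5.1f}%")
    return rows


# Example usage (the file name is a placeholder):
# with open("reads.trimmed.fastq") as handle:
#     print("\n".join(format_histogram(read_length_histogram(handle))))
```

        Running it on the fastq files before and after trimming gives a quick view of whether the adapter removal shifted the reads into the expected size range.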


        • wingless
          Junior Member
          • Jan 2012
          • 8

          #5
          When you downloaded the human microRNA sequences, did you replace U-s with T-s? Bowtie-build input should be DNA (with Ts) not RNA, it skips Us.
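
          For anyone who has not done the conversion yet, here is a minimal sketch of it in Python. It assumes the miRBase FASTA layout in which human entries have IDs starting with `hsa-`; the function name and the file names in the usage comment are placeholders, not part of any tool.

```python
def rna_to_dna_fasta(lines, species_prefix="hsa-"):
    """Yield FASTA lines for one species, with U replaced by T in sequences.

    Assumes miRBase-style headers such as '>hsa-let-7a-5p ...', where the
    species is encoded as a prefix of the sequence ID.
    """
    keep = False
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            # Keep only records whose ID starts with the species prefix.
            keep = line[1:].startswith(species_prefix)
            if keep:
                yield line
        elif keep and line:
            # Sequence line: convert the RNA alphabet to DNA.
            yield line.upper().replace("U", "T")


# Example usage (file names are placeholders):
# with open("mature.fa") as src, open("mature_dna_human.fa", "w") as dst:
#     for out_line in rna_to_dna_fasta(src):
#         dst.write(out_line + "\n")
```

          The index then has to be rebuilt from the converted file before mapping again.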


          • kerplunk412
            Senior Member
            • Jun 2012
            • 119

            #6
            Originally posted by wingless:
            "When you downloaded the human microRNA sequences, did you replace U-s with T-s? Bowtie-build input should be DNA (with Ts) not RNA, it skips Us."

            This. Also, the small RNA adapter is different from the "standard" Illumina adapter, so make sure you trimmed the correct adapter sequence.
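
            One way to check which adapter is actually in the data is to count how many untrimmed reads contain the start of each candidate. The sketch below is only illustrative: the two sequences are the commonly cited TruSeq small RNA 3' adapter and the common prefix of the standard TruSeq adapters, and they are assumptions that should be verified against the documentation of the library prep kit that was used. The function name and `prefix_len` parameter are likewise made up for this example.

```python
# Candidate adapter sequences (assumptions; confirm against your kit's docs).
CANDIDATE_ADAPTERS = {
    "TruSeq small RNA 3' adapter": "TGGAATTCTCGGGTGCCAAGG",
    "standard TruSeq adapter (common prefix)": "AGATCGGAAGAGC",
}


def adapter_hits(lines, adapters=CANDIDATE_ADAPTERS, prefix_len=12):
    """Count reads containing the first prefix_len bases of each adapter.

    `lines` is an iterable of lines from an untrimmed 4-line-per-record FASTQ.
    Returns (total_reads, {adapter_name: reads_with_hit}).
    """
    hits = {name: 0 for name in adapters}
    total = 0
    for line_number, line in enumerate(lines):
        if line_number % 4 != 1:  # only the sequence line of each record
            continue
        total += 1
        read = line.strip().upper()
        for name, seq in adapters.items():
            if seq[:prefix_len] in read:
                hits[name] += 1
    return total, hits


# Example usage (the file name is a placeholder):
# with open("reads.fastq") as handle:
#     total, hits = adapter_hits(handle)
#     for name, count in hits.items():
#         print(f"{name}: {count} of {total} reads")
```

            With 50 bp reads and inserts of roughly 22 nt, the correct adapter should show up in a large share of the untrimmed reads; if neither candidate does, a different adapter is probably present.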
