Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • STAR Aligner Output BAM Processing

    Hi everyone,

    I am a new user of STAR Aligner. I have been using GSNAP until now, but my new group likes STAR better. I am very excited about using it and have a couple of questions too. We will be using paired end fastq files as input. I saw there are couple of really cool things with STAR:

    1. Output coordinate-sorted/unsorted BAM files using --outSAMtype
    2. Output read counts using --quantMode

    Question 1: Does the --quantMode perform equally well to htseq-count? Are the read-counts identical in both cases?

    When I was using GSNAP, after I obtained the output BAM file, I used to perform the following steps before using htseq-count:
    1. samtools fixmate to fill in mate information
    2. bamtools filter to keep only "reads in proper pair" and CIGAR string should indicate atleast one match i.e. 'M'

    Question 2: So do I have to do these steps after I obtain the bam output from STAR?

    Question 3: My command line for htseq-count is as follows with the GSNAP output:
    samtools view -f 0x0002 sample.bam | htseq-count - $GTF > sample.counts

    Question 3: Do I have to use '-f 0x0002' (i.e. reads mapped in proper pair) while using htseq-count on STAR output bam file as well?

    Thanks a lot!
    Komal Rathi
    Bioinformatics Application Developer
    University of Pennsylvania

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    Yesterday, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
57 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
53 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
45 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-04-2024, 09:00 AM
0 responses
55 views
0 likes
Last Post seqadmin  
Working...
X