Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bowtie2 outputs empty file

    I'm trying to align a paired reads fastq file to the hg19 genome using bowtie2 in Galaxy. The paired ends files are the output of a fastq groomer and are about 3GB each and contains reads like these:

    @ERR010982.1460.2 SOLEXA-GA01_1:1:1:21:1187 length=76
    AGTTATGATTTTTGTTAGTCTTTTTGTCTTATTATTCTTCCTTAGGATTATAACAACTACTCTAACCTTTTGTTCT
    +ERR010982.1460.2 SOLEXA-GA01_1:1:1:21:1187 length=76
    !"""!""!""""""""!"!"""""""!"""""""""""""""""""""!!"!"""!!"!!!!!!!!!!!!!!!!!!

    The bowtie2 syntax as run by galaxy is:
    bowtie2-build "/home/leon/ref_data/fa/hg19.fa" genome; ln -s "/home/leon/ref_data/fa/hg19.fa" genome.fa; bowtie2 -p ${GALAXY_SLOTS:-4} -x genome -1 /home/leon/galaxy-dist/database/files/000/dataset_19.dat -2 /home/leon/galaxy-dist/database/files/000/dataset_20.dat -I 0 -X 250 | samtools view -Su - | samtools sort -o - - > /home/leon/galaxy-dist/database/files/000/dataset_21.dat

    For some reason, the bam file that's generated after this runs for several hours is only 62 bytes long, meaning nothing got aligned! What could I be doing wrong? This is the first time I'm aligning a genome and so could be royally screwing things up.

  • #2
    Probably better to post this to the Galaxy forum.

    I am unsure which Galaxy instance you are using but my first observation is that you are trying to build the index for a commonly used genome -- hg19. Why not use the built-in index? My suspicion is that if you are using the public Galaxy instance that you are running out disk space or time when building the index.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Understanding Genetic Influence on Infectious Disease
      by seqadmin




      During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

      Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
      09-09-2024, 10:59 AM
    • seqadmin
      Addressing Off-Target Effects in CRISPR Technologies
      by seqadmin






      The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
      08-27-2024, 04:44 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Today, 06:25 AM
    0 responses
    9 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, Yesterday, 01:02 PM
    0 responses
    8 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-18-2024, 06:39 AM
    0 responses
    10 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-11-2024, 02:44 PM
    0 responses
    13 views
    0 likes
    Last Post seqadmin  
    Working...
    X