  • Taxonomic Analysis of MiSeq Data

    Hello everyone,

    Recently I have completed a MiSeq run with 63 samples multiplexed.

    I am currently attempting to analyse this data for small viral genomes obtained from chicken gut homogenates, with the expected viruses being around 7-10 kb in length.

    Normally, my go-to route with smaller datasets would be to run my contig files against a local BLAST nr/nt database and load the resulting .xml file into MEGAN for taxonomic analysis.

    Unfortunately, because the MiSeq dataset is so large, going this route sample by sample is very time-consuming, especially since the computer we are using is a tad underpowered.

    I was hopeful when I saw all the apps available on BaseSpace; however, there does not seem to be an appropriate viral categorisation or taxonomic tool. I tried Kraken, but I got very little viral output compared with a BLAST & MEGAN analysis of the same sample.

    I was wondering if anyone had any tips for this type of analysis.

    Thanks for the help.

  • #2
    Hi GSviral,

    Have you solved the problem of BLASTing huge datasets?
    I am also trying to detect viruses in clinical samples such as frozen plasma (sequenced on a HiSeq 2000).
    I also tried Kraken, but it missed many reads that BLAST maps to viruses.
    BLAST is more accurate and accepts long gaps, but it is very slow.

    To handle a huge amount of data (FASTA files with millions of reads):
    First, I removed the large fraction of host-genome reads with Bowtie2 (it is ultrafast and accepts long gaps, like splicing).
    Then I split the FASTA files into 1,000,000 reads per file and ran BLAST+ in parallel.
    It still takes 1-2 days on a supercomputer (4 CPU cores / 32 GB memory per job), and would take a week on a workstation ...
    But I think BLAST is the right tool for you to correctly find multiple viruses in your samples.
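
For anyone wanting to try the same split-and-parallelise approach, here is a rough Python sketch of the steps described above. It is only an illustration, not the exact scripts used in this thread: the function names, the chunk file naming, the host index name and the database name `nt` are all assumptions, and the Bowtie2 and BLAST+ options should be checked against your installed versions. `-outfmt 5` asks blastn for XML output, which matches the .xml input mentioned earlier for MEGAN.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def split_fasta(fasta_path, reads_per_file, out_dir):
    """Split a FASTA file into chunks of at most reads_per_file records.

    Chunk names (chunk_0000.fasta, ...) are a hypothetical convention.
    Any text before the first '>' header is ignored.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    chunk_paths = []
    handle = None
    n_in_chunk = 0
    with open(fasta_path) as src:
        for line in src:
            if line.startswith(">"):
                # Start a new chunk at the first record or when the current one is full.
                if handle is None or n_in_chunk == reads_per_file:
                    if handle is not None:
                        handle.close()
                    chunk = out_dir / f"chunk_{len(chunk_paths):04d}.fasta"
                    chunk_paths.append(chunk)
                    handle = open(chunk, "w")
                    n_in_chunk = 0
                n_in_chunk += 1
            if handle is not None:
                handle.write(line)
    if handle is not None:
        handle.close()
    return chunk_paths


def bowtie2_host_filter_command(reads_fasta, host_index, unaligned_fasta, threads=4):
    """Build a Bowtie2 command that keeps only reads NOT aligning to the host.

    host_index is an assumed, pre-built Bowtie2 index prefix.
    """
    return ["bowtie2", "-f", "-x", str(host_index), "-U", str(reads_fasta),
            "--un", str(unaligned_fasta), "-S", "/dev/null", "-p", str(threads)]


def blastn_command(chunk, db="nt", threads=4):
    """Build a blastn command writing XML output next to the chunk file."""
    out_xml = Path(chunk).with_suffix(".xml")
    return ["blastn", "-query", str(chunk), "-db", db, "-outfmt", "5",
            "-out", str(out_xml), "-num_threads", str(threads)]


def run_all(commands, max_jobs=2):
    """Run the given commands, at most max_jobs at a time; return exit codes."""
    with ThreadPoolExecutor(max_workers=max_jobs) as pool:
        results = pool.map(lambda cmd: subprocess.run(cmd).returncode, commands)
        return list(results)
```

Typical use would be to run the Bowtie2 command first, call `split_fasta` on the unaligned reads with `reads_per_file=1_000_000`, then pass `[blastn_command(c) for c in chunks]` to `run_all`. On a cluster you would more likely submit each command as a separate scheduler job instead of using `run_all`.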
