Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Tophat "butterfly" option annomaly

    Hello All,

    So I reran some of my data with the new Tophat Butterfly option to see how it may of effected the output and I ended up having to stop the run due to the fact that it created an enormous temporary file (the one with the title of something like left_end_reads). My computer ran out of hard disk space. The file ended up being 472GB big. Just wondering if anyone knew if this was normal? Could anyone share their results with using this option (ie did it make a significant difference in the output)? If so, what method did you use for your library prep (eg NuGen Kits)?

    Just Curious,
    Johnathon

  • #2
    I've never used the butterfly option but anyway tophat writes massive tmp files while it's working. For me the tmp files top out at about 200 GB but maybe I have less data and I also guess that the butterfly finds and tests more search locations so the tmp files are even bigger again. My data is all standard polyA selected so if you use something that includes ncRNA the problem will get much worse.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      The Impact of AI in Genomic Medicine
      by seqadmin



      Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
      02-26-2024, 02:07 PM
    • seqadmin
      Multiomics Techniques Advancing Disease Research
      by seqadmin


      New and advanced multiomics tools and technologies have opened new avenues of research and markedly enhanced various disciplines such as disease research and precision medicine1. The practice of merging diverse data from various ‘omes increasingly provides a more holistic understanding of biological systems. As Maddison Masaeli, Co-Founder and CEO at Deepcell, aptly noted, “You can't explain biology in its complex form with one modality.”

      A major leap in the field has
      ...
      02-08-2024, 06:33 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 02-28-2024, 06:12 AM
    0 responses
    28 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-23-2024, 04:11 PM
    0 responses
    74 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-21-2024, 08:52 AM
    0 responses
    85 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-20-2024, 08:57 AM
    0 responses
    69 views
    0 likes
    Last Post seqadmin  
    Working...
    X