
  • fznajar
    replied
    Dear all,
    Can iSAAC work on mac os platform?



  • sklages
    replied
    Hello again ;-)

    We are now using Isaac3. Cool .. ;-)

    Creating indices for grch38 and grcm38 leaves some open questions:

    I have run index creation as follows (mask-width 0 is the default, I just put it there as a "reminder" for future index creation runs):

    Code:
    isaac-sort-reference \
      --output-directory iSAACindex \
      --jobs 1 \
      --mask-width 0 \
      --genome-file genome.fa
    That left me with exactly 3 files and a 1.1 TiB Temp folder:

    Code:
    -rw-rw-r-- 1 klages klages 618M 2016.08.26 01:05:08 2repeatness.8bpb.gz
    -rw-rw-r-- 1 klages klages 678M 2016.08.25 22:19:13 2uniqueness.8bpb.gz
    -rw-rw-r-- 1 klages klages 108K 2016.08.26 01:05:09 sorted-reference.xml
    drwxrwxr-x 2 klages klages 8.0K 2016.08.26 01:05:09 Temp
    make reported
    Code:
    [all]    INFO: All done!
    At least it is "packable" by isaac-pack-reference.

    hg19-packed-reference.tar.gz from BaseSpace (btw, it would be nice to have grch38/grcm38 there as well) shows:

    Code:
    -rwxr-x--- rpetrovski/aladdin 644685308 2014-11-19 21:38 2uniqueness.16bpb.gz
    -rw-r--r-- rpetrovski/aladdin 386961748 2014-11-20 13:03 neighbors-1or2-16.1bpb
    -rw-r--r-- rpetrovski/aladdin 386961748 2014-11-20 13:06 neighbors-1or2-32.1bpb
    -rwxr-xr-- rpetrovski/aladdin 3157608038 2014-11-20 12:53 genome.fa
    -rw-r--r-- rpetrovski/aladdin      48044 2014-11-20 12:54 sorted-reference.xml
    * Is that a complete and valid index??
    * Do I still need Temp for any task after index creation?
    * What are the differences compared to isaac2 indices?

    best,
    Sven



  • craczy
    replied
    In an attempt to make it easier to use Isaac2, we will make packed reference indexes for commonly used genomes available on BaseSpace. At the moment, the only 2 genomes available are hg19 and mm9. Feel free to request other genomes.

    Also, the issues and recommendations around indexing genomes are summarized on the isaac2 github wiki page "Reference Indexes".

    The link to the already indexed genomes in BaseSpace might change in the future; please refer to the wiki page on GitHub for updates.

    Hopefully, this will help.

    Come



  • GenoMax
    replied
    I had started an isaac2 index creation job for the mm9 genome (with -w 6). It has been running for a week and is still creating files in the Temp directory.



  • sklages
    replied
    Got the same just 5 minutes ago :-)

    So the default for isaac-sort-reference should be changed or, alternatively, it should always be called with '--mask-width 6'.
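
A minimal sketch of the second option, assuming the long option names used elsewhere in this thread (`--genome-file`, `--output-directory`, `--jobs`, `--mask-width`). The function name `sort_reference_cmd` is hypothetical, and it only prints the command so the mask width can be checked before starting a multi-day run:

```shell
# Hypothetical helper: print (do not run) an isaac-sort-reference command
# with an explicit mask width. Arguments: genome FASTA, output directory,
# and an optional mask width, which defaults to 6 as suggested above.
sort_reference_cmd() {
    genome=$1
    outdir=$2
    width=${3:-6}
    printf 'isaac-sort-reference --genome-file %s --output-directory %s --jobs 1 --mask-width %s\n' \
        "$genome" "$outdir" "$width"
}

# Show the command for a genome.fa in the current directory.
sort_reference_cmd genome.fa iSAACindex
```

Keeping the width as an explicit argument means the choice is recorded in the command line and does not depend on the tool's default.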



  • GenoMax
    replied
    Commands used for the final steps in a nutshell.

    Code:
    $ isaac-pack-reference -j 1 -r ./sorted-reference.xml -o ./packed-reference.tar.gz
    
    $ isaac-unpack-reference -j 1 -w 6 -i ./packed-reference.tar.gz
    The end result was a set of 64 files

    kmer-positions-32-00.dat through kmer-positions-32-63.dat
    And one

    Code:
    2uniqueness.16bpb.gz
    file.

    I have started a new isaac2 genome creation job for the MM9 genome with -w 6 option upfront.
    Last edited by GenoMax; 07-29-2015, 03:11 AM.



  • craczy
    replied
    Originally posted by GenoMax View Post
    This is not working for me:

    Code:
    tar: This does not look like a tar archive
    tar: Skipping to next header
    tar: Read 4461 bytes from ./sorted-reference.xml
    tar: Error exit delayed from previous errors
    make: *** [Temp/sorted-reference.xml] Error 2
    My mistake. Apologies. It is not the sorted-reference.xml but the tarball created by 'isaac-pack-reference':

    Code:
    rm -rf Temp
    isaac-unpack-reference -j 1 -w 6 -i packed-reference.tar.gz
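
The tar errors quoted above come from passing something other than the packed tarball to `-i`. A minimal sketch of a pre-flight check follows; the helper name `is_packed_reference` is hypothetical, and it assumes a tar that detects gzip compression automatically when listing, as GNU tar and bsdtar do:

```shell
# Hypothetical helper: succeed only if "$1" is a regular file that tar can list.
# A directory (".") or the plain sorted-reference.xml both fail this check.
is_packed_reference() {
    [ -f "$1" ] && tar -tf "$1" >/dev/null 2>&1
}

# Intended use, left commented out so the check can be tried without iSAAC installed:
# if is_packed_reference packed-reference.tar.gz; then
#     rm -rf Temp
#     isaac-unpack-reference -j 1 -w 6 -i packed-reference.tar.gz
# fi
```

The check runs before `rm -rf Temp`, so nothing is deleted when the input is not a readable archive.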



  • GenoMax
    replied
    Originally posted by craczy View Post
    The input file should be the 'sorted-reference.xml', not the current directory:

    This should work:

    Code:
    isaac-unpack-reference -j 1 -w 6 -i sorted-reference.xml
    Remember to remove the already existing Temp directory, if any

    Come
    This is not working for me:

    Code:
    tar: This does not look like a tar archive
    tar: Skipping to next header
    tar: Read 4461 bytes from ./sorted-reference.xml
    tar: Error exit delayed from previous errors
    make: *** [Temp/sorted-reference.xml] Error 2



  • craczy
    replied
    The input file should be the 'sorted-reference.xml', not the current directory:

    This should work:

    Code:
    isaac-unpack-reference -j 1 -w 6 -i sorted-reference.xml
    Remember to remove the already existing Temp directory, if any

    Come



  • GenoMax
    replied
    @sven: A new thread has been created for posts related to isaac2 genome index creation.



  • sklages
    replied
    Originally posted by GenoMax View Post
    BTW: "Temp" directory is required for the unpack-reference.
    That's funny though .. under normal circumstances I'd remove this folder as it occupies quite a lot of disk space ..



  • sklages
    replied
    Just tried,
    Code:
    isaac-unpack-reference -j 1 -w 6 -i . --dry-run
    This (basically) results in this error:
    Code:
    warning: failed to load external entity "Temp/sorted-reference.xml"
    unable to parse Temp/sorted-reference.xml
    warning: failed to load external entity "Temp/sorted-reference.xml"
    unable to parse Temp/sorted-reference.xml
    Without dry-run:
    Code:
    isaac-unpack-reference -j 1 -w 6 -i .
    tar fails:
    Code:
    tar -C Temp --touch -xvf .
    tar: .: Cannot read: Is a directory
    tar: At beginning of tape, quitting now
    tar: Error is not recoverable: exiting now
    make: *** [Temp/sorted-reference.xml] Error 2
    Even when I copy sorted-reference.xml to Temp, I get an error:

    Code:
    make[1]: Entering directory `/path/to/iSAACindexBuildDir/iSAAC2Index.32'
    make[1]: *** No rule to make target `Temp/genome.fa', needed by `/path/to/iSAACindexBuildDir/iSAAC2Index.32/genome.fa'.  Stop.
    make[1]: Leaving directory `/path/to/iSAACindexBuildDir/iSAAC2Index.32'
    make: *** [all] Error 2



  • GenoMax
    replied
    @come:

    I tried the "isaac-unpack-reference" (relevant part of the command line below)

    Code:
    $ isaac-unpack-reference -j 8 -w 6 -i .
    Resulted in this error

    Code:
    tar: .: Cannot read: Is a directory
    tar: At beginning of tape, quitting now
    tar: Error is not recoverable: exiting now
    make: *** [Temp/sorted-reference.xml] Error 2
    @sven: Can you see if it works for you?

    BTW: "Temp" directory is required for the unpack-reference.



  • sklages
    replied
    Well, I can confirm that.

    It took ~64h on a 48-core "Opteron 6176 SE" (fast local storage, RAID) to build an hg19 index.

    Code:
    isaac-sort-reference --genome-file fa_hg19/genome.fa --jobs 1 --output-directory iSAAC2Index.32 --quiet
    The result is:
    Code:
    938M 2015.07.27 06:21:35 2uniqueness.16bpb.gz
     42G 2015.07.27 06:54:45 kmer-positions-32-0.dat
     15K 2015.07.27 06:54:51 sorted-reference.xml
    8.0K 2015.07.27 06:54:51 Temp
    with 'Temp' being 1.1TiB (!) in size ... (btw, why don't you clean Temp automatically after successfully finishing a job?).
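
The 8.0K shown for Temp in the listing above is only the directory entry itself, not its contents. A small sketch for seeing the real usage per entry, using standard `du`; the function name `index_usage` is hypothetical:

```shell
# Hypothetical helper: print the on-disk size of each entry in an index directory.
# du -s sums the contents of each directory, so Temp is reported with
# its full size instead of the size of the directory entry that ls -l shows.
index_usage() {
    du -sh "$1"/*
}

# Example call, commented out because the directory name is specific to this run:
# index_usage iSAAC2Index.32
```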



  • GenoMax
    replied
    Originally posted by craczy View Post
    This looks correct, but surprising. Did you specify something like "-w 1" on the command line by any chance?
    Thanks for confirming that. I had only done this

    Code:
    $ isaac-sort-reference -g /path_to/HG19_UCSC/Sequence/WholeGenomeFasta/genome.fa -o .
    Is there a better command-line for future reference?

    Originally posted by craczy View Post
    You can use "isaac-pack-reference" and then "isaac-unpack-reference -w 6" to split the index into smaller files without having to redo the reference sorting.
    I did the isaac-pack-reference thinking that it would "compress" the index but nothing appeared to change except the date stamps.

    Update: I think I need to move the "Temp" directory out of the way (just realized that and trying it now) for "pack-reference" to work.
