Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • anoopkmr
    replied
    This tools helps me create bed files from a gene list:


    Choose bed format while downloading.

    Leave a comment:


  • Gopo
    replied
    You will have to make the BED file yourself. Here is a guide:

    Code:
    # install picard
    cd ~/bin/
    wget https://github.com/broadinstitute/picard/releases/download/2.18.10/picard.jar
    mv picard.jar picard-2.18.10.jar
    
    # index reference (Reference is AmexG_v3.0.0.fa)
    samtools faidx AmexG_v3.0.0.fa
    
    # create sequence dictionary
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CreateSequenceDictionary \
    R=AmexG_v3.0.0.fa \
    O=AmexG_v3.0.0.dict
    
    # Convert BED to interval list
    java -jar ~/bin/picard-2.18.10.jar BedToIntervalList \
    I=rfs.immunome.bed \
    O=rfs.immunome.interval.list \
    SD=AmexG_v3.0.0.dict
    
    # run CollectHsMetrics
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CollectHsMetrics \
    BAIT_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    BAIT_SET_NAME=Immunome \
    TARGET_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    METRIC_ACCUMULATION_LEVEL=SAMPLE \
    R=/ssdwork/jelber2/rfs/AmexG_v3.0.0.fa \
    I=ALL-samples.bam \
    O=ALL-samples-coverage-metrics.txt
    
    # if needed, add readgroups
    java -Xmx64g -jar ~/bin/picard.jar AddOrReplaceReadGroups \
    I=ALL-samples.bam \
    O=ALL-samples-RG.bam \
    SORT_ORDER=coordinate \
    RGPL=illumina \
    RGPU=barcode \
    RGLB=Lib1 \
    RGID=all \
    RGSM=all \
    VALIDATION_STRINGENCY=LENIENT
    
    # run CollectHsMetrics with ReadGroups added to BAM
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CollectHsMetrics \
    BAIT_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    BAIT_SET_NAME=Immunome \
    TARGET_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    METRIC_ACCUMULATION_LEVEL=SAMPLE \
    R=/ssdwork/jelber2/rfs/AmexG_v3.0.0.fa \
    I=ALL-samples-RG.bam \
    O=ALL-samples-coverage-metrics.txt
    Best,
    Gopo
    Last edited by Gopo; 08-08-2018, 10:12 PM. Reason: wrong syntax for displaying code

    Leave a comment:


  • Gopo
    replied
    [CODE]
    # install picard
    cd ~/bin/
    wget https://github.com/broadinstitute/picard/releases/download/2.18.10/picard.jar
    mv picard.jar picard-2.18.10.jar

    # reference genome is AmexG_v3.0.0.fa
    # generate fasta index
    samtools faidx AmexG_v3.0.0.fa

    # create sequence dictionary
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CreateSequenceDictionary \
    R=AmexG_v3.0.0.fa \
    O=AmexG_v3.0.0.dict

    # convert bed to interval list
    java -jar ~/bin/picard-2.18.10.jar BedToIntervalList \
    I=rfs.immunome.bed \
    O=rfs.immunome.interval.list \
    SD=AmexG_v3.0.0.dict

    # run CollectHsMetrics
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CollectHsMetrics \
    BAIT_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    BAIT_SET_NAME=Immunome \
    TARGET_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    METRIC_ACCUMULATION_LEVEL=SAMPLE \
    R=/ssdwork/jelber2/rfs/AmexG_v3.0.0.fa \
    I=ALL-samples.bam \
    O=ALL-samples-coverage-metrics.txt

    # if you need to add read groups
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar AddOrReplaceReadGroups \
    I=ALL-samples.bam \
    O=ALL-samples.RG.bam \
    SORT_ORDER=coordinate \
    RGPL=illumina \
    RGPU=barcode \
    RGLB=Lib1 \
    RGID=all \
    RGSM=all \
    VALIDATION_STRINGENCY=LENIENT

    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CollectHsMetrics \
    BAIT_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    BAIT_SET_NAME=Immunome \
    TARGET_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    METRIC_ACCUMULATION_LEVEL=SAMPLE \
    R=/ssdwork/jelber2/rfs/AmexG_v3.0.0.fa \
    I=ALL-samples.RG.bam \
    O=ALL-samples-coverage-metrics.txt
    [\CODE]

    Leave a comment:


  • Gopo
    replied
    Code:
    # install picard
    cd ~/bin/
    wget https://github.com/broadinstitute/picard/releases/download/2.18.10/picard.jar
    mv picard.jar picard-2.18.10.jar
    
    # reference genome is AmexG_v3.0.0.fa
    # generate fasta index
    samtools faidx AmexG_v3.0.0.fa
    
    # create sequence dictionary
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CreateSequenceDictionary \
    R=AmexG_v3.0.0.fa \
    O=AmexG_v3.0.0.dict
    
    # convert bed to interval list
    java -jar ~/bin/picard-2.18.10.jar BedToIntervalList \
    I=rfs.immunome.bed \
    O=rfs.immunome.interval.list \
    SD=AmexG_v3.0.0.dict
    
    # run CollectHsMetrics
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CollectHsMetrics \
    BAIT_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    BAIT_SET_NAME=Immunome \
    TARGET_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    METRIC_ACCUMULATION_LEVEL=SAMPLE \
    R=/ssdwork/jelber2/rfs/AmexG_v3.0.0.fa \
    I=ALL-samples.bam \
    O=ALL-samples-coverage-metrics.txt
    
    # if you need to add read groups
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar AddOrReplaceReadGroups \
    I=ALL-samples.bam \
    O=ALL-samples.RG.bam \
    SORT_ORDER=coordinate \
    RGPL=illumina \
    RGPU=barcode \
    RGLB=Lib1 \
    RGID=all \
    RGSM=all \
    VALIDATION_STRINGENCY=LENIENT
    
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CollectHsMetrics \
    BAIT_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    BAIT_SET_NAME=Immunome \
    TARGET_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    METRIC_ACCUMULATION_LEVEL=SAMPLE \
    R=/ssdwork/jelber2/rfs/AmexG_v3.0.0.fa \
    I=ALL-samples.RG.bam \
    O=ALL-samples-coverage-metrics.txt

    Leave a comment:


  • Gopo
    replied
    So, you will have to generate the BED file yourself based on the coordinates of the genes or exons that you are interested in.

    Code:
    # generate fasta index for genome (genome is AmexG_v3.0.0.fa)
    samtools faidx AmexG_v3.0.0.fa
    
    # create sequence dictionary
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CreateSequenceDictionary \
    R=AmexG_v3.0.0.fa \
    O=AmexG_v3.0.0.dict
    
    # convert Bed to interval list
    java -jar ~/bin/picard-2.18.10.jar BedToIntervalList \
    I=rfs.immunome.bed \
    O=rfs.immunome.interval.list \
    SD=AmexG_v3.0.0.dict
    
    # run CollectHsMetrics
    java -Xmx64g -jar ~/bin/picard-2.18.10.jar CollectHsMetrics \
    BAIT_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    BAIT_SET_NAME=Immunome \
    TARGET_INTERVALS=/ssdwork/jelber2/rfs/rfs.immunome.interval.list \
    METRIC_ACCUMULATION_LEVEL=SAMPLE \
    R=/ssdwork/jelber2/rfs/AmexG_v3.0.0.fa \
    I=ALL-samples.bam \
    O=ALL-samples-coverage-metrics.txt
    # note that you might have to add readgroups to the file ALL-samples.bam (whatever your BAM file is named) with

    Code:
    java -Xmx64g -jar ~/bin/picard.jar AddOrReplaceReadGroups \
        I=ALL-samples-recal-no-read-groups.bam \
        O=ALL-samples-recal-no-read-groups-for-callableloci.bam \
        SORT_ORDER=coordinate \
        RGPL=illumina \
        RGPU=barcode \
        RGLB=Lib1 \
        RGID=all \
        RGSM=all \
        VALIDATION_STRINGENCY=LENIENT

    Leave a comment:


  • mhmtgenc
    replied
    Originally posted by Gopo View Post
    Try CollectHSmetrics - part of Picard Tools
    For details see:
    https://broadinstitute.github.io/pic...-overview.html
    So how coul I be able to get a BED file or prepare it by myself? Could you give me a clue or a tutorial of this. Cause I have trouble on this. I know this is hard to expalin it here but can I get any file or manual for this?

    Leave a comment:


  • Gopo
    replied
    Try CollectHSmetrics - part of Picard Tools
    For details see:

    Leave a comment:


  • How to get a report like stuff of a bam file how many percent of the exons are cover

    Dear Collegues,

    Lets say I have a miseq run and have the .bam file from the squencer and I would like to know how many percent of the exons (specific genes) are coverred in these .bam files?

    Is it possible?

    Thanks in advance

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-25-2024, 11:49 AM
0 responses
19 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-24-2024, 08:47 AM
0 responses
18 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
62 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
60 views
0 likes
Last Post seqadmin  
Working...
X