Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • aby
    replied
    I converted the all.map file generated from MAQ tool to SAM format using maq2sam-long in samtools. The output file I named as Out. Now when I try to convert Out file (which is in SAM format) to BAM format using the command
    samtools view -S Out

    I get an error message that header @ not found. When I edit the Out file by manually adding @ at the top, more errors appear. What should I do? Is there a fault in the SAM file generated by the converter maq2sam ? Is there any other way to convert SAM file to BAM file? I tried 'samtool import' and that also does not work.

    Leave a comment:


  • aby
    replied
    I was trying to convert my single read illumina file to SAM format using the export2sam.pl script. It does not work and gives the following error:

    Use of uninitialized value $t[21] in string ne at export2sam.pl line 67, <$fh1> line 14063.

    Here is a sample data from my file.

    HWUSI-EAS174_0025:2:1:5:488#0/1:CGGAGAATACGCTCCCATTCCCCCNGNANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTNTGATCTTAGATCGGA:aabbbbb_baaa_a]ab`_aBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB
    HWUSI-EAS174_0025:2:1:5:1542#0/1:TGGATGCCTAGGCAATCAGAGGCGNANANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNANGTGATAAGCAGCGAA:abbbabbbbbbbbbbbbbbbBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB


    How do I resolve it ? Is there any other way?

    A.

    Leave a comment:


  • bosTau2
    replied
    It depends on regions. If a region is repetitive then you need to filter out possible duplications which can cause artificial hetero SNPs. "Repetitive" here means kmer uniqueness --- how many times a given kmer (30mer for example) can be found in a reference.

    Leave a comment:


  • sbaheti
    replied
    Hi lh3

    Originally Posted by xguo
    I got a list of candidate SNPs using BWA and samtools for RNASeq data, and am trying to weigh various filtering options. "samtools.pl varFilter" gives a list of filtering criterion with default setting. The maximum read depth is set at 100. Given that duplicate reads have been removed by "samtools rmdup", do I still need to limit the maximum read depth?

    replied by lh3:
    Yes, you need this unless you are doing target sequencing in which case the read depth is expected to vary a lot.

    My question

    If we are doing variant calling in exome capture analysis, do i need to limit the max read depth when using samtool varFilter tool?

    Leave a comment:


  • wuhoucdc
    replied
    Dear All,

    Do you know if samtools can perform multiple commands (>2) together? Here assuming that I have ten BAM files (result001.bam, result002.bam,......result010.bam) and want to merge them first and then sort and index them, the last step I hope is to extract the data for chromosome 1 (chr1), how can I edit the samtools command? I did it like this:
    samtools merge result.bam result001.bam result002.bam ............result010.bam |\
    samtools sort - result | \
    samtools index result.bam | \
    samtools view result.bam chr1 > resultchr1.bam

    Is it right?

    Thank you very much!

    Wu

    Leave a comment:


  • bosTau2
    replied
    Is there sam2maq (sam to maq map format)? I am testing simulated data and like to convert ssaha2 alignment to maq output so that I can analyze them in maq. I cerated all the simu data in maq.
    Thank you.

    Leave a comment:


  • KevinLam
    replied
    There was a discussion on how the CS tag should be generated in sam files according to the specs. Is there a consensus on how it is to be done?

    I have to write a script to append the CS tag info to the BWA alignment of SOLID reads. I am hoping to make it as painless as possible.

    Leave a comment:


  • KevinLam
    replied
    Originally posted by win804 View Post
    Thanks Li Heng. I just want to confirm that nothing is wrong with the sorted bam file.

    Thanks a lot.
    You may convert to sam and count the lines as a proxy for checking.
    but yes as lh3 mentioned sorted files compress better due to the compression algorithm

    Leave a comment:


  • gsjlucky
    replied
    maq2sam-long

    Originally posted by lh3 View Post
    maq2sam-short is for the .map files generated by maq-0.6.x, while maq2sam-long for files generated by maq-0.7.x. Sorry for the confusion, and one of the aims of SAM is to avoid such confusions in future.
    maq2sam <in.map> [<readGroup>], I want know how to use the option parameter 'readGroup',can it add library info from map to sam?

    Leave a comment:


  • glacierbird
    replied
    Originally posted by lh3 View Post
    @corthay

    You can convert with "blast2sam.pl -s" to save the sequence in SAM. Currently, samtools cannot parse SAM without sequence, although the specification allows this.

    Hi Li,
    I parse .sam with sequence, but samtools view still gave such error msg. Could you please take a look of the post:


    Thanks.

    Leave a comment:


  • win804
    replied
    Originally posted by maubp View Post
    You already asked this on a separate thread
    http://seqanswers.com/forums/showthread.php?t=5684
    Yes, I wanted to delete the previous thread before, however, I have no idea of how to do it. Any idea?

    Thanks.

    Leave a comment:


  • win804
    replied
    Thanks Li Heng. I just want to confirm that nothing is wrong with the sorted bam file.

    Thanks a lot.

    Leave a comment:


  • maubp
    replied
    Originally posted by win804 View Post
    Hi All,
    Is "sorted" BAM file smaller in size compare to unsorted BAM file?
    If that's the case, why is that so?...
    You already asked this on a separate thread

    Leave a comment:


  • lh3
    replied
    Sorted files are compressed better.

    Leave a comment:


  • win804
    replied
    Hi All,
    Is "sorted" BAM file smaller in size compare to unsorted BAM file?
    If that's the case, why is that so?

    I sort a lot of BAM files using the samtools, with this command:
    samtools sort chr1-aligned.bam chr1-aligned.sorted

    file size of chr1-aligned.bam ==> 353,618,735 bytes
    but the file size of chr1-aligned.sorted.bam ==> 295,208,534 bytes

    I have checked for all my unsorted and sorted bam files. All of the sorted bam files are smaller in size compare to the unsorted ones.

    Thanks.

    Leave a comment:

Latest Articles

Collapse

  • SEQadmin2
    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
    by SEQadmin2


    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

    Here are nine questions we think about, in roughly the order they matter, before...
    06-18-2026, 07:11 AM
  • SEQadmin2
    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
    by SEQadmin2


    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
    ...
    06-02-2026, 10:05 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by SEQadmin2, 06-17-2026, 06:09 AM
0 responses
40 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-09-2026, 11:58 AM
0 responses
102 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-05-2026, 10:09 AM
0 responses
123 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-04-2026, 08:59 AM
0 responses
114 views
0 reactions
Last Post SEQadmin2  
Working...