Is there a command to output the kmers of each sequence in a multifasta file?
Seqanswers Leaderboard Ad
Collapse
X
-
Trouble parsing header
Dear BBMap team:
I tried to use filterbytile.sh to remove the reads with low quality, but I encountered an error message saying that there was a trouble parsing the header. I've read the description of the script and Brian Bushnell said that was possible when the reads were renamed (such as in SRA) and to contact him if such error happened.
I downloaded the sequencing data (SRA) from ncbi and used fastq-dump to get the fastq files. I wonder if there is a solution to this?
Thank you very much!
Rose
Comment
-
-
BBsketch alltoall is incomplete
Can I ask a question about bbsketch?
I want to compare the ANI between many genomes (1000+) to each other.
I did
Code:bbsketch.sh perfile genome_folder/*.fasta out=sketch.gz k=31,24 threads=16 comparesketch.sh alltoall sketch.gz k=31,24 prealloc=0.75 format=3 threads=16 out=table.tsv
Code:Set threads to 16 Loading sketches. Loaded 1157 sketches in 59.541 seconds. Total Time: 59.784 seconds.
Code:Set threads to 16 Loading sketches. Executing kmer.KmerTableSet [ways=31, tabletype=10, prealloc=0.75] Initial size set to 45218398 Initial: Ways=31, initialSize=45218398, prefilter=f, prealloc=0.75 Memory: max=91268m, total=91268m, free=90848m, used=420m 3.713 seconds. Indexed 2880884 unique and 10513099 total hashcodes. Loaded 1157 sketches in 8.457 seconds. Ran 1225005 comparisons in 9.344 seconds. Total Time: 17.801 seconds.
- Genomes are highly similar.
#Query Ref ANI QSize RefSize QBases RBases QTaxID RTaxID KID WKID SSU
genome1.fasta genome2.fasta 94.223 1984118 1796930 1987598 1797650 -1 -1 24.952 27.523 .
- It is not simply due to the naming: I neither find "genome1 vs genome2" nor "genome 2 vs genome1"
Any idea?
Comment
-
-
I'm trying to use BBmap to find all perfect hits or hits with an indel length 1.
Code:bbmapskinner.sh in=kmer.fasta out=result.sam ambiguous=all strictmaxindel=1
Is there something that I am doing wrong?
Comment
-
Latest Articles
Collapse
-
by seqadmin
The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...-
Channel: Articles
Today, 11:48 AM -
-
by seqadmin
This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.
The Headliner
The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...-
Channel: Articles
03-03-2025, 01:39 PM -
-
by seqadmin
The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...-
Channel: Articles
02-24-2025, 06:31 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 03-20-2025, 05:03 AM
|
0 responses
26 views
0 reactions
|
Last Post
by seqadmin
03-20-2025, 05:03 AM
|
||
Started by seqadmin, 03-19-2025, 07:27 AM
|
0 responses
33 views
0 reactions
|
Last Post
by seqadmin
03-19-2025, 07:27 AM
|
||
Started by seqadmin, 03-18-2025, 12:50 PM
|
0 responses
25 views
0 reactions
|
Last Post
by seqadmin
03-18-2025, 12:50 PM
|
||
Started by seqadmin, 03-03-2025, 01:15 PM
|
0 responses
190 views
0 reactions
|
Last Post
by seqadmin
03-03-2025, 01:15 PM
|
Comment