Hey, anyone here experienced with HTSJDK?
I'm currently using it to read VCF files (bi-allelic, microarray-derived), grab the genotypes for all individuals, and transform them into a byte array, storing each genotype as the sum of its allele indices (so ./. > -2, 0/0 > 0, 0/1 > 1, 1/1 > 2, etc.). Libraries in other languages (cyvcf2, etc.) often provide genotypes natively as indices. With HTSJDK, however, calling getGenotypes() on a VariantContext returns the genotypes in base form (something like [A*, C], [A*, A*], [C, C], where each allele is its own object). Currently I'm using getAlleleIndices() from VariantContext to convert the alleles of each Genotype back into an array of indices, which I can then sum. This is a bit slower than I'd like. Is there a better or faster way of doing this than calling getAlleleIndices() to transform each Genotype into its indices?
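To make the encoding concrete, here is a minimal plain-Java sketch of what I'm computing. It deliberately does not use HTSJDK: alleles are plain strings, and the class and method names (GenotypeEncoding, encode, encodeBiallelic) are made up for illustration. The second method shows a shortcut that should only hold for bi-allelic sites: the index sum equals the number of non-reference alleles, so each allele can be compared against the reference instead of being looked up in an index list. I assume the same check could be done in HTSJDK with Allele.isNoCall() and Allele.isReference(), but I have not benchmarked that.

```java
class GenotypeEncoding {
    // Index assigned to a missing allele call ("."), so that ./. sums to -2.
    static final int NO_CALL = -1;

    // Sum the allele indices of one genotype, e.g. {0, 1} gives 1.
    static byte encode(int[] alleleIndices) {
        int sum = 0;
        for (int idx : alleleIndices) {
            sum += idx;
        }
        return (byte) sum;
    }

    // Bi-allelic shortcut (assumption: exactly one REF and one ALT allele).
    // Every called allele that is not the reference must be the ALT (index 1),
    // so no index lookup is needed.
    static byte encodeBiallelic(String[] calledAlleles, String ref) {
        int sum = 0;
        for (String allele : calledAlleles) {
            if (allele.equals(".")) {
                sum += NO_CALL;      // missing call contributes -1
            } else if (!allele.equals(ref)) {
                sum += 1;            // ALT allele contributes 1
            }                        // REF allele contributes 0
        }
        return (byte) sum;
    }
}
```

For example, encodeBiallelic(new String[]{"A", "C"}, "A") corresponds to 0/1 and encodeBiallelic(new String[]{".", "."}, "A") corresponds to ./. in the scheme above.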