Hi all,
I don't have a bioinformatics background (coming from machine learning), just started to work on a project that aims to demonstrate how big data tools (eg hadoop) can be used in this space (quite general description). I found that these tools are already widely used in alignment and assembly. So I'm looking into SNP analysis and retrieval, just wondering if someone can suggest me something that would benefit from faster analysis and retrieval (e.g. more complex SNP database queries?).
many thanks for your help.
I don't have a bioinformatics background (coming from machine learning), just started to work on a project that aims to demonstrate how big data tools (eg hadoop) can be used in this space (quite general description). I found that these tools are already widely used in alignment and assembly. So I'm looking into SNP analysis and retrieval, just wondering if someone can suggest me something that would benefit from faster analysis and retrieval (e.g. more complex SNP database queries?).
many thanks for your help.