Skymap is a standalone database that aims to offer:
1. a single data matrix for each omic layer and each species, spanning >200k sequencing runs from all the public studies, built by reprocessing petabytes of sequencing data.
2. a biological metadata file that describes the relationships between the sequencing runs and contains the keywords extracted from over 3 million free-text annotations using NLP.
3. a technical metadata file that describes the relationships between the sequencing runs.
All of these can fit on your personal computer.
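To give a rough idea of how a data matrix and the biological metadata could be used together, here is a minimal sketch. Everything in it is an assumption made up for illustration: the toy CSV contents, the column names (gene, run, keyword), the run IDs, and the helper functions. The actual Skymap files may be stored and named differently, so check the README for the real formats.

```python
import csv
import io

# Hypothetical toy inputs; the real Skymap file formats and column names may differ.
MATRIX_CSV = """gene,SRR0001,SRR0002,SRR0003
TP53,5.0,7.5,1.0
GAPDH,100.0,90.0,110.0
"""

BIO_META_CSV = """run,keyword
SRR0001,liver
SRR0002,brain
SRR0003,liver
"""


def runs_with_keyword(meta_csv, keyword):
    """Return the run IDs whose extracted keyword equals `keyword`."""
    reader = csv.DictReader(io.StringIO(meta_csv))
    return [row["run"] for row in reader if row["keyword"] == keyword]


def slice_matrix(matrix_csv, runs):
    """Return {gene: [values for the selected runs]}, in the order of `runs`."""
    reader = csv.DictReader(io.StringIO(matrix_csv))
    return {row["gene"]: [float(row[r]) for r in runs] for row in reader}


# Pick the runs annotated with a keyword, then pull their columns from the matrix.
liver_runs = runs_with_keyword(BIO_META_CSV, "liver")
liver_expr = slice_matrix(MATRIX_CSV, liver_runs)
print(liver_runs)
print(liver_expr)
```

The point is only the pattern: select sequencing runs by keyword in the metadata, then slice the matching columns out of the matrix.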
Github: https://github.com/brianyiktaktsui/S...ster/README.md
Preprint: https://www.biorxiv.org/content/early/2018/08/07/386441
Data: https://www.synapse.org/#!Synapse:syn11415602/files/
Fun blog post about why I decided to do this project: https://brianyiktaktsui.wordpress.co...ect-a-preview/
Please leave feedback and comments. This started as a side project, but it turns out a lot of people find it useful
(https://twitter.com/strnr/status/1026822778673156097). I just talked with folks from the GenePattern team; if there are enough use cases and interest, we might make an effort to integrate, maintain, and improve this database.