This looks like a good start in the right direction for personal genomes.
Our genomes, unzipped
11/10/2010
Categories: Uncategorized
Written by Daniel MacArthur
When we launched this website back in June, I welcomed readers with a promise that Genomes Unzipped would “ultimately be much more than just a group blog”. Indeed, the last four months of blogging have really just been a prelude of sorts to what comes next: the real Genomes Unzipped.
Today we’re launching an exciting new phase of the project. Although we’re not entirely sure where this journey will take us, we’re looking forward to finding out – and to bringing you along with us.
What are we doing?
Over the last year, all the members of Genomes Unzipped have had genome scans performed by personal genomics company 23andMe; several of us have also had additional tests done by other genetic testing companies (Counsyl, deCODEme). From today, we’ll be making all of our raw genetic data and the reports generated from these tests freely available online. As the project proceeds, we aim to obtain data from an ever larger array of tests – ultimately extending to whole-genome sequencing – and release it openly. Right now you can freely download the 23andMe data from everyone in the project from this website.
Over the next few weeks, each of the members will be writing about their own experiences with genetic testing, and what they’ve learnt from their own genetic data. We’ll be discussing analyses we’ve performed on our own raw data, using software written both by group members and other collaborators; and we’ll be releasing the code for that software in our new code repository. We’ll also be talking about the process of deciding to release our genetic data publicly, and how we discussed this decision with our families.
To make it easier for us (and you) to explore our genomes, we have assembled a custom genome browser using JBrowse – this provides a visual interface that allows our 23andMe (and later, complete sequence) data to be viewed in the context of genes and other features. It’s still in prototype form, but we’ll be refining it and adding more data as the project proceeds.
Link:
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Some companies or institutions are getting considerable data somewhere.
Complete Genomics Sequences Over 300 Human Genomes in Q3, Order Backlog Grows to 800
October 05, 2010
According to an amended S-1 form filed with the Securities and Exchange Commission this week, the company had an order backlog of more than 800 human genomes, an increase of 300 genomes compared to mid-July.
Leave a comment:
-
Originally posted by krobison View PostA little googling around revealed that you can access many of the published personal genomes on Penn State's modification of the UCSC Genome Browser; perhaps you could snag some more Y chromosome SNPs there.
For example, try entering the coordinates chrY:2,700,000-9,000,000
Make sure you have the "Individual Genotypes" track on under "Personal Genomes"
Certainly Hammer, Karafet, and Underhill are major players in this endeavor. Cruciani has been the biggest contributor in the study of the M35 haplogorup which our website studies.
Our dataset in the M35 project I think could be matched up against any data set in the world for this group. To use our group as a base before whole genome testing on the subset of the Y chromosome seems to make sense to me. Lets take a small bite of the apple without trying to swallow the whole apple in one bite. If we can deal with this 60 to 80 million base pairs and develop the tools to compare the data this might help in the larger picture. This would seem to be a good proving ground for sequencing and tool development. But first things first we need to sequence the data on the Y chromosome. I believe we have the numbers, resources to finance and the necessary releases of private information to make a go of this from our members.
I appreciate your insights and information. Maybe I am ahead of the power curve on this issue. But this M35 group and project has been a leader in breaking new ground in this area of study. Since the V13 snp discovery and paper by Cruciani we have discovered 11 new subclades for V13 alone in this one subclade of M35. We have broken new ground and wish to continue to do so and think big.
This is the browser link they use at this site.
Link to Consortium website:
Last edited by KerryOdair; 10-05-2010, 11:59 AM.
Leave a comment:
-
A little googling around revealed that you can access many of the published personal genomes on Penn State's modification of the UCSC Genome Browser; perhaps you could snag some more Y chromosome SNPs there.
For example, try entering the coordinates chrY:2,700,000-9,000,000
Make sure you have the "Individual Genotypes" track on under "Personal Genomes"
Leave a comment:
-
The buzz from her talk in the spring is that she gives quite an informative & entertaining talk.
Leave a comment:
-
Originally posted by KerryOdair View PostHere is a teenager with more ambitious aspirations than my small look of 80 million BP's on the Y Chromosome. Great article and a must read.
CUPERTINO, Calif.—In many ways, Anne West is a typical 17-year-old California teenager. She wears her hair long. She likes to hang out with her friends. She went to the prom.
She is also analyzing her family's genome.
Having being diagnosed with a pulmonary embolism in 2003, Anne's father John decided last year to get the family's genes sequenced. The process involves an advanced technology that spews out the six billion letters that represent the makeup of a person's genetic code. But after putting up $160,000 to get the four-member family tested, the Wests realized something: sifting through the reams of data was tougher than they ever imagined.
http://online.wsj.com/article/SB1000...l?mod=ITP_TEST
I'll have to think on it a little more, but I would love to collaborate with her and her family to open up the dataset to SEQanswers users. Perhaps I could secure or fund a donation of some compute resources for the product.
Thanks for posting it Kerry!
edit: Looks like we'd be a little late to the party...
She is now at work on a paper based in part on her family's data, with researchers from a Seattle institute. Last month, she was a speaker on a panel at a personal-genomics conference held at Cold Spring Harbor, New York, a scientific mecca.
Leave a comment:
-
Here is a teenager with more ambitious aspirations than my small look of 80 million BP's on the Y Chromosome. Great article and a must read.
CUPERTINO, Calif.—In many ways, Anne West is a typical 17-year-old California teenager. She wears her hair long. She likes to hang out with her friends. She went to the prom.
She is also analyzing her family's genome.
Having being diagnosed with a pulmonary embolism in 2003, Anne's father John decided last year to get the family's genes sequenced. The process involves an advanced technology that spews out the six billion letters that represent the makeup of a person's genetic code. But after putting up $160,000 to get the four-member family tested, the Wests realized something: sifting through the reams of data was tougher than they ever imagined.
Leave a comment:
-
Future Repositories and Standards Bodies: Guidance
In addition to the International Society of Genetic Genealogy, I would like to suggest NARA (US National Archives and Records Administration) as a potential source of guidance and standards development for de-centralized or centralized genealogical repository of DNA sequences. This is because the emergent majority of digital archive users at US NARA and other traditional archival records facilities are genealogists and family historians. (Millions of users yearly!)
Analyzing archives and finding facts: use and users of digital data records Margaret O’Neill Adams - Archival Science, 2007 Vol 7 No. 1 21-36
Relocating Meaning in Heritage Archives: A Call for Participatory Heritage Databases. Angela M. Labrador and Elizabeth S. Chilton. Computer Applications in Archaeology. Annual Meeting Proceedings 2009
Leave a comment:
-
Complete Genomics has stated very clearly that they are in the game only to do complete human genome sequencing. The Y has limited medical relevance (the obvious example is azoospermia due to deletions on Y) and in any case they've decided to stay out of the retail genomics game. Right now, the assumption is that you would need to assume regulatory issues unless you could really prove otherwise -- a headache they wish to stay away from.
Companies such as Ion Torrent (now into LifeTech) are in the game to make machines, not do much in the way of specific sequencing. PacBio would be the same issue. PacBio+RainDance or Fluidigm may make a very interesting combo for your application (probably also the other targeted sequencing schemes as well), but I still think you'll have trouble getting the cost to where you need it.
I think the quick answer right now is that at the moment there isn't a good solution in the cost you are looking for. If you could batch a large number of samples, then RainDance might be a reasonable option to do the targeting. In any case, I think you will find it challenging to go below $500 total cost per sample with a service provider, and that may even be a bit low-ball.
If I were in your shoes, particularly if not with a lot of funds, I'd focus on mining the available genomes & 1000 genome data to get a much richer set of SNPs. If you look around here, there is another thread on how to access consolidated SNP info from 1K genomes. These could be converted into one of the typical cheap SNP-typing formats & then you could get a richer set to type lots of genomes on cheap array/PCR platforms.
In any case, you probably should find an academic somewhere to collaborate with, as all of these companies will probably be more comfortable doing that than with a private citizen.
Leave a comment:
-
Originally posted by nilshomer View PostDisclaimer:
The World Personal Genome Registry is a website created by Illumina for the individual genome sequencing space that allows the community to keep track of the current status of personal whole-genome sequencing. We plan to transfer this registry to an appropriate standards body...
Leave a comment:
-
Originally posted by KerryOdair View Post
The World Personal Genome Registry is a website created by Illumina for the individual genome sequencing space that allows the community to keep track of the current status of personal whole-genome sequencing. We plan to transfer this registry to an appropriate standards body...
Leave a comment:
-
-
I had hoped Complete Genomics would be an excellent candidate for this type of application. Below is the frustration looking into this kind of commercial test. The Y Chromosome has no known medical implications that I am aware of. So regulatory compliance seems to be a non issue in my mind, not to mention we should have right over our own dna. There is no need for consulting services that you run into in the 23andMe world as far as regulators are concerned. This reply was sent to me in June of 2009. I also contacted Ion Torrent but since they have been bought out by Life Tech, it is unclear to me if this will speed up or hinder development of this platform. Life Tech may also influence an increase in price for the instrument after the buyout. In terms of privacy this issue has been handled by companies already with an exclusion if you so desire. However, most people seeking this kind of testing are more than willing to share their personal information for matches. PacBio still remains an unknown at this point in time. Existing 2nd generation machines could also possibly do the job right now. My expertise on these machines is lacking to know if that is the case.
There is much to be learned from a detailed y tree. From the academic world the flow of information is slow and the peer review process is a lumbering elephant trying to keep up with dramatic change.
Dear Mr. O'Dair,
Thanks for your interest in Complete Genomics. *While your project sounds very interesting, Complete Genomics does not plan on sequencing partial portions of the genome. Our current plans are to focus exclusively on providing sequencing of the complete human genome - all 6B bases.
Complete Genomics is also providing our sequencing service for research purposes only to bio-pharma companies and genome centers/research organizations. The main reason we aren't sequencing genomes for individuals is because that type of service requires legal consents, consulting services, and privacy and regulatory compliance processes such as CLIA, etc. that Complete Genomics does not have. We have no plans in 2009 to obtain such certification and hence will not be able to sell directly to individuals.
Regards,
Jennifer Turcotte
Complete Genomics, Inc.
VP of Marketing
Leave a comment:
-
Originally posted by krobison View PostHave you looked in the 1000 Genomes data at chrY? Perhaps they have some useful variants there - -quite a lot of data is online. How many new SNPs for Y showed up in the Bushman or Asian whole genome sequences?
1000 Genomes Project: Y Chromosome SNPs
Luke Jostins, Qasim Ayub, Yali Xue, Chris Tyler-Smith
Abstract
• Y chromosome SNPs were called from the 1000 Genomes data, and numerousfilters were applied
• A total of 2870 sites were called as variable in the 77 samples, of whicharound 75% are novel
• 30 sites that passed all filters were re-sequenced using capillarysequencing, giving an estimated false positive rate of 3.3%
• Known HapMap variants and variants from the Y haplogroup tree were used toestimate the sensitivity. This gave 22% for singletons and doubletons and63% for variants with a non-reference allele count of three or greater
• We use the sensitivity to estimate a polymorphism rate of 1 variant per2350bp
• HapMap genotypes and Pilot 1/Pilot 2 concordance was used to estimate aper-genotype error rate for Q10 non-reference bases of under 1%
Vince Tilroe
Many established SNPs may have been screened out by their quality "filters" (proximity to other SNPs, for example).
They had 77 men and a total of 188 million bp attributed to the Y, which means that on average only 2,400,000 bp per Y was sequenced: that's just 10%, roughly.
And these genomes have VERY low sequencing coverage (just 1.94x on average), which is one reason the Y-SNP outcomes are so weak. While the HapMap samples covered a decent array of haplogroups, just 2870 usable Y-SNPs found is not a lot given the total amount of sequencing done here.
It's interesting that even with this sparse data they point out the youthfulness of R1b1b2 ("New insights into recent human evolution can also be gained from the branch lenghts; for example, the short internal branch lengths within the haplgroup R1b relative to the other haplogroups suggest a recent expansion of this European haplogroup.)
Yet it also apparent from Figure 4 that if we are going to get deep insights into intra-haplogroup structure (not to mention a truly
precise SNP-based molecular clock) we are going to need much better Y- chromosome sequencing than was done here.
Vicent Vizachero
Many established SNPs may have been screened out by their quality "filters" (proximity to other SNPs, for example).
But I think the more likely explanation is the low quality of Y- chromosome sequencing. They only got 188 million bp out of 77 guys, which amounts to just 2.4 million bp on average. That's only 10% of the full Y, and only maybe 20% of the "sequencable" Y. And some of the men were only sequenced at low depth (<3x), so sequencing errors could also have an impact.
Vicent Vizachero
Originally posted by krobison View PostFor the sort of high-throughput approach you are describing, one of the targeted sequencing technologies could make sense. For example, you might try designing a RainDance or OLink primer library to amplify the Y. Or, perhaps a SureSelect/Nimblegen approach.
Originally posted by krobison View PostA big question would be what cost per sample are you really willing to take on and how many samples? That would really affect the choice of technology.Last edited by KerryOdair; 09-18-2010, 08:18 AM.
Leave a comment:
-
Originally posted by krobison View PostFirst off, what's wrong with the sequence we already have? Y is not a total wasteland.
Discover your DNA story and unlock the secrets of your ancestry and genealogy with our autosomal DNA, Y-DNA and mtDNA tests.
Originally posted by krobison View PostOne group has reported specifically capturing a specific mammalian chromosome by flow cytometry & getting sequences highly enriched for the targeted chromosome.
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...-
Channel: Articles
11-06-2024, 07:24 PM -
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 11-08-2024, 11:09 AM
|
0 responses
199 views
0 likes
|
Last Post
by seqadmin
11-08-2024, 11:09 AM
|
||
Started by seqadmin, 11-08-2024, 06:13 AM
|
0 responses
148 views
0 likes
|
Last Post
by seqadmin
11-08-2024, 06:13 AM
|
||
Started by seqadmin, 11-01-2024, 06:09 AM
|
0 responses
80 views
0 likes
|
Last Post
by seqadmin
11-01-2024, 06:09 AM
|
||
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks
by seqadmin
Started by seqadmin, 10-30-2024, 05:31 AM
|
0 responses
26 views
0 likes
|
Last Post
by seqadmin
10-30-2024, 05:31 AM
|
Leave a comment: