Seqanswers Leaderboard Ad

**ECO** · 02-07-2012, 08:49 PM

What are you trying to do? No one can answer your broad question as phrased.

**adaptivegenome** · 02-07-2012, 08:52 PM

Good starting point:

Ten Simple Rules for Getting Help from Online Scientific Communities

http://www.ploscompbiol.org/article/info:doi%2F10.1371%2Fjournal.pcbi.1002202

Internet,Archives,Web-based applications,Computer software,Careers,Grammar,Human learning,Scientists

**abhagwat** · 02-08-2012, 08:31 AM

We are trying to map regions of the human genome that contain a certain DNA modification. We have developed a method to pull down DNA containing such adducts and would like to determine where these pieces come from. Also, we are trying to correlate the location information to transcription activities in those regions. The RNA part is relatively straightforward as there are a lot of "expression chips" available. The DNA part is confusing because whole genome NextGen sequencing may be an overkill here. Most likely the adducts are concentrated in only 1-3% of the genome. I am afraid of being deluged with a lot of useless information which will take huge amount of bioinformatic effort to weed through. We considered oligo arrays (ENCODE and tiling), but they have their own disadvantages. ENCODE arrays cover too little of the genome while tiling arrays cover too much again creating the same problem as NextGen sequencing. My other problem so far is that most University centers seem to do NexGen sequencing over ChIP-on-chip and other assays and do not always provide good software support. My general feeling is that they are just technicians without a detailed understanding of what they are doing. I have been calling up chip manufacturers, sequencing service providers as well as genomics center operators for the past two weeks and getting nowhere.

**adaptivegenome** · 02-08-2012, 03:37 PM

Originally posted by abhagwat View Post

We are trying to map regions of the human genome that contain a certain DNA modification. We have developed a method to pull down DNA containing such adducts and would like to determine where these pieces come from. Also, we are trying to correlate the location information to transcription activities in those regions. The RNA part is relatively straightforward as there are a lot of "expression chips" available. The DNA part is confusing because whole genome NextGen sequencing may be an overkill here. Most likely the adducts are concentrated in only 1-3% of the genome. I am afraid of being deluged with a lot of useless information which will take huge amount of bioinformatic effort to weed through. We considered oligo arrays (ENCODE and tiling), but they have their own disadvantages. ENCODE arrays cover too little of the genome while tiling arrays cover too much again creating the same problem as NextGen sequencing. My other problem so far is that most University centers seem to do NexGen sequencing over ChIP-on-chip and other assays and do not always provide good software support. My general feeling is that they are just technicians without a detailed understanding of what they are doing. I have been calling up chip manufacturers, sequencing service providers as well as genomics center operators for the past two weeks and getting nowhere.

This makes lots more sense. It is a shame you have had bad experiences with genome centers. I trained at the BCM genome center and it was a great experience. We frequently developed new techniques, etc. The bioinformatics was top-notch...

I think that nextgen can help you as your project is a variation of capture sequencing. Here are some questions though before we can figure if it will help:

1. How long are the DNA fragments you pull down?
2. How many are there? How much actual nucleotide sequence will you get in the end.

Knowing these two things can help you pick a platform and the bioinformatics for what you want to do is straightforward. If there is not someone nearby you then you can PM me and I can probably help.

The alternative if you have not so much DNA and short fragments is to just clone them into a vector and sequence by Sanger.

**abhagwat** · 02-08-2012, 07:52 PM

Originally posted by genericforms View Post

This makes lots more sense. It is a shame you have had bad experiences with genome centers. I trained at the BCM genome center and it was a great experience. We frequently developed new techniques, etc. The bioinformatics was top-notch...

I think that nextgen can help you as your project is a variation of capture sequencing. Here are some questions though before we can figure if it will help:

1. How long are the DNA fragments you pull down?
2. How many are there? How much actual nucleotide sequence will you get in the end.

Knowing these two things can help you pick a platform and the bioinformatics for what you want to do is straightforward. If there is not someone nearby you then you can PM me and I can probably help.

The alternative if you have not so much DNA and short fragments is to just clone them into a vector and sequence by Sanger.

Ans.
1. The expected average size is ~250 bp (4 bp restriction enzyme), but we can make it a bit longer.
2. Don't know how many fragments there are. On an agarose gel it's a smear.

**adaptivegenome** · 02-08-2012, 08:21 PM

So I would first start with isolating the smear (I assume you would run it off the gel or do a gel extraction). There is a minimum amount of DNA necessary for sequencing on nextgen instruments. Then I think for 1%-3% of the genome I would vote for an Illumina GAII run (HiSeq is overkill) or maybe Ion Torrent. Depends on what you have available to you.

If you do not have the minimum DNA and even if you do, I would probably start with the easy experiment -- add some A/T overhangs to your product, TOPO clone, and Sanger sequence. See if you are getting sequences you expect and if it is useful. Perhaps this will be enough data. If not it makes a good validation set to confirm things are working when you move to a high throughput sequencing method.

I hope this helps. Feel free to ask more specific questions about any of the steps... (and the above is just my thoughts; I'm sure others might have more ideas for you)

**ETHANol** · 02-09-2012, 01:22 AM

Yes, the more time you spend construction your post and more specific your questions are the better the answer you will get here. And if your question is well written you will get some good answers.

As far as the wet-lab side, assuming you will use the HiSeq I would take a look at my ChIP-seq and MeDIP-seq protocols. If you have to denature the DNA prior to enrichment for you DNA marker then go with the MeDIP-seq protocol. If you do the DNA marker enrichment on double stranded DNA then follow the ChIP-seq library protocol. You would have to increase or decrease the adapter concentration to the amount of DNA you have but that's about it.

1-3% of the genome is a lot of the genome so NGS is not overkill especially in that you can barcode essentially as many samples as you like and fit them into one lane on the HiSeq.

The protocols:

MeDIP-seq protocol with TruSeq adapters

http://ethanomics.wordpress.com/medip-seq-protocol-with-truseq-adapters/

MeDIP-seq protocol with TruSeq adapters

ChIP-seq library construction using the Illumina TruSeq Adapters

http://ethanomics.wordpress.com/chip-seq-library-construction-using-the-illumina-truseq-adapters/

A few more updates to the protocol. This version is working really well. I have increased the PEG concentration used with the AMPure beads to increase the recovery of the smallest DNA fragments. …

**NextGenSeq** · 02-09-2012, 09:54 AM

Note be careful using restriction enzymes in next gen sequencing. These create libraries which have the same sequence motif which means when they are sequenced all the clusters have the same sequence in that region of the library. This leads to difficulties in resolving the clusters and poor sequence quality in those regions.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

New to Next Gen

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News