
**JohnK** · 02-09-2011, 08:35 AM

what gene model are you using?

**pasta** · 02-09-2011, 08:46 AM

John,
We used YACOP which uses several ORF finders: Critica, Glimmer and Z-curve.

**JohnK** · 02-09-2011, 09:11 AM

It could be a number of things worth investigating, including: PCR duplicate removal (dependent on the number of PCR cycles you did); 5'/3' bias from the method used to create your cDNA library, which can also arise during fragmentation of your isolated mRNA; an unannotated gene missing from your gene model (maybe try something like RefSeq, or make sure all the transcripts in your gene model are present); or a repetitive region upstream of your gene that caused read-mapping difficulties.

**Richard Finney** · 02-09-2011, 09:12 AM

In many bacterial genomes, the genes are quite tightly packed on to the genome.
Check out http://microbes.ucsc.edu/cgi-bin/hgTracks to see for yourself.

It is possible that they are genes. You may have to get that sequence (area in question) and tune down the parameters to see if they match a domain using your favorite motif finding software. You might run a blast to see if there's homology to another organism.

Another possibility is that they are regulatory elements.

The region may not be unique. Check the bwa flags for the reads for more insight. I guess, in bacteria, two "snp" values might tell you there's a dupe.
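As a rough illustration of checking the bwa flags, here is a minimal Python sketch that counts how many mapped reads in a SAM file look ambiguously placed. It assumes the alignments came from `bwa aln`/`samse`, which writes the optional tags `XT` (`U` for unique, `R` for repeat) and `X0` (number of best hits); other aligners or bwa modes may not emit these tags, and in that case only the MAPQ 0 check applies. The function names and the toy reads are made up for the example.

```python
def parse_sam_line(line):
    """Return (flag, mapq, tags) for one SAM alignment line."""
    fields = line.rstrip("\n").split("\t")
    flag = int(fields[1])
    mapq = int(fields[4])
    tags = {}
    # Optional fields start at column 12 and look like TAG:TYPE:VALUE.
    for opt in fields[11:]:
        tag, _type, value = opt.split(":", 2)
        tags[tag] = value
    return flag, mapq, tags


def is_ambiguous(mapq, tags):
    """Treat a read as ambiguous if bwa flagged it as a repeat (XT:A:R),
    reported more than one best hit (X0 > 1), or assigned MAPQ 0."""
    if tags.get("XT") == "R":
        return True
    if int(tags.get("X0", "1")) > 1:
        return True
    return mapq == 0


def ambiguous_fraction(sam_lines):
    """Fraction of mapped reads that look ambiguously placed."""
    total = 0
    ambiguous = 0
    for line in sam_lines:
        if line.startswith("@"):  # skip header lines
            continue
        flag, mapq, tags = parse_sam_line(line)
        if flag & 4:  # 0x4 = read is unmapped
            continue
        total += 1
        if is_ambiguous(mapq, tags):
            ambiguous += 1
    return ambiguous / total if total else 0.0


# Toy input (hypothetical reads, not real data).
toy_sam = [
    "@SQ\tSN:chr\tLN:1000",
    "r1\t0\tchr\t100\t37\t4M\t*\t0\t0\tACGT\tIIII\tXT:A:U\tNM:i:0\tX0:i:1\tX1:i:0",
    "r2\t0\tchr\t200\t0\t4M\t*\t0\t0\tACGT\tIIII\tXT:A:R\tNM:i:0\tX0:i:3",
    "r3\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII",
]
print(ambiguous_fraction(toy_sam))  # 0.5
```

Restricting the input to reads overlapping the region in question (for example by extracting that interval first) would show whether the coverage there is dominated by multi-mapping reads.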

Just some thoughts, I'm no bacteria expert.

**JohnK** · 02-09-2011, 01:05 PM

You might also want to check for rRNA contamination. It's a possibility...

**pasta** · 02-10-2011, 01:47 AM

Thank you for these answers, that's very kind of you. I forgot to mention that all rRNA sequences were removed from our analysis.
For case #2, I blasted the sequence: no homology found; however, I found one nice promoter sequence. FYI, genes A and B are a DNA integration protein and a transposase, respectively. That's very interesting!

Do you have any explanation for the first case ?

**pmiguel** · 02-10-2011, 05:15 AM

What was your method of cDNA synthesis/library construction? It could be an artifact of these processes.

--
Phillip

**pasta** · 02-11-2011, 12:35 AM

Originally posted by pmiguel

What was your method of cDNA synthesis/library construction? It could be an artifact of these processes.

--
Phillip

Total RNA was treated twice with MICROBExpress (Ambion) to remove most rRNA.
mRNA was fragmented to prepare cDNA with hexanucleotides as primers and RNase H was used on the other strand. Then, Illumina adapters were added before the PCR.
Someone told me that the behavior we see in case #1 is rather normal in prokaryotes: transcription does not stop exactly at the end of the ORF, so some mRNAs can be longer. What do you think?

Thanks

antoine

**pmiguel** · 02-11-2011, 05:24 AM

Yes, I would buy that explanation.

Prokaryotic messages are said to be rapidly turned over. If this turnover is carried out by exonucleases, that would also cause lower coverage at the 5' and 3' ends in your sequencing results.
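One way to put a number on "lower 5' and 3' ends" is to compare mean coverage at each end of a gene with mean coverage in the middle. The sketch below is only an illustration: it assumes you already have per-base read depth for one gene as a list ordered 5' to 3', and the function name, the 10% end window, and the toy values are arbitrary choices for the example.

```python
def end_to_middle_ratios(coverage, end_fraction=0.1):
    """Return (5' ratio, 3' ratio): mean depth in each end window
    divided by mean depth in the middle of the gene.

    coverage: per-base read depth, ordered 5' -> 3' along the transcript.
    end_fraction: fraction of the gene length used for each end window.
    """
    n = len(coverage)
    k = max(1, int(n * end_fraction))
    if n < 3 * k:
        raise ValueError("gene too short for the chosen end window")
    five_prime = coverage[:k]
    three_prime = coverage[n - k:]
    middle = coverage[k:n - k]
    mid_mean = sum(middle) / len(middle)
    if mid_mean == 0:
        raise ValueError("no coverage in the middle of the gene")
    return (sum(five_prime) / k) / mid_mean, (sum(three_prime) / k) / mid_mean


# Toy profile (hypothetical depths): low at both ends, high in the middle.
toy_coverage = [2] * 10 + [10] * 80 + [5] * 10
print(end_to_middle_ratios(toy_coverage))  # (0.2, 0.5)
```

Ratios well below 1 at both ends across many genes would be consistent with a systematic effect (library preparation or degradation) rather than something specific to one locus.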

--
Phillip

**nasobema** · 02-11-2011, 05:34 AM

@case 1:
I believe it might be because of methodological bias. Some methods preferentially enrich 5'-ends of mRNAs while others do so for 3'-ends.

Your method is not strand-specific, so you cannot tell whether you are seeing a downstream transcript from gene B or actually the gene A transcript. So your "prokaryotic" explanation is also possible, though I wouldn't expect such a long tail (just a feeling, however).

@case 2:
I'll vote for a repetitive region here. You say gene B is a transposase? Such genes move genomic elements within and between genomes, often integrating at similar sites and carrying additional DNA. While the transposase itself can be a repeat within the genome, I would also expect to find more repetitive sequence in the vicinity.

**pasta** · 02-11-2011, 06:12 AM

Thank you very much for your explanations, I appreciate it. I am really starting to understand the biology behind the data, if that makes sense.

Thanks again !

Toni

**[email protected]** · 05-01-2011, 04:10 PM

Originally posted by pasta

John,
We used YACOP which uses several ORF finders: Critica, Glimmer and Z-curve.

I want to use Orpheus and Z-curve along with it, but I am unable to find them anywhere on the web. Did you use them? Can you tell me where I can download these two?

regards,
adnan

**Simon Anders** · 05-01-2011, 10:44 PM

Not being a biologist, and never having worked with prokaryotes, I apologize if this question is stupid, but: bacteria don't have UTRs? Not only translation but also transcription starts exactly at the start codon and stops at the stop codon? Otherwise, what is surprising about the transcript reaching beyond the gene boundary, if your gene model comes from an ORF finder?

I work a lot with yeast, and there many genes look like case #1. It seems that the promoter recruits the polymerase to a quite well-defined position where transcription starts, but where it stops (or more precisely, where the poly-A tail is placed) seems to be a region, or a collection of several possible places, giving the 3' end this "decaying" appearance. As for case #2: there are so many non-coding transcripts in eukaryotes (and in prokaryotes as well, maybe?) that I would be rather surprised if I did not find transcripts that don't overlap with an ORF.

**sshell** · 05-02-2012, 07:08 AM

I know it's an old post, but in case people are still reading it, I wanted to add that bacteria certainly DO have UTRs, so it is normal and expected that transcription from two convergent genes will overlap. Bacterial terminators are also not always sharp; transcription can end over a range of positions downstream of the stop codon. Gene "B" in the example fits this pattern.
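To get a feel for how far such a tail extends past an annotated ORF, one could measure how many bases downstream of the stop codon still carry coverage above some fraction of the gene's own mean depth. This is a minimal sketch under stated assumptions: the coverage list starts at the first base of the ORF and runs in the direction of transcription, `orf_end` is the 0-based index of the last base of the stop codon, and the 10% threshold is an arbitrary choice. As noted earlier in the thread, with a library that is not strand-specific the tail cannot be assigned to one gene with certainty.

```python
def tail_length(coverage, orf_end, min_fraction=0.1):
    """Count consecutive bases after the stop codon whose depth stays
    at or above min_fraction of the mean depth inside the ORF.

    coverage: per-base read depth, starting at the ORF start and
              oriented in the direction of transcription.
    orf_end: 0-based index of the last base of the stop codon.
    """
    orf = coverage[:orf_end + 1]
    orf_mean = sum(orf) / len(orf)
    threshold = min_fraction * orf_mean
    length = 0
    for depth in coverage[orf_end + 1:]:
        if depth < threshold:
            break
        length += 1
    return length


# Toy profile (hypothetical depths): a 50-base ORF at depth 20,
# followed by coverage that decays downstream of the stop codon.
toy_coverage = [20] * 50 + [15, 10, 5, 3, 1, 0, 0]
print(tail_length(toy_coverage, orf_end=49))  # 4
```

Running this over all annotated genes would show whether a tail like the one on gene "B" is typical for the dataset or an outlier.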

RNA-seq read coverage questions