Unconfigured Ad

**john_mu** · 05-20-2010, 11:02 PM

http://www.nature.com/nature/journal/v464/n7289/full/nature08872.html

This paper deals with detection of Polyadenylation sites from RNA-seq reads, but they do not release the software.

**epigen** · 05-21-2010, 06:02 AM

It depends on what you want to do with poly-A containing reads. The usual software does not look at your reads to classify them in any way, it just tries to map them. You can create something like an "artificial genome" containing sequences you want to exclude (or specifically want to extract), e.g. poly-A, repeats, rRNA etc., as a reference to which you align your reads. If this is what you're looking for, e.g. the SOLiD BioScope WT pipeline provides such sequences to be used as filters, but I'm sure you can also find them somewhere else.

**ryanmcg** · 02-19-2015, 02:47 PM

Originally posted by john_mu View Post

http://www.nature.com/nature/journal...ture08872.html

This paper deals with detection of Polyadenylation sites from RNA-seq reads, but they do not release the software.

I am interested in doing exactly what this paper describes, but their methods do not give me enough detail to do this (without already having a fair amount of programming knowledge).

Is there not a tool that is built to do this type of analysis? I imagine it would be popular.

**ryanmcg** · 02-19-2015, 03:21 PM

Here is the passage from their supplement. Any suggestions on how to implement this?

As described above, our mapping strategy allowed us to find putative novel polyadenylation sites.
We first identified all sequencing reads that did not initially map to the genome and either began or
ended with a run of at least four As or T. We then trimmed off the run of As or Ts and remapped
the reads to the genome using MAQ. At those reads that then mapped uniquely to the genome,
we inferred the precise base where cleavage occurred. To filter out cleavage sites possibly due
to sequencing errors, we removed putative polyadenylation sites where the downstream genomic
regions contained at least three As or Ts, reasoning that a sequencing error at the non-A or T
site might lead to mis-mismapping and spurious calling of a poly-A site.

**ryanmcg** · 02-19-2015, 03:45 PM

part of the way there: trimest

http://emboss.toulouse.inra.fr/cgi-bin/emboss/help/trimest

**Brian Bushnell** · 02-19-2015, 04:16 PM

You can filter or trim reads matching certain patterns, like poly-A, with BBDuk. For example:

bbduk.sh in=reads.fq out=trimmed.fq ktrim=r k=8 literal=AAAAAAAAAAAA mm=f rcomp=f

That will trim reads to the right, starting at the first poly-A of at least 8 in a row. If you only want to look at tails, you could use "restrictright=20" to only look for matches in the last 20bp of the read. Without the "ktrim=r" flag, it will run in filtering mode:

bbduk.sh in=reads.fq outm=matched.fq out=unmatched.fq k=8 literal=AAAAAAAAAAAA mm=f rcomp=f

"rcomp=f" means only look for a kmer and not its reverse complement (in this case, poly-T); you can turn that on or off.

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, Yesterday, 11:08 AM	0 responses 7 views 0 reactions	Last Post by SEQadmin2 Yesterday, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 11 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 19 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 53 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

polyA containing reads in RNA-seq

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News