Hi,
Is there any tool available that allows to trim adapter (and primers) that are inside of the reads (not in the 5' or 3' ends)?
For whatever reason a portion of my reads have a variable number of random bases before the adapter and primer. I need a tool that is able to trim the adapter+primer and any bases before it (this is between the 5'end and the adapter). I need to do the same for the 3'ends, where I have the same issue.
This is an example of such a read. In red are the adapter and primer sequences. After trimming, only the lower case portion should remain:
>made-up example
ATCGTAGTATTAGGATCGTCGCTGATGACTGAatgatgatagttacctagagcta[...]cgacatcacgaccccgatagaATAGCTAGCTATCGATCGATCTGACATAGACA
A custom script would do it as well. I tried using the bash tool grep, but wasn't able to make it work.
These sequences are amplicons generated using barcoded COI primers in PacBIO (with the Conserved Consensus Sequencing protocol). When I say adapter I mean the barcode that was added to the primers to be able to split the libraries.
Thanks in advance for your answers.
Iria
Is there any tool available that allows to trim adapter (and primers) that are inside of the reads (not in the 5' or 3' ends)?
For whatever reason a portion of my reads have a variable number of random bases before the adapter and primer. I need a tool that is able to trim the adapter+primer and any bases before it (this is between the 5'end and the adapter). I need to do the same for the 3'ends, where I have the same issue.
This is an example of such a read. In red are the adapter and primer sequences. After trimming, only the lower case portion should remain:
>made-up example
ATCGTAGTATTAGGATCGTCGCTGATGACTGAatgatgatagttacctagagcta[...]cgacatcacgaccccgatagaATAGCTAGCTATCGATCGATCTGACATAGACA
A custom script would do it as well. I tried using the bash tool grep, but wasn't able to make it work.
These sequences are amplicons generated using barcoded COI primers in PacBIO (with the Conserved Consensus Sequencing protocol). When I say adapter I mean the barcode that was added to the primers to be able to split the libraries.
Thanks in advance for your answers.
Iria
Comment