Header Leaderboard Ad


Removing primers



No announcement yet.
  • Filter
  • Time
  • Show
Clear All
new posts

  • Removing primers

    Hey all,

    I am new to using gsAssembler. I want to do an assembly of cDNA sequences. However, they have primers attached to it. I saw an option called 'Trimming database for De-Novo assembly, and provided the list of primers as an input. However, when I check the output assembly, several contigs still have those primers. How do I make sure that these are removed before assembly( I have a .sff file).
    Also, I want all the contigs after assembly. The AllContig.fna in gsAssembly is not listing all the contigs ( like the singletons). How do I obtain all the contigs from the assembly?

    Thanks in advance for the help,


  • #2
    The -v option is probably what you describe, and if it does not remove all primers I guess there is a bug in gsAssembler :-(

    Singletons are not contigs (by definition, I guess), but I do understand you want to get them included in your final dataset.

    Using the sffinfo trick described in the software manual on page 134 (getting the read IDs from the 454ReadStatus.txt file, make a new sff file with only the singletons, and then extracting the reads as a fasta file) would do it.


    Latest Articles


    • seqadmin
      A Brief Overview and Common Challenges in Single-cell Sequencing Analysis
      by seqadmin

      ​​​​​​The introduction of single-cell sequencing has advanced the ability to study cell-to-cell heterogeneity. Its use has improved our understanding of somatic mutations1, cell lineages2, cellular diversity and regulation3, and development in multicellular organisms4. Single-cell sequencing encompasses hundreds of techniques with different approaches to studying the genomes, transcriptomes, epigenomes, and other omics of individual cells. The analysis of single-cell sequencing data i...

      01-24-2023, 01:19 PM
    • seqadmin
      Introduction to Single-Cell Sequencing
      by seqadmin
      Single-cell sequencing is a technique used to investigate the genome, transcriptome, epigenome, and other omics of individual cells using high-throughput sequencing. This technology has provided many scientific breakthroughs and continues to be applied across many fields, including microbiology, oncology, immunology, neurobiology, precision medicine, and stem cell research.

      The advancement of single-cell sequencing began in 2009 when Tang et al. investigated the single-cell transcriptomes
      01-09-2023, 03:10 PM