Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • sameet
    Member
    • Apr 2010
    • 34

    #16
    Originally posted by mgogol View Post
    Yes.

    You can also use fastx_clipper to trim and fastx_collapser to collapse the reads if you want.

    There's a few web based tools you can plug in your collapsed sequences and counts and get some results... I'm trying them now and reading the papers just to see how they are and what they do, I might do something myself also, but it's easy enough to plug and chug. mirAnalyzer, mirTools, and DSAP.
    Hi,
    Can you point me to the specific papers that you mention?
    Sameet Mehta (Ph.D.),
    Visiting Fellow,
    National Cancer Insitute,
    Bethesda,
    US.

    Comment

    • mgogol
      Senior Member
      • Mar 2008
      • 197

      #17
      Here's the sites, the papers should be linked from these.





      Comment

      • sameet
        Member
        • Apr 2010
        • 34

        #18
        Hi,
        I have looked at the cutadpt. But in the help the sequence for the Illumina 5' primer is not given. Is that sequence or model for 5' primer already built-in in cutadapt. If not is that sequence available.

        I am also trying novoalign, will post my experience about it soon.
        Sameet Mehta (Ph.D.),
        Visiting Fellow,
        National Cancer Insitute,
        Bethesda,
        US.

        Comment

        • mgogol
          Senior Member
          • Mar 2008
          • 197

          #19
          In the data I'm currently looking at, the thing to trim was

          ATCTCGTATGCCGTCTTCTGCTTGT

          but your mileage may vary. This may help, or also just looking at some reads in the fastq file.

          Comment

          • sparks
            Senior Member
            • Mar 2008
            • 126

            #20
            Originally posted by sameet View Post
            Hi,

            I was wondering about one more question. If the adaptors are trimmed, would you get the final 'usable' (for the lack of better word) sequences of different lengths?
            Yes, you'd get trimmed reads of different length. Some aligners can handle this (eg Novoalign) some can't (e.g. MAQ)

            Colin

            Comment

            • sameet
              Member
              • Apr 2010
              • 34

              #21
              Originally posted by mgogol View Post
              In the data I'm currently looking at, the thing to trim was

              ATCTCGTATGCCGTCTTCTGCTTGT

              but your mileage may vary. This may help, or also just looking at some reads in the fastq file.
              That was very helpful. I think this is exactly what I was looking for. Thank you.
              Sameet Mehta (Ph.D.),
              Visiting Fellow,
              National Cancer Insitute,
              Bethesda,
              US.

              Comment

              • sameet
                Member
                • Apr 2010
                • 34

                #22
                I think the following link is the most comprehensive list of all the primers as far as SOLEXA/Illumina is concerned.

                Sameet Mehta (Ph.D.),
                Visiting Fellow,
                National Cancer Insitute,
                Bethesda,
                US.

                Comment

                • sameet
                  Member
                  • Apr 2010
                  • 34

                  #23
                  Originally posted by sparks View Post
                  Hi Sameet,
                  Novoalign will trim adapters and align miRNA in one go. It's also pretty fast even free version with no multi-threading as reads are usually aligned with at most one mismatch.
                  Novoalign also looks for precursor location, giving a score and location for nearby complementary alignment.
                  Steps:
                  1. index genome
                  novoindex ref.idx ref.fasta
                  2. Align the miRNA
                  novoalign -d ref.idx -a -m -t 30 -f reads.fastq

                  Colin
                  This solutions seems to be working perfectly. I dont have an idea of how long it should take. But it did remove the adaptors and i am getting an output file that looks to contain a lot of reads that are of differing lengths.
                  Sameet Mehta (Ph.D.),
                  Visiting Fellow,
                  National Cancer Insitute,
                  Bethesda,
                  US.

                  Comment

                  • bharat_iyengar
                    Member
                    • Dec 2012
                    • 20

                    #24
                    I had been doing fair amount of RNAseq and small RNAseq analysis but I still have a basic doubt regarding small RNAseq.

                    for RNAs smaller than the typical read length, where do the extra bases come from. They dont seem to have any common sequence which I might consider as adapter.

                    Comment

                    Latest Articles

                    Collapse

                    • GATTACAT
                      Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by GATTACAT
                      Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                      07-01-2026, 11:43 AM
                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 07-02-2026, 11:08 AM
                    0 responses
                    16 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-30-2026, 05:37 AM
                    0 responses
                    17 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-26-2026, 11:10 AM
                    0 responses
                    20 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    54 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...