Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • thermophile
    replied
    I don't trim based on qscore, I use the qscores to merge reads. I use mothur which impliments pandaseq for it's read merging

    Leave a comment:


  • BioGenomics
    replied
    Hi all,

    what is the minimum Q-value you would suggest for a 16s read/merged amplicon Trimming/clipping ?

    thanks

    Leave a comment:


  • fanli
    replied
    My bad - that last screenshot is Called Int. Yeah, you're right - the Cycle 1 intensities for my last run are 30 and 39 for read 1 and read 3, respectively.

    We haven't had any issues with runs terminating randomly for 16S libraries though. Although now that I think about it, we did have one bacterial WGS run that died on cycle 515 or so. Something about a .NET framework error and tech support said they hadn't seen it before.

    Leave a comment:


  • RickC7
    replied
    strange

    hi fanli,

    hmmm, the run summary you posted on 5-11-2015 shows read 1 intensity at 17 and read 4 intensity at 53. The other run summary from the OP also shows low intensity 1st cycle. This also fits with what I see in 16s libraries on miseq. So, 3 different machines, 3 different places, 3 similarly low 1st cycle intensities...
    Tech support is telling me this is the reason I am having issues with run completion. Last 3 runs are terminating randomly, run1- cycle 385, run2-cycle 60, run3 - cycle 587. Tech support has been helpful in replacing kits, but miseq is still out of commission. Have arrange for libraries to be sequenced on another miseq, Qc all checks out so I have little to no concern about the libraries. One comment that came out was " your 1st cycle intensities are very low..." I get a stopped run and funky z-stage errors, z-stage replaced but same issue persists.

    Leave a comment:


  • fanli
    replied
    I think low diversity would only cause jaggedness in the intensity profile, but maybe you should check w/ tech support.

    Our 16S runs have 1st cycle intensities ~150.
    Attached Files

    Leave a comment:


  • RickC7
    replied
    Not meaning to hijack the thread, but can anyone explain why we see such low 1st cycle intensities with 16s libraries? I see this in both v3-v4 and v4 only libaries, perhaps due to low diversity? If I look at run summary from targeted reseq or phix, then the 1st cycle intensities are comparable and normally in the 300-400 range, but 16s runs are usually <50. Thanks for any insight.

    Leave a comment:


  • GA-J
    replied
    Thank you, Fanli.

    Leave a comment:


  • fanli
    replied
    Yes, these numbers are for v2 kits. We've found that there's little difference with a small PhiX spike on our particular MiSeq, but I don't really see the harm in doing something in the 5% range. You generally aren't going to be constrained for sequencing depth with 16S anyways.

    Leave a comment:


  • GA-J
    replied
    Fanli, thank you for your information. Two more questions, did you use Miseq V2 kit for this 16s V4 run? Why no Phix spike in(how do you decide no Phix, any protocol mentioned or you tested it out? )? I want to change my protocol, but I would like to know why. Thank you very much.

    Leave a comment:


  • fanli
    replied
    We load 8.0 pM library using the 515F/806R primers detailed here:


    Edit: 1.8pM was for NextSeq runs
    Last edited by fanli; 07-30-2015, 06:44 AM.

    Leave a comment:


  • GA-J
    replied
    Hello, Fanli, I like your result of 16s V4 Miseq run. I would like to know how much you loaded? And the size of your library is ?

    Thanks.

    Leave a comment:


  • MiSeqUserLUX
    replied
    Thank you GenoMax and thank you all for your replies.
    I had a look at our pipeline closely and indeed there was something that needs to be fixed (UPARSE workflow recommends merging of paired reads first before read quality filtering.) So your right, merging needs to be done first.

    For some reasons I don't know why our bioinformatician set-up the pipeline this way:

    STEP 1. reads quality filtering of R1 and R2 separately (this is where a lot of our reads are discarded and the bioinformatician tells me that the MiSeq data are not usable)

    IF the sequences pass STEP 1,
    then what will be done is step 2...

    STEP 2. back to scratch>> merging of paired reads, read quality filtering.... assembly....

    I believe starting from step 2 would be sufficient.

    Leave a comment:


  • GenoMax
    replied
    I think you meant to say Q30 (not 3) since your data does not seem to have any reads below Q5.

    If that is indeed Q30 (and above) then it seems to be a very stringent filter. Since the reads are expected to overlap perhaps the merging should be done prior and Q-score used as a criteria to keep the base with the higher quality (if the merge is not perfect). Look into BBMerge (http://seqanswers.com/forums/showthread.php?t=43906) or FLASH as options http://ccb.jhu.edu/software/FLASH/.

    Leave a comment:


  • MiSeqUserLUX
    replied
    If you only have a small overlap, you are trying to stick together two bad quality sections of your reads, which causes problems.
    We perform 2x300 bp PE reads for a 466 bp amplicon (we have an overlap of around 140 bp before the quality filtering). After quality filtering, we expect the reads to overlap by at least 40 bp. Then we will perform merging and classification. In this case, is 40 bp overlap after quality filtering still too small? Or is it enough?

    Do you know what Q-score cut-off your informatics people are using?
    Is it possible that you're setting the quality filter too strict?
    The quality reads selection criteria that our bioinformatician has set are as follows:
    • Expected error of global reads sequence < 1
    • Each reads nucleotide Q score > 3
    • Length > 250 bp (to have an overlap > 40bp after quality filtering)

    Is this too strict or just right? As per our bioinformatician, the Qscore and Expected error values are the ones recommended by Uparse developers and in the Uparse publication.

    An example of our quality filtering result is attached.

    I have also just ran a FASTQC on one of the samples. Attached are the results.

    Any thoughts?
    Thank you in advance.
    Attached Files

    Leave a comment:


  • microgirl123
    replied
    We expect the reads to overlap at least 40 bp.
    This might be part of your problem. I'm not a bioinformatics person, but I usually see a much larger overlap recommended (except by Illumina). The ends of Read 1 and Read 2 (especially) are much lower in quality than the start. If you only have a small overlap, you are trying to stick together two bad quality sections of your reads, which causes problems.

    I spiked-in 10% of 20pM PhiX and loaded 3.5 pM library
    I'm confused by this also. I spike 10% of 12.5 pM PhiX into a 9.5 pM library and see runs similar to the one you linked (cluster density ~900K, 10% phiX aligned). You're loading a lot more phiX and a lot less library, and only seeing 15% align.
    Last edited by microgirl123; 05-11-2015, 10:12 AM.

    Leave a comment:

Latest Articles

Collapse

  • seqadmin
    Exploring the Dynamics of the Tumor Microenvironment
    by seqadmin




    The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
    07-08-2024, 03:19 PM
  • seqadmin
    Exploring Human Diversity Through Large-Scale Omics
    by seqadmin


    In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
    06-25-2024, 06:43 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 07-19-2024, 07:20 AM
0 responses
139 views
0 likes
Last Post seqadmin  
Started by seqadmin, 07-16-2024, 05:49 AM
0 responses
114 views
0 likes
Last Post seqadmin  
Started by seqadmin, 07-15-2024, 06:53 AM
0 responses
109 views
0 likes
Last Post seqadmin  
Started by seqadmin, 07-10-2024, 07:30 AM
0 responses
43 views
0 likes
Last Post seqadmin  
Working...
X