Announcement
Collapse
No announcement yet.
X
-
Originally posted by Roy View PostThe new version of sfffile (2.9) has the option -both to remove MIDs at both ends (although only if they are the same).
Leave a comment:
-
The new version of sfffile (2.9) has the option -both to remove MIDs at both ends (although only if they are the same).
Leave a comment:
-
Originally posted by kmcarr View PostNo, sfffile will not sort reads based on two MIDs. In the original library preparation method the MID tags are only placed at the 5' end of the read. The new rapid library preparation method will result in the same MID tag being added to the both ends of the fragment.
Originally posted by kmcarr View PostIn theory the fragment should be longer than the read generated by the instrument so you would not run into the MID tag at the 3' end. However some of your inserts will be short and read all the way through the inserted fragment. The software has always recognized this possibility and would trim off any B adapter sequence it found at the 3' end of a read. Now the MID sorting software also has to deal with this. The second sequence in the MID configuration file for Rapid Library MIDs is only there to tell the software to trim off this sequence if it encounters it at the 3' end. It only sorts the reads based on the MID sequence at the 5' end.
After you manually re-process the .sff file with "sfffile -s" you yield an .sff file for each left MID. Each such file has the left MID in "low-qual" region (because it is ahead of the left-qual trim point). But, sfffile does not bother with eventual MIDs on the right side.
I have a tool that can do this, and much more. Currently, I offer only a data processing service. Do you want to place your order? ;-)
Leave a comment:
-
Originally posted by dina View PostHi.
I need help with splitting my reads to the samples they refer to, but each sample was tagged with a mid at the 5' end and at the 3' end . for example: sample 1 is tagged with mids 1 at the beginning and 2 at the end, and sample 2 was tagged with mid 1 at the beginning and mid 3 at the end etc. So I need to have reads with mid1-read-mid2 separated from the reads mid1-read-mid3 etc...
I saw you can split reads using sfffile, but I couldn't find there a way to solve this issue of mids at both ends. Thank you!!!!
Leave a comment:
-
Hi there!!
I am a newbie in 454 sequencing data analysis.
I hve learnt that sfffile program can trim off the MID tags while still retaining the .sff file format.
I am using the sfffile comand
sfffile -o roche454_trimmed.sff -s -nmft mid.fasta roche454.sff
but.. I am getting a pool of erors.
Can anyone please suggest the correct syntax for using this program?
Mostly, I am keen to know about the file format of mid file.
Leave a comment:
-
GSMIDS are the MIDs for the 'standard' shotgun library protocol, RLMIDs are for the Rapid Library protocol (available from Oct 2009). Nothing fancy to it...
Leave a comment:
-
Can someone explain to a 454-newbie what GSMIDs and RLMIDs are? I've got a dataset using RLMIDs and I'm just trying to learn a bit more about what this means. Anywhere useful y'all could point me?
Thanks!!
Leave a comment:
-
Originally posted by litali View Postbut if I use the 454 gui software , it is possible to build the multiplexer so that it will distinguish between 2 samples which have the same mid at the 5 end and different mids at the 3 end, isn't it? For example: samp1: mid1-sequence-mid2
samp2: mid1-sequence-mid7
When you are adding read files to an assembly or mapping project with the GUI and you select multiplexing you select a MID scheme (e.g. GSMIDS, RLMIDS) from your MID config file. The GUI then presents you with a list of MIDs in that scheme and you select which MIDs to include in the filtering. The software only looks at the 5' end of the read for the MID and will only identify one MID per read.
If you wanted to use mixed, dual-end MIDs with 454 you would need to write your own script to sort the reads. You would also have to make sure that your sequence reads will reach all the way to the 3' MID. This means you could really only reliably use this method for amplicon sequencing where you know exactly the size of the product and that size is reachable within a 454 run.
Leave a comment:
-
mids
but if I use the 454 gui software , it is possible to build the multiplexer so that it will distinguish between 2 samples which have the same mid at the 5 end and different mids at the 3 end, isn't it? For example: samp1: mid1-sequence-mid2
samp2: mid1-sequence-mid7
Leave a comment:
-
Originally posted by robs View PostYou can check if your version scans for both ends by just looking at the MID definitions of your current version.
Leave a comment:
-
I remember that it does it. Maybe, you might want to upgrade to the latest version that allows the detection of both ends. You can check if your version scans for both ends by just looking at the MID definitions of your current version.
Maybe, the 454 support can make a comment on this, since I don't have access to this kind of data at the moment.
Leave a comment:
-
Originally posted by robs View PostThe 454 SFF tools will split data sets by MIDs and remove the MIDs as well. If you have a data set with MIDs at the 5'-end, then it only removes them from the 5'-end. If you have a data set with MIDs at both end, then it will remove both MIDs and also split accordingly. If not, contact the 454 support.
Hey, this makes up my 100th post ;-)
Leave a comment:
-
Sequence quality is gradually reduced toward the end of a read.
What is the length distribution of your library molecules? If < 400 bp, it would be OK.
The Titanium libraries are typically 600 - 900 bp long, but you only get the first ~400 bp in your reads, meaning that if you put a barcode at the end, it would not be sequenced or would not survived the quality filter.
Anyway, you can try to sort your reads by two steps. First, as usual way, the forward sorting. Second, reverse your reads, sort again. In the second step you will probably miss many reads).
Leave a comment:
-
splitting to mids on both sides
Hi Robs,
Thank you, but I saw that if I use for example: mid2@sffFile, then all the samples with mid2 at the beginning will be chosen, so how do I define to sfffile just to present all the options with both ends and to call it accordingly. for example to result in files like this: file of mid 2-mid5 (=all the reads with mid 2 at the 5 end and mid 5 at the 3 end), etc..
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
At the intersection of cytogenetics and genomics lies the exciting field of cytogenomics. It focuses on studying chromosomes at a molecular scale, involving techniques that analyze either the whole genome or particular DNA sequences to examine variations in structure and behavior at the chromosomal or subchromosomal level. By integrating cytogenetic techniques with genomic analysis, researchers can effectively investigate chromosomal abnormalities related to diseases, particularly...-
Channel: Articles
09-26-2023, 06:26 AM -
-
by seqadmin
Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...-
Channel: Articles
09-07-2023, 11:15 PM -
-
by seqadmin
Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.
Whole Transcriptome RNA-seq
Whole transcriptome sequencing...-
Channel: Articles
08-31-2023, 11:07 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 09-27-2023, 06:57 AM
|
0 responses
11 views
0 likes
|
Last Post
by seqadmin
09-27-2023, 06:57 AM
|
||
Started by seqadmin, 09-26-2023, 07:53 AM
|
0 responses
11 views
0 likes
|
Last Post
by seqadmin
09-26-2023, 07:53 AM
|
||
Multiplexed Biomarker Detection with Nanopore Technology: A Leap in Precision Diagnostics
by seqadmin
Started by seqadmin, 09-25-2023, 07:42 AM
|
0 responses
15 views
0 likes
|
Last Post
by seqadmin
09-25-2023, 07:42 AM
|
||
Started by seqadmin, 09-22-2023, 09:05 AM
|
0 responses
45 views
0 likes
|
Last Post
by seqadmin
09-22-2023, 09:05 AM
|
Leave a comment: