Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
wow that is some really good work that you've been doing there. I had been closely associated with some projects on Encodeproject on RNA sequence measures.
-
Hey, the data turned out OK and was useful! But yeah, I had to dust off the hard drive the reads were on, it'd been sitting on a shelf for eight years or so.
Leave a comment:
-
Well, the main thing is your csfasta and qual files work fine.
Could BC be barcode ? They seemt to be short, as in barcodes, and the format seems to be the csfasta format.
Not sure I ever saw these in my days of SOLiD adventures, which ended in 2012 (thank god).
Leave a comment:
-
What are these BC files for? Should I use them in alignment?
Hi, folks. I've started working with some old SOLiD single-ended RNA-seq reads, from 2010 and 2011. I'm using novoalignCS and have the quality files, csfasta files, and an additional fasta-like file with "BC" in the filename. Here's an example of the filenames:
S1001938CIS_7/primary.20100713155754122/reads/solid0015_20100706_S1001938CIS_BC_bcSample1_F3.csfastaS1001938CIS_7/primary.20100707172116543/reads/
solid0015_20100706_S1001938CIS_BC_bcSample1_F3.stats
solid0015_20100706_S1001938CIS_BC_bcSample1_F3_QV.qualsolid0015_20100706_S1001938CIS_BC_bcSample1_BC.csfasta
The F3.csfasta and F3_QV.qual files are as expected, and work fine with novoalignCS.
The BC.csfasta files have data as follows and I'm completely mystified as to what they are:
# Wed Jul 7 11:02:39 2010 /share/apps/corona/bin/filter_fasta.pl --output=/data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/results.F1B1/primary.20100707172116543 --name=solid0015_20100706_S1001938CIS_BC_bcSample1 --tag=BC --minlength=5 --mincalls=25 --prefix=G /data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/jobs/postPrimerSetPrimary.1505/rawseq
# Cwd: /home/pipeline
# Title: solid0015_20100706_S1001938CIS_BC_bcSample1
# Library:S1001938CIS_7:00313
>1_223_2_BC 0
G00313
>1_238_37_BC 0
G00313
>1_240_14_BC 0
G00313
Anyone know? Should I be using these BC files in some way? They have extremely little information content.
Latest Articles
Collapse
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
-
by seqadmin
Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...-
Channel: Articles
03-22-2024, 06:39 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
25 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
28 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
||
Started by seqadmin, 04-10-2024, 09:21 AM
|
0 responses
24 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 09:21 AM
|
||
Started by seqadmin, 04-04-2024, 09:00 AM
|
0 responses
52 views
0 likes
|
Last Post
by seqadmin
04-04-2024, 09:00 AM
|
Leave a comment: