Hello SEQanswers Community,
I am a newbie to develop bioinformatics tools although I have many experiences in computer and electrical engineering.
I've developed an error corrector for the Illumina sequencing data. For the evaluation of the module, the read IDs are mandatory. I've generated several corrected DNA sequences with Quake, SOAPec, Musket, Coral, Hybrid SHREC, and AllPath-LG EC(Error corrector).
The problem is that AllPath-LG EC and Hybrid SHREC changes the read IDs. I only need the original read IDs corrected by AllPath-LG EC.
In the output folder, there are following files:
1. Input Files: uncorrected_1.fastq, uncorrected_2.fastq
2. Output Files: uncorrected.paired.A.fastq, uncorrected.paired.B.fastq, uncorrected.unpaired.fastq, uncorrected.fastq, uncorrected.fastq.ids
The content of uncorrected.fastq.ids file is looks like:
#New_ID, Original_ID
0, 0
1, 1
2, 2
3, 3
4, 6
5, 7
6, 8
7, 9
8, 10
9, 12
10, 13
...
Unfortunately, there is no file such as 'uncorrected.paired.A.fastq.ids' and 'uncorrected.paired.B.fastq.ids'
Do you know how to get the original read IDs with these files?
Bests,
Euncheon
I am a newbie to develop bioinformatics tools although I have many experiences in computer and electrical engineering.
I've developed an error corrector for the Illumina sequencing data. For the evaluation of the module, the read IDs are mandatory. I've generated several corrected DNA sequences with Quake, SOAPec, Musket, Coral, Hybrid SHREC, and AllPath-LG EC(Error corrector).
The problem is that AllPath-LG EC and Hybrid SHREC changes the read IDs. I only need the original read IDs corrected by AllPath-LG EC.
In the output folder, there are following files:
1. Input Files: uncorrected_1.fastq, uncorrected_2.fastq
2. Output Files: uncorrected.paired.A.fastq, uncorrected.paired.B.fastq, uncorrected.unpaired.fastq, uncorrected.fastq, uncorrected.fastq.ids
The content of uncorrected.fastq.ids file is looks like:
#New_ID, Original_ID
0, 0
1, 1
2, 2
3, 3
4, 6
5, 7
6, 8
7, 9
8, 10
9, 12
10, 13
...
Unfortunately, there is no file such as 'uncorrected.paired.A.fastq.ids' and 'uncorrected.paired.B.fastq.ids'
Do you know how to get the original read IDs with these files?
Bests,
Euncheon
Comment