Actually, you might be a little more careful and change the first sed command to:
sed 's/:N:/ /'
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
I think so. From the documentation, either the names have to be the same, which they aren't, or you need a /1 and /2 to end them.
You could use sed to fix them something like:
The following should fix it sed 's/:N/ /' INFILE | awk '{print $1}' | sed -e 's|_1|/1|' -e 's|_2|/2|' > OUTFILE
Leave a comment:
-
For example, the first two paired reads in fasta file were displayed with head:
>HWI-ST1280:130:C1GYVACXX:8:1101:1161:2106_1:N:0:GGCTAGA
>HWI-ST1280:130:C1GYVACXX:8:1101:1161:2106_2:N:0:GGCTAGA
which were transformed from fastq file.
Is this the problem?
Leave a comment:
-
Do your reads have /1 and /2 after them? Otherwise, I don't think ABySS can understand the paired information.
Leave a comment:
-
By the way, the other output seems fine. No error report else was found.
Leave a comment:
-
Hi,
Here is the script i used.
for k in {56..60}; do mkdir k$k; cd k$k; abyss-pe k=$k name=scale in='/scratch/zjr/scale/data/trimedpaired.fasta' OVERLAP_OPTIONS=--no-scaffold SIMPLEGRAPH_OPTIONS=--no-scaffold E=1 n=10 v=-v; cd ..; done
Leave a comment:
-
That seems very odd. What was the command line that got you this far and the rest of the output?
Also, the ABySS user group is usually pretty helpful. So you might try there too.
Leave a comment:
-
abyss error
Hi,
I am doing an ABySS assembly recently. Here is the problem i met:
Mapped 419743499 of 454426594 reads (92.4%)
Mapped 419741912 of 454426594 reads uniquely (92.4%)
Read 454426594 alignments
Mateless 454426594 100%
Unaligned 0
Singleton 0
FR 0
RF 0
FF 0
Different 0
Total 454426594
error: the histogram `scale-3.hist' is empty
sort: write failed: standard output: Broken pipe
sort: write error
make: *** [scale-3.dist] Error 1
make: *** Deleting file `scale-3.dist'
Could anyone help me figure this out?Tags: None
Latest Articles
Collapse
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
-
by seqadmin
Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...-
Channel: Articles
03-22-2024, 06:39 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
31 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
32 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
||
Started by seqadmin, 04-10-2024, 09:21 AM
|
0 responses
28 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 09:21 AM
|
||
Started by seqadmin, 04-04-2024, 09:00 AM
|
0 responses
53 views
0 likes
|
Last Post
by seqadmin
04-04-2024, 09:00 AM
|
Leave a comment: