Seqanswers Leaderboard Ad

**colindaven** · 03-28-2012, 03:01 AM

It may not be too helpful, but we used illumina and 454 together in a genome project.

We used gsAssembler to build 454 contigs, then Velvet to assemble Illumina.

Afterwards we included mapped fragments <2000bp of the Illumina assembly as fake reads
in Newbler.

It didn't improve results too much however, so we ended up using SSPACE with PE Illumina reads and 454 contigs.

**flxlex** · 03-28-2012, 07:11 AM

Looks like something is wrong with your fastq file. Could you post the first few lines (say 12 or 16)?

**ximo** · 04-03-2012, 06:22 AM

Originally posted by flxlex View Post

Looks like something is wrong with your fastq file. Could you post the first few lines (say 12 or 16)?

I have used this file with mira and bwa whitout problems, but?

Thanks

@CUES000161
AGAGAATCACCTGCTCAGTACAAAAATAATGACGCCCA
+
######################################
@CUES000162
AAGCAGTGGCATCAACGCAGAGTACGC
+
GG5>3C;AC<DD=DDFFFAD@?79<><
@CUES000163
AGATTGTTGCCTGGATTATGATATGATACAATACAAAT
+
HHGHHHHGFH?HHHHHH0HHHHADHCHHHHEHGHHH=H
@CUES000164
TCTTGTTGTTCGAGTCAATAGGAGCTGTACTCTGTACT
+
FEFEFFFEFFEE:<FEE:EEFFFBFFEFF>G:F@=CCE
@CUES000165
GATATGTTTGTAGGAATTTTCTTGAACTTTTTACCAAT
+
GGGGGGCCCG3FCDD55544GGBBGBGGGGGGGGGGFE
@CUES000166
CTTTGCTTCTTCAGTTCAAATTGGAATTTGAGCTCGGA
+
C>@AC3CCCCA>.@<[email protected]
@CUES000167
ATTGGATATTTTTGTTAAATTATGTTTGTTCCAAAGAT
+
HHGHHGGHHHHHEEEEEHHHHHHHHHHHHHHHHHHHGA
@CUES000168
TATACTTATGTACAAGACGCTGTTATTGATATTAAATC
+
GHHCHHHHHHHHHGGHHFHHGHHE8EDFFFBHHGF1DA
@CUES000169
AGAATGTGAACCCACACACACAGCCATTTGGATCACTT
+
AEGDGGGGGDGGFEGEGEECG2GCGCCGGGGFGGCGCG
@CUES000170
CGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTT
+
?>4?CC8;<F3)A@3DBBD5A459<FF??FBBBBBBBB
@CUES000171
TTGGAGGAAAGTTCAGCCATCCCAATAATGAAAGAGAT
+
?FFFDFFDGGDG?FGAGGADDGG=AC5?CC2DDD=AFF
@CUES000172
GATGAACATTTTAAAATCTTAATTCCTCCAATTTGGAT
+
CCCCCAGAGGGGGGCCGFGGGGGGGGGGGGGGGGGGGG
@CUES000173
GGTATGGGTGAGTTTGGTGATCGTTACTTCGGAACTGA
+
HHGHHHHHEHEHHHHHHDHBFGGFG@FGGFHHHEHHHE
@CUES000174
TTCCAAAGGGGTCGCCTTTTCAATCTCCACCATTCATG
+
GGGDDC;CCCGCGEGG?EGCEBBEEGGB7GFBEFG?0D
@CUES000175
ATCCAACTGCTGTGGAAGGCCGTCTCCTTTCAGTCAGC
+
==<<;1@9@>=E@EEHHACHHHHHHHHHHDH?HHEHHH
@CUES000176
GAGAAGGGTTATCAGATCATGATTCCTTTCTTTGATTG
+
BGHGHHGCDHHDFHFGHHHEHAHHHCHHHBHHCCFHHH
@CUES000177
TATATTCTTCGGGCAGCCGCCATTAAAGCTTTGGGATC
+
FFF?FAF?FFDDGAGDFG?DA=G=5C/=ACGG?.GAA=
@CUES000178
AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTT
+
HGHHHHHHHGEHHHHHHHHHHHHHH8=DAD=;<6<>C=
@CUES000179
TGAATTTCTATCTACAAACATGAACAATACCAATCTCT
+
DDADAD5@@AFFFFF>>;?>GD55A>>>?;:A/AD;A?
@CUES000180
AGCAGCCTCCACGTATGAACTCATCGTCACGTTAGATT
+
HGGEDHHHHHEHHHHHH>HHCECHHHHHDDHFEBF3<A

**flxlex** · 04-04-2012, 12:35 AM

Sorry, nothing is wrong with your file of course. However, newbler will not recognize it. It expects this header style:

Read 1:

Code:

@EAS139_FC706VJ:2:2104:15343:197393#0/1
GGGTGATGGCCGCTGCCGATGGCGTCAAATCCCACC
+
IIIIIIIIIIIIIIIIIIIIIIIIIIIIII9IG9IC

Read1 (in a separate file)

Code:

@EAS139_FC706VJ:2:2104:15343:197393#0/2
CGATGGTCGTTTCGGAAGATGACGTGAATTGCCTGG
+
IIIIIIIIIIIIIIIIIIIIIIIIIIIIII9IG9IC

The /1 and /2 at the end tell newbler the pairing info.

One solution would be to adjust the header. Alternatively, you could convert your files to fasta + qual files, and include the pairing information int header, as I explain in my blog post here.

**ximo** · 04-04-2012, 02:17 AM

flxlex,

Thanks for your help and for your useful post, but my reads are not paired-end. Do you know if Newbler works with non paired-end Illumina data?

Thanks

Ximo

**flxlex** · 04-04-2012, 11:22 PM

Not if they are that short. Newbler's minimum read length is 50 bases, which I now see is why your 36 base reads did not assemble. You could try setting the minlen parameter to your read length. But don't try to assemble the Illumina reads only using newbler, it is not built for such short reads...

**wustudybreak** · 04-06-2012, 07:01 AM

454 newbler runMapping alignment

Hello,

Does anyone know if 454 runMapping alignment doing local alignment or global alignment?

Any information on how its aligning algorithm is helpful.

Thanks

**ximo** · 04-11-2012, 12:57 AM

Originally posted by flxlex View Post

Not if they are that short. Newbler's minimum read length is 50 bases, which I now see is why your 36 base reads did not assemble. You could try setting the minlen parameter to your read length. But don't try to assemble the Illumina reads only using newbler, it is not built for such short reads...

I have tested this parameter, but I have the same result. When I have used 454 and Illumina seqs, it makes the assembling but in the 454ReadStatus.txt the illumina seqs are all labeled as TooShort

runAssembly -ml 50% -mi 95 -minlen 15 -o newbler_test test_100000_ill test_100000_454

Any suggestion?
Thanks

**flxlex** · 04-16-2012, 04:52 AM

Oops... I had forgotten that reads between minlen and 50 bases only are used when there is at least one read dataset that newbler recognizes as paired end (i.e. mate pair, long insert library). In your case, I don't think you can use newbler for your short reads. Perhaps you can assemble the Illumina reads into contigs using something like velvet, and use those contigs as reads for a contigs+454 reads assembly?

**ximo** · 04-16-2012, 11:16 PM

Ok. Thanks a lot

Ximo

**flxlex** · 04-18-2012, 04:46 AM

I just saw that newbler 2.7, which just came out, has a new flag: -short "Force use of reads shorter than 50 bp in projects that don’t include any paired end data. Reads shorter than 50bp are automatically used if any paired-end data is used in the project. The lower limit is 20 bp (or minlen if –minlen is used)."

So, I advice you to try to get you hands on this version (through the Roche website)!

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, 07-25-2024, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin 07-25-2024, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 26 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 160 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad

Announcement

newbler2.6 454 and illumina seq, help

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News