Seqanswers Leaderboard Ad

**maubp** · 05-04-2013, 12:34 AM

Originally posted by sp24 View Post

I know that I’m supposed to get .nhr .nin and .nsq, but I think that the database is so big that I got something like this:
DB.00.nsq, DB.00.nin, DB.00.nhr
DB.01.nsq, DB.01.nin, DB.01.nhr
and so on.
So here’s the first question: is this a problem? Or will I just have to blast my query file against each database (00, 01), one at a time?

No that's normal for a very large database - have a look at the NCBI provided NR or NT databases as an example.

Originally posted by sp24 View Post

Also, before I get too far into this, I also would like to know if for some reason I shouldn’t be merging the read files and creating a database from it.

That is a very sensible question - you might get something out of your planned analysis but this is not the normal approach (I would do a transcriptome assembly giving you putative transcripts, attempt to analyse them, for example with BLAST against sister species).

**sp24** · 05-05-2013, 05:55 PM

Thanks for your response. I decided not to merge the R1/R2 files since I've been reading on here that the R2 file needs to be reverse complemented. I've been creating blast databases with individual read files and blasting against my query, but now I'm trying to figure out how to interpret the outputs.

For the scope of this project I'm not going to be able to assemble transcriptomes, just trying to figure out how to retrieve these sequences. If someone reads this and has done this before, I would appreciate your input. Or if there's any papers/online resources you can point me to that would be great as well.

Topics	Statistics	Last Post
Expanded Genetic Insights into Blood Pressure Regulation by seqadmin Started by seqadmin, Yesterday, 12:17 PM	0 responses 13 views 0 likes	Last Post by seqadmin Yesterday, 12:17 PM
The Role of Enhancers in Defining Cell Fate by seqadmin Started by seqadmin, 04-29-2024, 10:49 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-29-2024, 10:49 AM
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, 04-25-2024, 11:49 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-25-2024, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 23 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM

Seqanswers Leaderboard Ad

Announcement

Makeblastdb from paired end reads

Comment

Comment

Latest Articles

ad_right_rmr

News