Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BLAST server

    Hi, I was wondering if there is online free server that can blast big data sets?

    Thanks

  • #2
    That you upload your large dataset to in order for the server to allow you to BLAST it? Is there any particular reason you can't download the BLAST+ suite and do it locally?

    Comment


    • #3
      Yes, I know offline BLAST+ can do this. However, I don't have a good cluster computer to do this. So, that is why I ask if there is any free online super computer that I can use.

      Comment


      • #4
        There are free services for some specific purposes. What might your data set be?
        savetherhino.org

        Comment


        • #5
          My data set is 454 microbial metagenomics data. I would like to do BLASTX against nr database. Maybe also BLASTN.

          Thanks

          Comment


          • #6
            Upload it to MG-RAST then. You can later extract tabular output against lots of different dbs. Register an account and have a look at some publicly available set, especially the download page..
            savetherhino.org

            Comment


            • #7
              Hi Rhinoceros,

              I did that. I need to use BLASTX is because I would like to use MEGAN to do some analysis. I posted topic couple of days ago about MGRAST-MEGAN. I think you saw it.

              Currently, there is no good way to convert mgrast annotation to NCBI format. (MG-RAST support said this).

              So, I need to BLAST my data set.

              Comment


              • #8
                Originally posted by SDPA_Pet View Post
                Hi Rhinoceros,

                Currently, there is no good way to convert mgrast annotation to NCBI format. (MG-RAST support said this).
                I'm surprised. If you go for example here, scroll down to "Annotation Download via API", select "feature" for annotation type and GenBank for data source and click download, you'll end up with the file "mgm4447970.3_feature_GenBank.tab", which looks like:

                Code:
                query sequence id	hit m5nr id (md5sum)	percentage identity	alignment length,	number of mismatches	number of gap openings	query start	query end	hit start	hit end	e-value	bit score	semicolon seperated list of annotations
                mgm4447970.3|GF8803K01BBK1N	0001e05fd22fb2fee5c66d3bfc20f4d7	62.86	140	52	0	16	155	602	741	2.2e-40	163.0	ABK03889.1
                mgm4447970.3|GF8803K01C9I15	0001e05fd22fb2fee5c66d3bfc20f4d7	66.67	69	23	0	79	147	400	468	9.5e-18	88.0	ABK03889.1
                mgm4447970.3|GF8803K01DKQT4	0001e05fd22fb2fee5c66d3bfc20f4d7	78.12	32	7	0	1	32	461	492	2.0e-07	54.0	ABK03889.1
                ..
                Now if you remove the first line and do:

                awk '{print $1"\t"$13"\t"$3"\t"$4"\t"$5"\t"$6"\t"$7"\t"$8"\t"$9"\t"$10"\t"$11"\t"$12"\t"}' mgm4447970.3_feature_GenBank.tab

                Output is:

                Code:
                mgm4447970.3|GF8803K01BBK1N	ABK03889.1	62.86	140	52	0	16	155	602	741	2.2e-40	163.0	
                mgm4447970.3|GF8803K01C9I15	ABK03889.1	66.67	69	23	0	79	147	400	468	9.5e-18	88.0	
                mgm4447970.3|GF8803K01DKQT4	ABK03889.1	78.12	32	7	0	1	32	461	492	2.0e-07	54.0
                ..
                Which looks a lot like tabular blast output to me ('qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore'). If it doesn't work as such, you should read MEGAN's manual carefully, especially the part about file formats and what it actually parses from tabular blast output. Also, I don't know if the e-values and bitscores are "proper", but if there is an "error", at least it should affect all the hits the same way.
                Last edited by rhinoceros; 09-13-2013, 12:41 AM.
                savetherhino.org

                Comment


                • #9
                  I'd like to look for functional genes on someone else's metagenome on MG RAST. I know I can search for annotated genes, but is there a way to do an alignment type search against data on the MG-RAST server?

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM
                  • seqadmin
                    Strategies for Sequencing Challenging Samples
                    by seqadmin


                    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                    03-22-2024, 06:39 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  25 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  29 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  25 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-04-2024, 09:00 AM
                  0 responses
                  52 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X