  • shotgun sequencing dataset

    Please send me the shotgun sequence dataset of 27 million sequencing reads in FASTA format so that I can find the genes in that sequence. Thanks in advance! If possible, can anyone send me a link for downloading the Celera-generated shotgun sequence?
    Last edited by Heena Farooq; 08-07-2015, 02:11 AM.

  • #2
    Unless you provide some additional information you are not going to get meaningful help.

    Why 27 million and why fasta format? You don't care about what organism the data is from?


    • #3
      Actually, I have read the paper "Whole genome shotgun assembly and comparison of human genome assemblies", and I need to use its dataset: the shotgun dataset of 27 million sequencing reads generated by Celera in 2001. I need to review this paper by calculating the accuracy of its gene prediction, but I am not able to find the DNA sequence of the dataset so that I can find genes in it. Please help me out.


      • #4
        I assume you are referring to this paper: http://www.pnas.org/content/101/7/1916.full

        Abbreviations: WGSA, whole-genome shotgun assembly; CSA, compartmental shotgun assembly; WGA, whole-genome assembly.

        Data deposition: The sequences of the assemblies herein referred to as WGSA, CSA, and WGA have been deposited in the GenBank database (whole-genome assembly project accession nos. AADD00000000, AADC00000000, and AADB00000000).

        If so, the projects in that paper can be found here (these links now seem to point to the current versions of the human genome, though):

        Download a genome data package including genome, transcript and protein sequence, annotation and a data report

        Download a genome data package including genome, transcript and protein sequence, annotation and a data report


        The third one (AADB00000000) does not seem to be in GenBank.
        Last edited by GenoMax; 08-07-2015, 07:18 AM.


        • #5
          Yes, exactly, you understood me correctly: that is the paper. I am new to this field, so I hope you don't mind. For example, AADC00000000 contains a number of sequences; it has 169156 contigs. Can you please tell me which sequence I need to use and how I can use it? Thanks for your concern, and please help.


          • #6
            It is not clear what you are trying to do.

            A set of genes was predicted in the early 2000s based on the first assembly of the human genome, but that number has been shrinking over the years (see: http://www.the-scientist.com/?articl...Shrinks-Again/).

            The current human genome contains fairly well-organized chromosome sequences, which you can download from this link (in FASTA format): ftp://ftp.ncbi.nlm.nih.gov/genomes/a...genomic.fna.gz

            I am not sure where you see 169156 contigs. The earliest dataset I can see on UCSC is from May 2000: http://hgdownload.soe.ucsc.edu/golde...y2000/bigZips/
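
            If it helps to check how many sequences (contigs or chromosomes) a downloaded FASTA file actually contains, something like the rough sketch below would do it using only the Python standard library. The function name `count_fasta_records` and the file name in the usage note are made up for illustration; they are not part of any NCBI or UCSC tool.

```python
import gzip


def count_fasta_records(path):
    """Return (number of sequences, total bases) for a FASTA file.

    Works on plain-text or gzip-compressed (.gz) files. Each line that
    starts with '>' is counted as the header of one sequence.
    """
    opener = gzip.open if str(path).endswith(".gz") else open
    n_records = 0
    n_bases = 0
    with opener(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                n_records += 1  # header line: one new sequence
            else:
                n_bases += len(line)  # sequence line: count its characters
    return n_records, n_bases
```

            Calling `count_fasta_records("my_download.fna.gz")` (a hypothetical local file name) returns the sequence count and the total number of bases, which can then be compared with the figure reported on the project page.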


            • #7
              OK, let me explain. I have to find mutated genes, and for that I need a dataset; that is why I am reading the previously mentioned paper (1916.full). I am trying to take one dataset containing disease-causing genes and one containing the whole human genome, and after that I will find the mutated genes from these two. I now have the dataset you mentioned above, but I am confused about which of its sequences I should use. You can also see 169156 contigs given as the locus length of dataset AADC00000000. Above all, if you don't mind, can you give me directions for carrying out the process I described? I hope this is clearer now. Thanks.


              • #8
                Please send me the algorithm for the GeneZilla gene finder. I will be thankful to you.