Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Convert .fna file from NCBI to .fa or .fasta file

    Hello,
    I am totally new to this (I am a student following a course in bio-informatics) and I wanted to use a genome found on NCBI (in .fna or genbank format with .gff annotation) as a reference genome in STAR as an exercise but I cannot find a way to convert the .fna file so the genome can be read by STAR in --genomeFastaFiles. It is a genome not found on normal genome database sites (UCSC e.g.) since it is from a copepod and not much genomic work is done on copepods...
    Is this even possible to use such a genome as a reference genome or is this a bad idea from the start?
    Thank you in advance,
    kind regards,
    Josefien

  • #2
    As far as I know, .fna just means fasta nucleic acid (as opposed to .faa, fasta amino acid, for protein sequences), so the file is actually in fasta format.

    Comment


    • #3
      The problem is; STAR is not recognizing this fasta format (.fna), I am getting an error that is impossible to read this fasta file, that why I wondered if it was not possible to convert from .fna to .fa. Or do you think it is a problem with the file itself and STAR is able to read/load .fna files?

      Comment


      • #4
        Just rename the .fna extension to .fa (as long as the file is in fasta format). That should work.

        Code:
        $ cp file.fna file.fa
        If you are not sure about the format of the file post the output of this command
        Code:
        $ head -10 file.fna

        Comment


        • #5
          Originally posted by GenoMax View Post
          Just rename the .fna extension to .fa (as long as the file is in fasta format). That should work.

          Code:
          $ cp file.fna file.fa
          If you are not sure about the format of the file post the output of this command
          Code:
          $ head -10 file.fna
          thank you very much it is working now !

          Comment


          • #6
            could you please explain that changing the file extension won't effect the results for which we are mapping with a reference genome in case of RNA-seq for HISAT2 software?

            Comment


            • #7
              No the results will not be affected since we are not changing sequence/content of any data files. We are only renaming the file.

              Comment


              • #8
                will the content is same in both files?

                Comment


                • #9
                  Yes. As long as you only change the file name.

                  Comment


                  • #10
                    Thank you for your help

                    Comment

                    Latest Articles

                    Collapse

                    • performena
                      Reply to Recent Advances in Sequencing Technologies
                      by performena
                      Performena is a digital-first creative agency based in the UAE. We craft resonant brand experiences through strategic consultancy, creative services, media planning, programmatic advertising, and custom technology solutions.

                      Our journey began with a vision to revolutionize digital marketing. We combine data-driven insights with creative storytelling to deliver impactful campaigns that drive measurable results. Our team of wild hearts and tech wizards work tirelessly to unlock your brand's...
                      Today, 05:26 AM
                    • seqadmin
                      Recent Advances in Sequencing Technologies
                      by seqadmin







                      Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                      Long-Read Sequencing
                      Long-read sequencing has...
                      Yesterday, 01:49 PM
                    • seqadmin
                      Genetic Variation in Immunogenetics and Antibody Diversity
                      by seqadmin



                      The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
                      11-06-2024, 07:24 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 09:29 AM
                    0 responses
                    14 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 09:06 AM
                    0 responses
                    11 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 08:03 AM
                    0 responses
                    11 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 11-22-2024, 07:36 AM
                    0 responses
                    65 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X