Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • chariko
    Member
    • Jun 2010
    • 56

    Command for getting a fragment from a sequence file

    Hi all,

    Could anyone tell me how to get from a sequence file the sequence which is delimited from certain positions? As an example:

    Original file:

    asdfgh
    qwert


    Get sequence from positions 2 to 8:

    I tried the following:

    cat original.txt | cut -c 2-8 >result file

    Result file:

    sdfgh
    wert

    Instead of that I would like to obtain:

    sdfghqwe

    Can anyone help me?
  • mastal
    Senior Member
    • Mar 2009
    • 666

    #2
    How long are the actual sequences you are working with?

    It looks like your command would work if your sequence was all on one line.

    Comment

    • chariko
      Member
      • Jun 2010
      • 56

      #3
      Originally posted by mastal View Post
      How long are the actual sequences you are working with?

      It looks like your command would work if your sequence was all on one line.
      You are right mastal, the command works if my sequence is all on one line, the problem is that sometimes it´s not.

      Actually my sequences are contigs with about 235000 nucleotides...

      Comment

      • Richard Finney
        Senior Member
        • Feb 2009
        • 701

        #4
        Consider tr command

        tr -d "\n" to eliminate carriage returns.

        Example:
        cat file | tr -d "\n" | cut -b1-7

        Comment

        • TiborNagy
          Senior Member
          • Mar 2010
          • 329

          #5
          Seqret from Emboss package can handle it:
          seqret input.fasta -sbegin -send

          Comment

          • chariko
            Member
            • Jun 2010
            • 56

            #6
            # Richard This could be an option thanks!!

            # TiborNagy I just installed the Emboss package run your command and your option is just what I was looking for. thank you very much!!!

            Comment

            • santhilalsubhash
              Member
              • May 2012
              • 19

              #7
              Try this

              cat original.txt | sed 's/\n//g' | cut -c 2-8 >result file

              OR

              Other option will be, you can parse your BED file through this command with a simple perl or bash script.
              fastacmd -d input.fasta -s Chromosome -L start,stop

              Download fastacmd linux binary from here. Because in new BLAST tool this has been replace by blastdbcmd which does not supports this and not available in NCBI website.
              Last edited by santhilalsubhash; 02-12-2014, 01:28 PM.

              Comment

              Latest Articles

              Collapse

              • GATTACAT
                Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by GATTACAT
                Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                07-01-2026, 11:43 AM
              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Yesterday, 11:08 AM
              0 responses
              7 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-30-2026, 05:37 AM
              0 responses
              11 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-26-2026, 11:10 AM
              0 responses
              19 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              53 views
              0 reactions
              Last Post SEQadmin2  
              Working...