**GenoMax** · 03-31-2014, 04:48 AM

See if this thread helps: http://stackoverflow.com/questions/1...a-file-in-bash

**blakeoft** · 03-31-2014, 04:59 AM

Is your input the following:

T T T T
C C C C
C C C C
C C C C
A A A A
A A A A
T T T T
T T T T
C C C C
G G G G
G G G G
C C C C

and do you want to get

TCCCAATTCGGC
TCCCAATTCGGC
TCCCAATTCGGC
TCCCAATTCGGC

back as a result?

**GenoMax** · 03-31-2014, 05:04 AM

I think @musta1234 wants the matrix transposed and then converted to a multi-fasta file.

>SNP_001
TCCCAATTCGGC
>SNP_002
TCCCAATTCGGC
>SNP_003
TCCCAATTCGGC
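
For an input without a header row, like the matrix in blakeoft's post, the transposition described here could be sketched with awk as below. The function name `matrix_to_fasta` and the generated `SNP_001`-style record names are assumptions for illustration; the real names would have to come from the poster's data.

```shell
#!/bin/bash
# Hypothetical helper: transpose a whitespace-delimited matrix that has NO
# header row and print each column as a FASTA record (SNP_001, SNP_002, ...).
matrix_to_fasta() {
    awk '
        {
            # Append the field from this row to the sequence of its column.
            for (i = 1; i <= NF; i++) seq[i] = seq[i] $i
            if (NF > ncols) ncols = NF
        }
        END {
            for (i = 1; i <= ncols; i++) printf ">SNP_%03d\n%s\n", i, seq[i]
        }
    ' "$1"
}

# Example: a tab-delimited matrix with 3 rows and 2 columns.
example=$(mktemp)
printf 'T\tA\nC\tG\nC\tT\n' > "$example"
matrix_to_fasta "$example"
# prints:
# >SNP_001
# TCC
# >SNP_002
# AGT
rm -f "$example"
```

Each column is accumulated as one string, so memory use grows with the size of the whole file, much like the array-based approach.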

**musta1234** · 03-31-2014, 11:17 AM

That's right

Sorry for the sloppy explanation. All the nucleotides are in a tab-delimited file, and GenoMax described the output I want perfectly.

>SNP_001
TCCCAATTCGGC
>SNP_002
TCCCAATTCGGC
>SNP_003
TCCCAATTCGGC

......

>SNP_XXX
ATGCATGCATGC

Thanks

**GenoMax** · 03-31-2014, 03:07 PM

This is a bash shell script based on a solution in the Stack Overflow thread I posted above.

Save the code in a file (script.sh in the example below) and run it with bash rather than sh, since it relies on bash arrays:

Code:

$ bash script.sh your_data_file

Code:

#!/bin/bash
# Transpose a whitespace-delimited matrix and print each column as a FASTA record.
# The first field of each column (i.e. the first row) becomes the record name.
declare -a array=( )                      # flat 1-D array of all fields, row by row

read -r -a line < "$1"                    # read the header line
COLS=${#line[@]}                          # save number of columns

index=0
while read -r -a line; do
    for (( COUNTER=0; COUNTER<${#line[@]}; COUNTER++ )); do
        array[$index]=${line[$COUNTER]}
        ((index++))
    done
done < "$1"

for (( ROW = 0; ROW < COLS; ROW++ )); do
    printf ">"
    for (( COUNTER = ROW; COUNTER < ${#array[@]}; COUNTER += COLS )); do
        printf "%s" "${array[$COUNTER]}"
        if (( COUNTER == ROW )); then     # end the name line after the first field
            printf "\n"
        fi
    done
    printf "\n"
done
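
The script above walks the flat array in strides of COLS, so it assumes every row has the same number of fields as the first line. A minimal sketch of a check that could be run beforehand follows; `check_rectangular` is a hypothetical name, and skipping blank lines is an assumption (the `read -a` loop adds nothing to the array for them).

```shell
#!/bin/bash
# Hypothetical pre-flight check: succeed only if every non-blank line has the
# same number of whitespace-separated fields as the first non-blank line.
check_rectangular() {
    awk '
        NF == 0 { next }                      # ignore blank lines
        !seen   { ncols = NF; seen = 1 }      # first non-blank line sets the width
        NF != ncols {
            printf "line %d: %d fields, expected %d\n", NR, NF, ncols > "/dev/stderr"
            bad = 1
        }
        END { exit bad + 0 }                  # non-zero exit if any line was ragged
    ' "$1"
}

# Usage (your_data_file is a placeholder):
#   check_rectangular your_data_file && bash script.sh your_data_file
```

A ragged row would shift every later field into the wrong column, so failing early is cheaper than debugging scrambled sequences afterwards.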

**musta1234** · 04-01-2014, 10:39 PM

Thanks

I will definitely give it a try...

**musta1234** · 04-08-2014, 05:08 PM

Works GREAT!!!

Hey GenoMax and all!!

The code works great... handles a file with 160 columns and 128,000 lines very well.

Thanks

Help with While []... done in bash
