Unconfigured Ad

**GenoMax** · 03-31-2014, 04:48 AM

See if this thread helps: http://stackoverflow.com/questions/1...a-file-in-bash

**blakeoft** · 03-31-2014, 04:59 AM

Is your input the following:

T T T T
C C C C
C C C C
C C C C
A A A A
A A A A
T T T T
T T T T
C C C C
G G G G
G G G G
C C C C

and do you want to get

TCCCAATTCGGC
TCCCAATTCGGC
TCCCAATTCGGC
TCCCAATTCGGC

back as a result?

**GenoMax** · 03-31-2014, 05:04 AM

I think @musta1234 wants the matrix transposed and then converted to a multi-fasta file.

>SNP_001
TCCCAATTCGGC
>SNP_002
TCCCAATTCGGC
>SNP_003
TCCCAATTCGGC

**musta1234** · 03-31-2014, 11:17 AM

Thats right

Sorry for the sloppy explanation, but all the nucleotides are from a tab delimited file and Genomax stated the way I want it perfectly.

>SNP_001
TCCCAATTCGGC
>SNP_002
TCCCAATTCGGC
>SNP_003
TCCCAATTCGGC

......

SNP_XXX
ATGCATGCATGC

Thanks

**GenoMax** · 03-31-2014, 03:07 PM

This is a bash shell script based on a solution in the stackoverflow thread I had posted above.

Save the code in a file (script.sh in example below) and then run as follows:

Code:

$ sh script.sh your_data file

Code:

#!/bin/bash 
declare -a array=( )                      # we build a 1-D-array

read -a line < "$1"                       # read the headline

COLS=${#line[@]}                          # save number of columns

index=0
while read -a line; do
    for (( COUNTER=0; COUNTER<${#line[@]}; COUNTER++ )); do
        array[$index]=${line[$COUNTER]}
        ((index++))
    done
done < "$1"

for (( ROW = 0; ROW < COLS; ROW++ )); do
        printf ">"
  for (( COUNTER = ROW; COUNTER < ${#array[@]}; COUNTER += COLS )); do
    printf "%s" ${array[$COUNTER]}
    if [ $COUNTER == $ROW ]
    then
        printf "\n"
    fi
  done
  printf "\n" 
done

**musta1234** · 04-01-2014, 10:39 PM

Thanks

I will definitely give it a try...

**musta1234** · 04-08-2014, 05:08 PM

Works GREAT!!!

Hey Genomax and all!!

The code works great... handles a file with 160 columns and 128,000 lines very well.

Thanks

Originally posted by GenoMax View Post

This is a bash shell script based on a solution in the stackoverflow thread I had posted above.

Save the code in a file (script.sh in example below) and then run as follows:

Code:

$ sh script.sh your_data file

Code:

#!/bin/bash 
declare -a array=( )                      # we build a 1-D-array

read -a line < "$1"                       # read the headline

COLS=${#line[@]}                          # save number of columns

index=0
while read -a line; do
    for (( COUNTER=0; COUNTER<${#line[@]}; COUNTER++ )); do
        array[$index]=${line[$COUNTER]}
        ((index++))
    done
done < "$1"

for (( ROW = 0; ROW < COLS; ROW++ )); do
        printf ">"
  for (( COUNTER = ROW; COUNTER < ${#array[@]}; COUNTER += COLS )); do
    printf "%s" ${array[$COUNTER]}
    if [ $COUNTER == $ROW ]
    then
        printf "\n"
    fi
  done
  printf "\n" 
done

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 12 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 14 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 20 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 54 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

Help with While []... done in bash

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News