Seqanswers Leaderboard Ad

**mastal** · 05-02-2014, 12:19 PM

see if this works:

Code:

        my ($sample, $mir, $abun) = /(.+?)\t(.+)\t(.+)/;
        $h{$mir}{$sample} = $abun; 
}
foreach my $mir (sort keys %h){
        print "$mir\t";
        foreach my $sample (sort keys %{h{$mir}}){
                print "$$h{$mir}{$sample}\t;"
        }
        print "\n";
}

**pony2001mx** · 05-03-2014, 05:54 PM

Hi mastal, I still have problem, but thanks a lot anyway.

**SNPsaurus** · 05-03-2014, 06:49 PM

What problem are you still having? I think mastal re-organized it correctly. You want to print a line that has a mir, and then the value for each sample. So you would definitely want to have the outer loop be mir, and the inner loop be sample. That way it prints the mir, then on the same line prints each of the sample values.

mastal's code may have some typos in it (with Perl, it is difficult to tell the difference between a typo and brilliant code, so I am not sure), but I edited it and it works:

Code:

foreach my $mir (sort keys %h){
        print "$mir\t";
        foreach my $sample (sort keys %{$h{$mir}}){ # changed h{$mir} to $h{$mir}
                print "$h{$mir}{$sample}\t"; # changed $$h to $h and \t;" to \t";
        }
        print "\n";
}

when I made a little tester it outputs this:
m1 1.1 1.2 1.3
m2 2.1 2.2 2.3
which is correct.

**pony2001mx** · 05-04-2014, 05:28 AM

Hi SNPSaurus,
Thanks a lot for your comments! Actually it's not so easy. If the input data is as follows, then it's ok.

Code:

sample1	mir1	1.1
sample1	mir2	1.2
sample2	mir1	2.1
sample2	mir2	2.2

However, if the input data changes to below, it won't be what i expect. The problem is MISSING VALUE.

Code:

sample1	mir1	1.1
sample1	mir2	1.2
sample2	mir1	2.1
sample2	mir2	2.2
sample3	mir4	3.1

i am a beginner and am learning perl. I tried best to write a script as follows (i add some comments for easier understanding), but still have problem. Could you please check please? I appreciate your helps!

Code:

#!/usr/bin/perl
use strict;
use warnings;

open FH, '<', $ARGV[0] || die "open failed $!";
my %h;
my %h2;
while (<FH>){
        my ($sample, $mir, $abun) = /(\S+?)\t(\S+)\t(\S+)/;
        $h{$mir}{$sample} = $abun; 
		$h2{$sample} +=1; #increament to calculate total samples
}

foreach my $sample_h2 (sort keys %h2){ #print sample header 
	print "\t$sample_h2";
}
print "\n";

foreach my $mir (sort keys %h){
    print "$mir\t";  #print mir name
	foreach my $sample2(sort keys %h2){ #sort according to sample header
		foreach my $sample (sort keys %{$h{$mir}}){  #search sample name in %h2 from that in %h
			if ($sample eq $sample2) {  
				print "$h{$mir}{$sample}\t"; #when matched print 
				last;
			}
		}
	}
	print "\n";
}

**SNPsaurus** · 05-04-2014, 09:59 AM

I think I see what you are trying to do. Some mir don't have data for all samples. So you construct a list of samples separate from the hash of hashes. You go through the hash of samples, and then go through the list of samples in your hash of hashes, and if they match you print. This is probably better done with an "exist" check, and a printing of a blank if not present:

Code:

foreach my $mir (sort keys %h){
    print "$mir\t";  #print mir name
	foreach my $sample2(sort keys %h2){ #sort according to sample header
		if (exists $h{$mir}{$sample2}) {
				print "$h{$mir}{$sample2}\t"; #if exists print 
		} else {
			print "\t"; # print a blank if that sample doesn't exist for that mir
		}
	}
	print "\n";
}

**pony2001mx** · 05-04-2014, 05:12 PM

Hi SNPSaurus, Thank you very much! It's really good stuff for me to learn. Thanks.

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 22 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 20 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

Perl script: Make Statistics Of Mirna Abundances For Many Samples

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News