Seqanswers Leaderboard Ad

**boetsie** · 02-07-2013, 03:42 AM

Hi Vanisha,

I don't know if this solves your problem, but it looks like you declared the variable $sample but did not attach a name to it. So $sample is still empty.

Regards,
Boetsie

**Vanisha** · 02-07-2013, 04:28 AM

Hi Boetsie

I have a sample list relating to $sample in the command:
chomp (my @fastq_R1list=`ls ./300bp_fastq/${sample}*R1_001.fastq.gz`);

When I do:
chomp (my $sample = `ls ./sample`);

it still doesn't identify the sample list to input into ${sample}

**GenoMax** · 02-07-2013, 04:31 AM

Since you are using relative paths, are you running your perl script from a directory that is immediately above the "300bp_fastq" directory?

As "boetsie" pointed out above there is nothing attached to $sample. Are you actually getting a listing if you run "ls ./300bp_fastq/${sample}*R1_001.fastq.gz" on the shell prompt?

**Vanisha** · 02-07-2013, 04:52 AM

yes, this is the output from the command line:

ls ./300bp_fastq/${sample}*R1_001.fastq.gz | head
./300bp_fastq/sample1.fastq.gz
./300bp_fastq/sample2.fastq.gz
./300bp_fastq/sample3.fastq.gz
./300bp_fastq/sample4.fastq.gz
etc..

and I am running the script from directory above 300bp_fastq.

Do I need to use use a loop with the sample list, as I have done with the fastq files?

**GenoMax** · 02-07-2013, 05:18 AM

chomp (my @fastq_R1list=`ls ./300bp_fastq/${sample}*R1_001.fastq.gz`);
chomp (my @fastq_R2list=`ls ./300bp_fastq/${sample}*R2_001.fastq.gz`);

Remove "chomp" and try the run.

**Vanisha** · 02-07-2013, 05:26 AM

this is the start of the script now when removing chomp and declaring $sample

my @sample= `ls ./300bp_fastq/*.gz`);
my @fastq_R1list=`ls ./300bp_fastq/${sample}*R1_001.fastq.gz`);
my @fastq_R2list=`ls ./300bp_fastq/${sample}*R2_001.fastq.gz`);

I get these errors:
syntax error at TrimAlignFastq.pl line 18, near "`ls ./300bp_fastq/*.gz`)"
Global symbol "$sample" requires explicit package name at TrimAlignFastq.pl line 19.
syntax error at TrimAlignFastq.pl line 19, near "`ls ./300bp_fastq/${sample}*R1_001.fastq.gz`)"
Global symbol "$sample" requires explicit package name at TrimAlignFastq.pl line 20.
syntax error at TrimAlignFastq.pl line 20, near "`ls ./300bp_fastq/${sample}*R2_001.fastq.gz`)"

**boetsie** · 02-07-2013, 05:50 AM

You could just read in the file './samples' (assuming you have the names of the samples in here on each line) and go through each line:

open(IN,samples);
while(my $sample = <IN>){
chomp $sample;
print "reading sample $sample...\n"

my $file1 = "./300bp_fastq/${sample}*R1_001.fastq.gz" ;

my $file2 = "./300bp_fastq/${sample}*R2_001.fastq.gz" ;
etc...
}

**boetsie** · 02-07-2013, 05:56 AM

Or do something like:

open(IN, 'ls ./300bp_fastq/*.gz |');
while(my $sample = <IN>){
chomp $sample;
print "sample = $sample\n";
#etc...

}

Regards,
Boetsie

**Vanisha** · 02-07-2013, 06:32 AM

That's great - so I have read the samples in, but i now cannot specify read 1 and read 2:

CODE:
open(IN, 'ls ./300bp_fastq/*.gz |');
while(my $sample = <IN>){
chomp $sample;
print "sample = $sample\n";

my $fastq1 = "./300bp_fastq/${sample}*R1_00*.fastq.gz";
my $fastq2 = "./300bp_fastq/${sample}*R2_00*.fastq.gz";

OUTPUT:
sample = ./300bp_fastq/sample-1511075418_S97_L001_R1_001.fastq.gz
no fastq.gz file found for ./300bp_fastq/./300bp_fastq/sample-1511075418_S97_L001_R1_001.fastq.gz*R1_00*.fastq.gz

the output is the same even when I change the command to:
my $fastq1 = "./300bp_fastq/*R1_00*.fastq.gz";

**GenoMax** · 02-07-2013, 07:03 AM

Generally R1 and R2 would be listed one after the other. It will work in this case but you may want to explicitly match the R1/R2 in the name by pattern search just to be sure.

It looks like you are already picking up the full path in the $sample looking at the OUTPUT in post #10

OUTPUT:
sample = ./300bp_fastq/sample-1511075418_S97_L001_R1_001.fastq.gz

so you do not need to add the extra "./300bp_fastq/".

Topics	Statistics	Last Post
Telomere Maintenance by PARP1: A New Perspective in Cancer Research by seqadmin Started by seqadmin, 05-07-2024, 06:57 AM	0 responses 12 views 0 likes	Last Post by seqadmin 05-07-2024, 06:57 AM
Enhanced Neoantigen Detection: Introducing NeoHunter by seqadmin Started by seqadmin, 05-06-2024, 07:17 AM	0 responses 16 views 0 likes	Last Post by seqadmin 05-06-2024, 07:17 AM
A Close Examination at Probiotic-Related Bacteremia by seqadmin Started by seqadmin, 05-02-2024, 08:06 AM	0 responses 22 views 0 likes	Last Post by seqadmin 05-02-2024, 08:06 AM
Expanded Genetic Insights into Blood Pressure Regulation by seqadmin Started by seqadmin, 04-30-2024, 12:17 PM	0 responses 24 views 0 likes	Last Post by seqadmin 04-30-2024, 12:17 PM

Seqanswers Leaderboard Ad

Announcement

Novoalign alignment script help (perl)

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News