Okay, here is my attempt at a qucik perlscript to do this (see attachment). Hope some perl expert can make it less clumsy!
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Hello,
I am working with RNASEq data and we are to use 2 sets of 3 bam files each, i.e. 2 different conditions and 3 replicates per condition (which were obtained using Tophat2) with Cufflinks, Cuffmerge and finally Cuffdiff.
Cuffmerge requires a txt file with the full paths to the (in this case 6) transcripts.gtf files which was obtained on running cufflinks on each of the bam files. It also requires a reference genome fasta file and a gtf annotation file.
I chose to use gencode's latest version 'gencode.v12.annotation.gtf' with cuffmerge. I however am getting the following error, and I don't have an answer for it.
[Wed Jul 4 00:53:50 2012] Preparing output location ./merged_asm/
[Wed Jul 4 00:55:46 2012] Converting GTF files to SAM
[00:57:34] Loading reference annotation.
Error: duplicate GFF ID 'ENST00000361547.2' encountered!
[FAILED]
Error: could not execute gtf_to_sam
Traceback (most recent call last):
File "/share/apps/assembly/bin/cuffmerge", line 576, in ?
sys.exit(main())
File "/share/apps/assembly/bin/cuffmerge", line 554, in main
sam_input_files = convert_gtf_to_sam(gtf_input_files)
File "/share/apps/assembly/bin/cuffmerge", line 287, in convert_gtf_to_sam
sam_out = gtf_to_sam(line)
File "/share/apps/assembly/bin/cuffmerge", line 247, in gtf_to_sam
exit(1)
TypeError: 'str' object is not callable
Does anyone know what this means, or why it is happening? Can it be fixed? Any help would be greatly appreciated. Thanks
- Vinay
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-25-2024, 11:49 AM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
04-25-2024, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
18 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
62 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Comment