Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • RockChalkJayhawk
    Senior Member
    • Mar 2009
    • 192

    Cuffdiff short name

    I tired running cuffdiff on two RNA samples: Liver & Intestine.
    After aligning with tophat, I ran cuffcompare
    Code:
    cuffcompare -o GFF -r ../hg19.RefFlat.GFF3 ../Liver/transcripts.gtf ../Lung/transcripts.gtf
    Code:
    cuffdiff GFF.combined.gtf ../Liver/accepted_hits.sam ../Lung/accepted_hits.sam
    but my genes file looks like this:
    Code:
    ref_trans_id	class_code	gene_short_name	tss_id	locus	q0_FPKM	q0_conf_lo	q0_conf_hi	q1_FPKM	q1_conf_lo	q1_conf_hi	ref_id
    XLOC_000001-[chr22:17517459-17539682]	-	-	-	chr22:17517459-17539682	0	0	0	0.56927	0	1.37434	-
    XLOC_000002-[chr22:17565848-17591387]	-	-	-	chr22:17565848-17591387	2.37959	0.696966	4.06222	15.9232	12.1168	19.7296	-
    XLOC_000003-[chr22:17956627-18033845]	-	-	-	chr22:17956627-18033845	0.540063	0	1.16367	0.413014	0	0.889921	-
    XLOC_000004-[chr22:18043182-18073647]	-	-	-	chr22:18043182-18073647	92.8549	79.4174	106.292	1.11535	0	2.40326	-
    XLOC_000005-[chr22:18121484-18211987]	-	-	-	chr22:18121484-18211987	2.41355	0.706914	4.12019	1.38433	0.254029	2.51463	-
    XLOC_000006-[chr22:18560685-18572206]	-	-	-	chr22:18560685-18572206	3.25714	0.953993	5.56028	3.11362	1.1444	5.08285	-
    XLOC_000007-[chr22:18593558-18614498]	-	-	-	chr22:18593558-18614498	3.26743	0.704014	5.83085	0.960786	0	2.17638	-
    XLOC_000008-[chr22:18632757-18660160]	-	-	-	chr22:18632757-18660160	3.48871	0.849702	6.12772	3.43404	1.14438	5.7237	-
    XLOC_000009-[chr22:18893735-18899600]	-	-	-	chr22:18893735-18899600	25.8998	16.5909	35.2086	0	0	0	-
    My transcripts* and genes files have the correct gene and transcript names:
    Code:
    gene_id	bundle_id	chr	left	right	FPKM
    SAMD11	45228	chr1	861120	879961	0.794517
    NOC2L	45228	chr1	879583	894679	25.374
    ISG15	45228	chr1	948846	949915	3.06525
    AGRN	45228	chr1	955502	991492	8.73339
    C1orf159	45228	chr1	1017197	1051736	1.44667
    SDF4	45228	chr1	1152288	1167447	89.863
    UBE2J2	45228	chr1	1189293	1209234	12.5814
    ACAP3	45228	chr1	1227763	1243269	5.66814
    PUSL1	45228	chr1	1243993	1247056	2.45665
    Any clues?
  • DrD2009
    Member
    • Oct 2009
    • 88

    #2
    Sorry I can't really help you with your problem, but I'm trying to use Cufflinks and I see you have made progress where I have been failing and was wondering if you could help me.

    I've been trying to supply a GFF file when I run cuffcompare, but the names are never assigned to the transcripts and they are all classified as class "u" or "."

    Code:
    brandon@brandon-desktop:~/arab/small$ cuffcompare -o 162_162E -r ~/Desktop/tair9_small_RNAs.gff 162/162.cufflinks/transcripts.gtf 162E/162E.cufflinks/transcripts.gtf 
    Warning: found 79213 transcripts with undetermined strand.
    Warning: found 64824 transcripts with undetermined strand.
    Any idea what I could be doing wrong? This problem has been holding me up for a while.

    Comment

    • RockChalkJayhawk
      Senior Member
      • Mar 2009
      • 192

      #3
      Originally posted by DrD2009 View Post
      Sorry I can't really help you with your problem, but I'm trying to use Cufflinks and I see you have made progress where I have been failing and was wondering if you could help me.

      I've been trying to supply a GFF file when I run cuffcompare, but the names are never assigned to the transcripts and they are all classified as class "u" or "."

      Code:
      brandon@brandon-desktop:~/arab/small$ cuffcompare -o 162_162E -r ~/Desktop/tair9_small_RNAs.gff 162/162.cufflinks/transcripts.gtf 162E/162E.cufflinks/transcripts.gtf 
      Warning: found 79213 transcripts with undetermined strand.
      Warning: found 64824 transcripts with undetermined strand.
      Any idea what I could be doing wrong? This problem has been holding me up for a while.
      You might check your reference file to see if the stand information is in the correct place.

      I really wish there were some repository where we all could download the same files and make this process a lot easier!

      Comment

      • DrD2009
        Member
        • Oct 2009
        • 88

        #4
        Thanks. I'll double check the file.

        I agree. One day it will be done, probably after I stop working on this stuff. lol

        Comment

        • DrD2009
          Member
          • Oct 2009
          • 88

          #5
          I was wondering if you could post a few lines of the GFF file you used to annotate your reads for cuffcompare?

          I've tried GFF and GFF3, but the reads still came out without annotations.

          Are you able to supply cufflinks with a GFF as well to provide annotation? The manual only mentions GTF files.

          Thanks again,
          Brandon

          Comment

          • RockChalkJayhawk
            Senior Member
            • Mar 2009
            • 192

            #6
            Originally posted by DrD2009 View Post
            I was wondering if you could post a few lines of the GFF file you used to annotate your reads for cuffcompare?

            I've tried GFF and GFF3, but the reads still came out without annotations.

            Are you able to supply cufflinks with a GFF as well to provide annotation? The manual only mentions GTF files.

            Thanks again,
            Brandon
            They only take GTF. I can send you the GTF file if you like. this is how I run the analysis:
            tophat -p 5 - SampleName Sample.fq

            cufflinks -p 5 -L Sample1 -G hg19.Ens.GTF Sample1.sam

            cuffcompare -o Sample1 -r hg19.Ens.GTF -R Sample1.transcripts.gtf Sample2.transcripts.gtf

            cuffdiff -p 5 combined.gtf Sample1.sam Sample2.sam
            The GTF looks like this:
            chr1 protein_coding CDS 67050223 67050289 . + 1 gene_id "ENSG00000173020"; transcript_id "ENST00000308595"; exon_number "14"; gene_name "ADRBK1"; transcript_name "ADRBK1-201"; protein_id "ENSP00000312262";
            chr1 protein_coding CDS 67050223 67050289 . + 1 gene_id "ENSG00000173020"; transcript_id "ENST00000416281"; exon_number "2"; gene_name "ADRBK1"; transcript_name "ADRBK1-202"; protein_id "ENSP00000407159";
            chr1 protein_coding CDS 67050599 67050699 . + 0 gene_id "ENSG00000173020"; transcript_id "ENST00000308595"; exon_number "15"; gene_name "ADRBK1"; transcript_name "ADRBK1-201"; protein_id "ENSP00000312262";
            I can send you the GTF if you like.

            Comment

            • DrD2009
              Member
              • Oct 2009
              • 88

              #7
              Ahh, then that is my problem. The organism I work on, Arabidopsis thaliana, has no published GTF files anywhere that I have been able to locate to provide annotation for Cufflinks.

              I have only GFF files. I might try creating GTFs out of my GFFs and seeing if I can provide annotation with Cufflinks that way.

              Thank you for all of your help and fast replies. I really appreciate it.

              Comment

              • mrfox
                Senior Member
                • Aug 2010
                • 103

                #8
                isoforms with cufflinks and cuffdiff

                Hi everyone,
                I have questions on the rebuilt isofroms/transcripts in the different steps of Cufflinks tool. I found that the rebuilt transcripts recovered by cufflinks and the transcripts in isoform_exp.diff created by cuffdiff are different. My questions are:
                1)Cuffdiff re-assign the reads, rebuild the transcripts, and re-estimate the expression level?
                2)How can I track the structure of the transcripts in isoform_exp.diff?

                Thank you.

                Comment

                • mrfox
                  Senior Member
                  • Aug 2010
                  • 103

                  #9
                  sorry i posted the above thread in the wrong place.

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                    Here are nine questions we think about, in roughly the order they matter, before...
                    Today, 07:11 AM
                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM
                  • SEQadmin2
                    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                    by SEQadmin2


                    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                    Introduction

                    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                    05-22-2026, 06:42 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, Yesterday, 06:09 AM
                  0 responses
                  16 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-09-2026, 11:58 AM
                  0 responses
                  36 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-05-2026, 10:09 AM
                  0 responses
                  42 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-04-2026, 08:59 AM
                  0 responses
                  49 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...