Header Leaderboard Ad

Collapse

What do the numbers mean in these RNA-Seq gene/transcript TPM files?

Collapse

Announcement

Collapse

SEQanswers June Challenge Has Begun!

The competition has begun! We're giving away a $50 Amazon gift card to the member who answers the most questions on our site during the month. We want to encourage our community members to share their knowledge and help each other out by answering questions related to sequencing technologies, genomics, and bioinformatics. The competition is open to all members of the site, and the winner will be announced at the beginning of July. Best of luck!

For a list of the official rules, visit (https://www.seqanswers.com/forum/sit...wledge-and-win)
See more
See less
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What do the numbers mean in these RNA-Seq gene/transcript TPM files?

    From the link https://gtexportal.org/home/datasets, under V7, I'm trying to do R/Python analyses on the Gene TPM and Transcript TPM files. But in these files (and to open them I had to use Universal Viewer since the files are too large to view with an app like NotePad), I'm seeing a bunch of ID's for samples (i.e. GTEX-1117F-0226-SM-5GZZ7), followed by transcript ID's like ENSG00000223972.4, and then a bunch of numbers like 0.02865 (and they take up like 99% of the large files). Can someone help me decipher what the numbers mean, please? And are the numbers supposed to be assigned to a specific sample ID? (The amount of letters far exceed the amount of samples, btw). I tried opening these files as tables in R but I do not think R is categorizing the contents of the file correctly.

    For context, I am planning to match males with females for sex comparison but in order to do that, I need to get R to categorize everything correctly. (I know that females have "F" where ####-#####-####-#x-##### where x is and males have "M").

  • #2
    Hey Macromind101

    Those numbers represent the TPM (Transcripts Per Million) for each sample. So the numbers are not assigned to a specific sample ID directly, instead, each number corresponds to the expression level of a specific gene or transcript in a specific sample.



    Comment

    Latest Articles

    Collapse

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 06-01-2023, 08:56 PM
    0 responses
    9 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 06-01-2023, 07:33 AM
    0 responses
    9 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 05-31-2023, 07:50 AM
    0 responses
    4 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 05-26-2023, 09:22 AM
    0 responses
    11 views
    0 likes
    Last Post seqadmin  
    Working...
    X