Additional answer
I had the same issue and found your post helpful. To solve it, I opened the file in Notepad and changed the encoding from Unicode to ANSI, after which it imported cleanly into R.
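For several count files, the same re-encoding can be done from R instead of by hand. This is only a minimal sketch: it assumes the files are what Notepad calls "Unicode" (UTF-16LE), and `reencode_count_file` is a hypothetical helper name, not part of DESeq2 or HTSeq.

```r
# Hypothetical helper (not a DESeq2 function): read one UTF-16LE text file
# and write it back out as plain text in the native encoding.
# Assumption: the input really is UTF-16LE, as Notepad's "Unicode" option produces.
reencode_count_file <- function(infile, outfile) {
  con <- file(infile, open = "r", encoding = "UTF-16LE")
  on.exit(close(con))            # always release the connection
  lines <- readLines(con)        # lines are decoded while reading
  writeLines(lines, outfile)     # written without the embedded nul bytes
  invisible(length(lines))
}

# Example use (the output folder name is an assumption):
# in_dir  <- "C:/Python27/SKMEL-5/ALL"
# out_dir <- "C:/Python27/SKMEL-5/ALL_converted"
# dir.create(out_dir, showWarnings = FALSE)
# for (f in list.files(in_dir)) {
#   reencode_count_file(file.path(in_dir, f), file.path(out_dir, f))
# }
```

After converting, point `directory` at the folder with the converted files and rerun `DESeqDataSetFromHTSeqCount`.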
Error at Creating Count Table for DESeq2
I have used the TopHat-Cuffdiff pipeline so far, but I want to give DESeq2 a try. I have 2 conditions with 3 replicates each, and the aim is to find the differentially expressed genes.
For a couple of days I have been trying to use HTSeq to prepare my count files. I think I managed that, but now I am stuck at creating the count table to use as DESeq2 input.
I haven't used R much so far, so I am having difficulties. Here is the problem:
Code:
> library('DESeq2')
Loading required package: GenomicRanges
Loading required package: BiocGenerics
Loading required package: parallel

Attaching package: ‘BiocGenerics’

The following objects are masked from ‘package:parallel’:

    clusterApply, clusterApplyLB, clusterCall, clusterEvalQ, clusterExport, clusterMap, parApply, parCapply, parLapply, parLapplyLB, parRapply, parSapply, parSapplyLB

The following object is masked from ‘package:stats’:

    xtabs

The following objects are masked from ‘package:base’:

    anyDuplicated, append, as.data.frame, as.vector, cbind, colnames, duplicated, eval, evalq, Filter, Find, get, intersect, is.unsorted, lapply, Map, mapply, match, mget, order, paste, pmax, pmax.int, pmin, pmin.int, Position, rank, rbind, Reduce, rep.int, rownames, sapply, setdiff, sort, table, tapply, union, unique, unlist

Loading required package: IRanges
Loading required package: XVector
Loading required package: Rcpp
Loading required package: RcppArmadillo
> setwd("C:/Python27/SKMEL-5")
> directory<-"C:/Python27/SKMEL-5/ALL"
> sampleFiles <- grep("SKMEL-5",list.files(directory),value=TRUE)
> sampleCondition<-c("KD","KD","KD","WT","WT","WT")
> sampleTable<-data.frame(sampleName=sampleFiles, fileName=sampleFiles, condition=sampleCondition)
> sampleTable
       sampleName        fileName condition
1 SKMEL-5_I-1.txt SKMEL-5_I-1.txt        KD
2 SKMEL-5_I-2.txt SKMEL-5_I-2.txt        KD
3 SKMEL-5_I-3.txt SKMEL-5_I-3.txt        KD
4 SKMEL-5_L-1.txt SKMEL-5_L-1.txt        WT
5 SKMEL-5_L-2.txt SKMEL-5_L-2.txt        WT
6 SKMEL-5_L-3.txt SKMEL-5_L-3.txt        WT
> ddsHTSeq<-DESeqDataSetFromHTSeqCount(sampleTable=sampleTable, directory=directory, design=~condition)
Error in DESeqDataSetFromHTSeqCount(sampleTable = sampleTable, directory = directory, :
  Gene IDs (first column) differ between files.
In addition: There were 36 warnings (use warnings() to see them)
Code:
Warning messages:
1: In read.table(file.path(directory, fn)) : line 1 appears to contain embedded nulls
2: In read.table(file.path(directory, fn)) : line 2 appears to contain embedded nulls
3: In read.table(file.path(directory, fn)) : line 3 appears to contain embedded nulls
4: In read.table(file.path(directory, fn)) : line 4 appears to contain embedded nulls
5: In read.table(file.path(directory, fn)) : line 5 appears to contain embedded nulls
6: In scan(file = file, what = what, sep = sep, quote = quote, ... : embedded nul(s) found in input
7: In read.table(file.path(directory, fn)) : line 1 appears to contain embedded nulls
8: In read.table(file.path(directory, fn)) : line 2 appears to contain embedded nulls
9: In read.table(file.path(directory, fn)) : line 3 appears to contain embedded nulls
10: In read.table(file.path(directory, fn)) : line 4 appears to contain embedded nulls
11: In read.table(file.path(directory, fn)) : line 5 appears to contain embedded nulls
12: In scan(file = file, what = what, sep = sep, quote = quote, ... : embedded nul(s) found in input
13: In read.table(file.path(directory, fn)) : line 1 appears to contain embedded nulls
14: In read.table(file.path(directory, fn)) : line 2 appears to contain embedded nulls
15: In read.table(file.path(directory, fn)) : line 3 appears to contain embedded nulls
16: In read.table(file.path(directory, fn)) : line 4 appears to contain embedded nulls
17: In read.table(file.path(directory, fn)) : line 5 appears to contain embedded nulls
18: In scan(file = file, what = what, sep = sep, quote = quote, ... : embedded nul(s) found in input
19: In read.table(file.path(directory, fn)) : line 1 appears to contain embedded nulls
20: In read.table(file.path(directory, fn)) : line 2 appears to contain embedded nulls
21: In read.table(file.path(directory, fn)) : line 3 appears to contain embedded nulls
22: In read.table(file.path(directory, fn)) : line 4 appears to contain embedded nulls
23: In read.table(file.path(directory, fn)) : line 5 appears to contain embedded nulls
24: In scan(file = file, what = what, sep = sep, quote = quote, ... : embedded nul(s) found in input
25: In read.table(file.path(directory, fn)) : line 1 appears to contain embedded nulls
26: In read.table(file.path(directory, fn)) : line 2 appears to contain embedded nulls
27: In read.table(file.path(directory, fn)) : line 3 appears to contain embedded nulls
28: In read.table(file.path(directory, fn)) : line 4 appears to contain embedded nulls
29: In read.table(file.path(directory, fn)) : line 5 appears to contain embedded nulls
30: In scan(file = file, what = what, sep = sep, quote = quote, ... : embedded nul(s) found in input
31: In read.table(file.path(directory, fn)) : line 1 appears to contain embedded nulls
32: In read.table(file.path(directory, fn)) : line 2 appears to contain embedded nulls
33: In read.table(file.path(directory, fn)) : line 3 appears to contain embedded nulls
34: In read.table(file.path(directory, fn)) : line 4 appears to contain embedded nulls
35: In read.table(file.path(directory, fn)) : line 5 appears to contain embedded nulls
36: In scan(file = file, what = what, sep = sep, quote = quote, ... : embedded nul(s) found in input
I know this is a very basic problem, but as an R newbie I have no idea what is causing it.
Last edited by sazz; 03-23-2014, 04:27 AM.
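"Embedded nulls" warnings like these commonly appear when a two-byte (UTF-16) text file is read as single-byte text, since every ASCII character is then followed by a zero byte. A rough way to check a count file before importing it is to look at its first bytes. This is a heuristic sketch only, and `looks_utf16` is a hypothetical helper name, not a function from R or DESeq2.

```r
# Hypothetical helper: guess whether a file is UTF-16 by inspecting raw bytes.
# Heuristic only: a UTF-16 byte order mark or any zero byte in the first
# n bytes is taken as a sign that the file is not plain single-byte text.
looks_utf16 <- function(path, n = 200L) {
  bytes <- readBin(path, what = "raw", n = n)
  has_bom <- length(bytes) >= 2 &&
    (identical(bytes[1:2], as.raw(c(0xff, 0xfe))) ||   # UTF-16LE BOM
     identical(bytes[1:2], as.raw(c(0xfe, 0xff))))     # UTF-16BE BOM
  has_nul <- any(bytes == as.raw(0))
  has_bom || has_nul
}

# Example use on the files from the post:
# sapply(file.path(directory, sampleFiles), looks_utf16)
```

If this returns TRUE for the count files, they need to be saved or converted to a single-byte encoding before `read.table` can parse them.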