Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • manvendra7
    Junior Member
    • Oct 2012
    • 7

    where is real problem, tophat2 code or my fastq files

    Dear FOlks,
    I am so new, an early stage researcher.

    I am using TopHat2 to map the reads, I guess, I am fulfilling all the requirements, my code is

    /usr/local/bin/tophat2 -p 8 -G ~/path/to/Homo_sapiens.GRCh37.72.gtf -o
    ~/path/to/Human_mapping_iPS_s7_rep1
    --splice-mismatches 1 --max-multihits 30 --microexon-search --fusion-search
    ~/path/to/bowtie2_index/hg19
    ~/path/to/myfile.fastq

    I am submitting on grid engine cluster with qsub -l h_vmem=50G [above_script]
    this is showing error as:
    """""TopHat requires all reads be either FASTQ or FASTA. Mixing formats is not supported"""

    I am bit frustrated because my fastq files look fine to me as shown in code

    @SOLEXA-GA05_00009_SRi_AD_MS_BN_VW:7:1:2364:933#ATGAGCA
    NGGCCTTCCCACATTCTTTACACTCATAGGTTTTCTCACCAGTGTGAGTTCTCTTGTGCACAATAAGGTAAGAGCC
    +SOLEXA-GA05_00009_SRi_AD_MS_BN_VW:7:1:2364:933#ATGAGCA
    !454478347;09977778<655476;69;8588380745<75;57495945158::=677976:7674:64763-

    Please help???????
    Manvendra Singh
    PhD student,
    Mobile DNA, MDC, Berlin
  • KatsenPlatz
    Junior Member
    • May 2010
    • 3

    #2
    My Guess is that there is a problem somewhere in the fastq file, although the first 4 lines look good! A few checks that you can do are:

    1. there are 4 lines for each sequence read in the file, i.e. total count of lines in the fastq file is 4x the total number of reads
    2. every first line of each record starts with @ and every third line starts with +
    3. the length of the quality sequence is the same as the length of the sequence read for every record

    Comment

    • GenoMax
      Senior Member
      • Feb 2008
      • 7142

      #3
      Looking at the sequence identifiers I wonder if this is old data from a GAII machine. It is then likely in the older illumina (1.3) Fastq format. If that is the case then you may need to add the relevant options for tophat to take that into account.

      From TopHat manual

      --solexa-quals Use the Solexa scale for quality values in FASTQ files.
      --solexa1.3-quals As of the Illumina GA pipeline version 1.3, quality scores are encoded in Phred-scaled base-64. Use this option for FASTQ files from pipeline 1.3 or later.

      Comment

      • manvendra7
        Junior Member
        • Oct 2012
        • 7

        #4
        Thanks guys,
        My Problem is figured out. There was a problem with my fastq file
        Manvendra Singh
        PhD student,
        Mobile DNA, MDC, Berlin

        Comment

        • arkilis
          Senior Member
          • Jul 2013
          • 119

          #5
          Originally posted by manvendra7 View Post
          Dear FOlks,
          I am so new, an early stage researcher.

          I am using TopHat2 to map the reads, I guess, I am fulfilling all the requirements, my code is

          /usr/local/bin/tophat2 -p 8 -G ~/path/to/Homo_sapiens.GRCh37.72.gtf -o
          ~/path/to/Human_mapping_iPS_s7_rep1
          --splice-mismatches 1 --max-multihits 30 --microexon-search --fusion-search
          ~/path/to/bowtie2_index/hg19
          ~/path/to/myfile.fastq

          I am submitting on grid engine cluster with qsub -l h_vmem=50G [above_script]
          this is showing error as:
          """""TopHat requires all reads be either FASTQ or FASTA. Mixing formats is not supported"""

          I am bit frustrated because my fastq files look fine to me as shown in code

          @SOLEXA-GA05_00009_SRi_AD_MS_BN_VW:7:1:2364:933#ATGAGCA
          NGGCCTTCCCACATTCTTTACACTCATAGGTTTTCTCACCAGTGTGAGTTCTCTTGTGCACAATAAGGTAAGAGCC
          +SOLEXA-GA05_00009_SRi_AD_MS_BN_VW:7:1:2364:933#ATGAGCA
          !454478347;09977778<655476;69;8588380745<75;57495945158::=677976:7674:64763-

          Please help???????
          For all what I know is there are diff verions of fastq format. you better have to check of the software requirements.

          FASTQ: three main versions, illumina 1.3+, 1.5+ and 1.8+

          Comment

          Latest Articles

          Collapse

          • GATTACAT
            Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by GATTACAT
            Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
            07-01-2026, 11:43 AM
          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Yesterday, 11:08 AM
          0 responses
          7 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-30-2026, 05:37 AM
          0 responses
          11 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-26-2026, 11:10 AM
          0 responses
          19 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          53 views
          0 reactions
          Last Post SEQadmin2  
          Working...