Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • nisha
    Junior Member
    • Jun 2009
    • 5

    error while maaping paired end reads in Maq

    Hi,

    This is my first post.
    I am running maq map for paired end SOLiD reads and ive gone through all the initial procedures of buildinf .bfq files and ref.csbfa sequence

    when i run the command:
    maq map -c aln.cs.map ref.csbfa in.read1.bfq in.read2.bfq 2>aln.log

    i always get the following error:
    maq: read.cc:61: longreads_t* ma_load_reads(void*, int, void*, int): Assertion `strncmp(name, lr->name[j], tl-1) == 0' failed.

    Could someone please tell me what this mean? What are the strings that are being compared? i checked the name of the reads in both the read1 and read2 fastq files and they are matching with /1 and /2 respectively for the read pairs?

    Also the length of the reads are the same in both the files.

    Any help will be appreciated.

    Thanks,
    N
  • der_eiskern
    Member
    • Jul 2009
    • 46

    #2
    maq + solid

    yeah, i'm getting the same error. i've not been able to figure it out yet but in the meantime i ran single ends and got 70% of the reads mappable. did you figure it out? i'd be interested in hearing about your problems.

    btw, to introduce myself, i'm der_eiskern and at the moment i'm doing whole genome sequencing with both SOLiD and Illumina platforms.

    cheers.

    Comment

    • nilshomer
      Nils Homer
      • Nov 2008
      • 1283

      #3
      Originally posted by der_eiskern View Post
      yeah, i'm getting the same error. i've not been able to figure it out yet but in the meantime i ran single ends and got 70% of the reads mappable. did you figure it out? i'd be interested in hearing about your problems.

      btw, to introduce myself, i'm der_eiskern and at the moment i'm doing whole genome sequencing with both SOLiD and Illumina platforms.

      cheers.
      Did you check the length of the quality strings? MAQ's convert script can output too long of a quality string if there are "-1" qualities. This fixed the problem for me.

      Comment

      • der_eiskern
        Member
        • Jul 2009
        • 46

        #4
        the quality strings in my fastq files are 50 bp and my read length is 50 bp. I didn't generate the files myself. just my task to run them. i haven't written a bash script to check the length of every single read though...so would just a single aberrant length stop maq completely before it begins?

        any other ideas, nils? I'm scratching my head because maq gives me the error suggesting the two reads are of different lengths still. I'm hoping that my 70% Mappability will increase when i can get paired end assignments working.

        thanks.

        Comment

        • nisha
          Junior Member
          • Jun 2009
          • 5

          #5
          hi der_eiskern,

          Yeah repeating what nilshomer mentioned ...

          Yes i figured out what the problem is. I'm assuming it would be the same problem for you.

          The *.qual files containing the qualities for both the F3 and R3 reads have negative values, mainly -1. So when the solid2fastq.pl creates the fastq files it does not handle these negative values correctly (treating the "-" and "1" as separate entities) and the length of the quality string is not equal to the length of the read string.

          You would have to change the script a bit to handle this problem.

          hope this helps.

          N

          Comment

          • nilshomer
            Nils Homer
            • Nov 2008
            • 1283

            #6
            Originally posted by nisha View Post
            hi der_eiskern,

            Yeah repeating what nilshomer mentioned ...

            Yes i figured out what the problem is. I'm assuming it would be the same problem for you.

            The *.qual files containing the qualities for both the F3 and R3 reads have negative values, mainly -1. So when the solid2fastq.pl creates the fastq files it does not handle these negative values correctly (treating the "-" and "1" as separate entities) and the length of the quality string is not equal to the length of the read string.

            You would have to change the script a bit to handle this problem.

            hope this helps.

            N
            This is what I mentioned above. I have emailed Heng Li (MAQ's author) about the problem, but it should be a one liner in his code.

            Comment

            • der_eiskern
              Member
              • Jul 2009
              • 46

              #7
              thanks. this is a part of an output where i printed every quality string

              (*)&-"-"-",&&'&''*)',&&,-"1&+&))))1-"&&)/-")&',&(&-)-"-"'
              <;>=-"-"-"7=><1=>?:<>;>A-"=;<88=:=6-"=0:<-">8?(;9;8,-"-",
              5&,;-"-"-":&/,/(($-8,)5/-"1)((&+&,'-")$($-"/&0''(/28-"-"$
              $##&-"-"-"%&'##'###%%###-"#1####&##-"##$#-"$#####$#$-"-"#
              /,&&-"-"-",5<,*,<&1)',+/-",5,/&'/),-"//,4-"&&)7+&),)-"-"&
              $/&&-"-"-"81/@*,>),)3)(,-"<(>/'/)-",&-"&,<1).&&8-"-"2
              :?>8-"-"-">?8;0>6;:.>9=6-";98>/6$%9-")%47-"+#1;/)'.7-"-".
              =<A?-"-"-"==><<0>1;@89>=-"579;A==>3-"1<79-":<=37)55;-"-"<
              8;<<-"-"-":;8<86526<8,;<-"891:76,,9-"7037-"5.+:;1;65-"-"9
              <::A-"-"-"9A@:9<<><8==@5-"8;;5:89;6-"<05:-"9<2=)8>68-"-"=
              #&$#-"-"-"$''%##$'$'$-(&-"$&&#'#-#'-"#%&.-"$''#%%&(%-"-"*

              yep. there's all those "-1" improperly translated. is there a way that i can correct these files without retranslating everything to fastq?

              nisha, what was your way around this if you didn't change the script?

              thanks.

              Comment

              • nilshomer
                Nils Homer
                • Nov 2008
                • 1283

                #8
                Originally posted by der_eiskern View Post
                thanks. this is a part of an output where i printed every quality string

                (*)&-"-"-",&&'&''*)',&&,-"1&+&))))1-"&&)/-")&',&(&-)-"-"'
                <;>=-"-"-"7=><1=>?:<>;>A-"=;<88=:=6-"=0:<-">8?(;9;8,-"-",
                5&,;-"-"-":&/,/(($-8,)5/-"1)((&+&,'-")$($-"/&0''(/28-"-"$
                $##&-"-"-"%&'##'###%%###-"#1####&##-"##$#-"$#####$#$-"-"#
                /,&&-"-"-",5<,*,<&1)',+/-",5,/&'/),-"//,4-"&&)7+&),)-"-"&
                $/&&-"-"-"81/@*,>),)3)(,-"<(>/'/)-",&-"&,<1).&&8-"-"2
                :?>8-"-"-">?8;0>6;:.>9=6-";98>/6$%9-")%47-"+#1;/)'.7-"-".
                =<A?-"-"-"==><<0>1;@89>=-"579;A==>3-"1<79-":<=37)55;-"-"<
                8;<<-"-"-":;8<86526<8,;<-"891:76,,9-"7037-"5.+:;1;65-"-"9
                <::A-"-"-"9A@:9<<><8==@5-"8;;5:89;6-"<05:-"9<2=)8>68-"-"=
                #&$#-"-"-"$''%##$'$'$-(&-"$&&#'#-#'-"#%&.-"$''#%%&(%-"-"*

                yep. there's all those "-1" improperly translated. is there a way that i can correct these files without retranslating everything to fastq?

                nisha, what was your way around this if you didn't change the script?

                thanks.
                If it is in the .bfq format, you will have to convert it back to fastq (since the .bfq is gzip compressed).

                You can always modify the input "qual" files using "sed":

                Code:
                sed -i 's_-1_1_g' <QV file>

                Comment

                • der_eiskern
                  Member
                  • Jul 2009
                  • 46

                  #9
                  Originally posted by nilshomer View Post
                  If it is in the .bfq format, you will have to convert it back to fastq (since the .bfq is gzip compressed).

                  You can always modify the input "qual" files using "sed":

                  Code:
                  sed -i 's_-1_1_g' <QV file>
                  Thanks! we don't have the original qual files unfortunately, can i apply this command to the fastq files i have? or are they beyond help?

                  Comment

                  • nilshomer
                    Nils Homer
                    • Nov 2008
                    • 1283

                    #10
                    Originally posted by der_eiskern View Post
                    Thanks! we don't have the original qual files unfortunately, can i apply this command to the fastq files i have? or are they beyond help?
                    You can try to modify the fastq files. The only problem is that -1 encoded in sanger ASCII is -", and both - and " are also sanger ASCII (I believe). So some fraction of the time -" will occur not from one -1 quality but two independent qualities. Therefore it is fairly tricky, unless you try to match up the -1 qualities with the missing color (which is usually the case). This is starting to sound like a lot of work!

                    Did you delete the original qual files? How did you get the fastq file in the first place?

                    Comment

                    • der_eiskern
                      Member
                      • Jul 2009
                      • 46

                      #11
                      Originally posted by nilshomer View Post
                      So some fraction of the time -" will occur not from one -1 quality but two independent qualities. Therefore it is fairly tricky, unless you try to match up the -1 qualities with the missing color (which is usually the case). This is starting to sound like a lot of work!

                      Did you delete the original qual files? How did you get the fastq file in the first place?
                      Our SOLiD data came from offsite and they did they're own SNPcalling using the Corona Lite pipeline and gave us converted qual files in the fastq format for us to run MAQ on. Email communication has been slow...its looking like i'll have to pay them a visit to get all this straightened out.

                      Comment

                      • nilshomer
                        Nils Homer
                        • Nov 2008
                        • 1283

                        #12
                        Originally posted by der_eiskern View Post
                        Our SOLiD data came from offsite and they did they're own SNPcalling using the Corona Lite pipeline and gave us converted qual files in the fastq format for us to run MAQ on. Email communication has been slow...its looking like i'll have to pay them a visit to get all this straightened out.
                        I am a bit confused. So they aligned the reads and made variant calls using corona-lite, and then gave you the raw color data (fastq)? Why don't they just give you the variant calls and alignments?

                        I would definitely ask for the *csfasta and *qual files in this case and do your own alignment and SNP calling...

                        Comment

                        • der_eiskern
                          Member
                          • Jul 2009
                          • 46

                          #13
                          Originally posted by nilshomer View Post
                          I am a bit confused. So they aligned the reads and made variant calls using corona-lite, and then gave you the raw color data (fastq)? Why don't they just give you the variant calls and alignments?

                          I would definitely ask for the *csfasta and *qual files in this case and do your own alignment and SNP calling...
                          yeah, that's what i've been trying to do with MAQ and have been rather successful with the "homozygous" calls (using the flawed data they gave us) but not so much for the hets. i'm going to have to redo all of it though. thanks again for your help, nils.

                          Comment

                          Latest Articles

                          Collapse

                          • seqadmin
                            New Genomics Tools and Methods Shared at AGBT 2025
                            by seqadmin


                            This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                            The Headliner
                            The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                            03-03-2025, 01:39 PM
                          • seqadmin
                            Investigating the Gut Microbiome Through Diet and Spatial Biology
                            by seqadmin




                            The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
                            02-24-2025, 06:31 AM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by seqadmin, 03-20-2025, 05:03 AM
                          0 responses
                          17 views
                          0 reactions
                          Last Post seqadmin  
                          Started by seqadmin, 03-19-2025, 07:27 AM
                          0 responses
                          18 views
                          0 reactions
                          Last Post seqadmin  
                          Started by seqadmin, 03-18-2025, 12:50 PM
                          0 responses
                          19 views
                          0 reactions
                          Last Post seqadmin  
                          Started by seqadmin, 03-03-2025, 01:15 PM
                          0 responses
                          186 views
                          0 reactions
                          Last Post seqadmin  
                          Working...