Hi,
I am finding bit difficulties in understading BWA output format (with indels) and following are my queries.
1. XT:A:U means unique hits and XT:A:R means repeat..
2. The above query has the desciptor as 70Match, 1 insertion and 4 matches (75 bp query). If I go to UCSC blat to check this part, first 70 bases matches with the ref (hg18) sequnces and the last GATC is not in that position followed by an insertion in 71st position. Its bit confusing at this stage.
3. How to intrepret
Number of gaps and gap extentions. If my understanding is correct, it opens 1 gap and extended to 1 base. I am trying to understnad the above read results.
3. Sometimes BWA gives XA: (alternate hits) for XT:A:R (multiple hits). Is tehre any threshold for BWA to report these alternate hits. Sometimes I see X0:i:10 ( number of best hits as 10) but alternate hits reports only 3 or 4. Is there any threshold to report alternate hits as I believe BWA doesnot report all alternate hits for every multiple hits.
4. I also want to know about the mismatch tag
explanation.
I would appreciate very much if someone could explain my above queries. Thanks.
I am finding bit difficulties in understading BWA output format (with indels) and following are my queries.
Code:
GA1_0001:5:1:2237:4692#0 0 chr8 145553847 37 70M1I4M * 0 0 GGATCTGGGTGGAGCTACCTGTGGTGGTCAAAGAGCTTCCAGAGG GTGAGTGGGAGGGAGGTGCAGGTGTAGATC 00000;<9:<99+<;9<>;;<AA<A%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% XT:A:U NM:i:3 X0:i:1 X1:i:0 XM :i:3 XO:i:1 XG:i:1 MD:Z:71C0C1
2. The above query has the desciptor as 70Match, 1 insertion and 4 matches (75 bp query). If I go to UCSC blat to check this part, first 70 bases matches with the ref (hg18) sequnces and the last GATC is not in that position followed by an insertion in 71st position. Its bit confusing at this stage.
3. How to intrepret
Code:
XO:i:1 XG:i:1
3. Sometimes BWA gives XA: (alternate hits) for XT:A:R (multiple hits). Is tehre any threshold for BWA to report these alternate hits. Sometimes I see X0:i:10 ( number of best hits as 10) but alternate hits reports only 3 or 4. Is there any threshold to report alternate hits as I believe BWA doesnot report all alternate hits for every multiple hits.
4. I also want to know about the mismatch tag
Code:
MD:Z:71C0C1
I would appreciate very much if someone could explain my above queries. Thanks.
Comment