a) Updating for posterity original scripts for VQSR project consensus (snp and indel) so that they use newest GATK syntax (also they were not run in the end). b) New scripts for redoing indel GL's, jobs are split per sample and per chromosome.
Another script (PhaseIndelRedoCombining.scala) does the combining of all pieces - not the best solution, but best for development/debugging to have two separate entities. c) Experimental walker to assess GL concordance of two callsets - just accumulates squared error between each sample and site GL's and outputs at the end. d) Another one-off to fill a callset of given GL's with greedy genotypes (genotype that maximizes PL) to have more meaningful metrics and QC of a given GL file.
This commit is contained in:
parent
bc842ab3a5
commit
d5d0e94a11