gatk-3.8/public/java/test/org/broadinstitute/sting
Chris Hartl 1f777c4898 Introducing the latest-and-greatest in genotyping: CalculatePosteriors.
CalculatePosteriors lets the user calculate genotype posteriors (and set genotypes accordingly) from genotype likelihoods and one or more panels containing allele counts (for instance, calculating NA12878 genotypes based on 1000G EUR frequencies). The uncertainty in allele frequency is modeled by a Dirichlet distribution whose parameters are the observed counts of each allele, and the genotype state is modeled by assuming independent draws of alleles (Hardy-Weinberg Equilibrium). Together these give a Dirichlet-Multinomial prior over genotypes.
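
For ploidy=2 the Dirichlet-Multinomial has a simple closed form: with allele counts a_k summing to A, a homozygous genotype k/k has prior a_k(a_k+1) / (A(A+1)) and a heterozygous genotype i/j has prior 2 a_i a_j / (A(A+1)). The sketch below illustrates that formula only. The class and method names are hypothetical and are not the GATK API, and it assumes every count is strictly positive (for example after adding a pseudocount, which is an assumption made here).

```java
// Hypothetical illustration; not the CalculatePosteriors implementation.
final class DiploidDirichletMultinomialSketch {

    /**
     * Prior probability of each diploid genotype under a Dirichlet-Multinomial
     * with two draws. alpha[k] is the (strictly positive) count for allele k.
     * Returns an upper-triangular matrix p where p[i][j] (i <= j) is the prior
     * of genotype i/j; entries below the diagonal are left at zero.
     */
    static double[][] genotypePriors(double[] alpha) {
        double total = 0.0;
        for (double a : alpha) {
            if (a <= 0.0) {
                throw new IllegalArgumentException("allele counts must be positive");
            }
            total += a;
        }
        // Shared denominator A * (A + 1) for two draws.
        double denom = total * (total + 1.0);
        int n = alpha.length;
        double[][] p = new double[n][n];
        for (int i = 0; i < n; i++) {
            // Homozygous i/i: the same allele drawn twice.
            p[i][i] = alpha[i] * (alpha[i] + 1.0) / denom;
            for (int j = i + 1; j < n; j++) {
                // Heterozygous i/j: two orderings of the same unordered pair.
                p[i][j] = 2.0 * alpha[i] * alpha[j] / denom;
            }
        }
        return p;
    }
}
```

With counts {3, 1} this gives 0.6 for 0/0, 0.3 for 0/1 and 0.1 for 1/1, which sum to 1.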

Currently this is implemented only for ploidy=2; it should be straightforward to generalize. In addition, there is an "EM" parameter that currently does nothing but throw an exception. A further extension of this method is to run EM over the maximum a posteriori (MAP) allele count in the input sample, as follows:
 while not converged:
  * AC = [external AC] + [sample AC]
  * Prior = DirichletMultinomial[AC]
  * Posteriors = [sample GL + Prior]
  * sample AC = MLEAC(Posteriors)
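
The loop above could be sketched roughly as follows for a biallelic, diploid site. This is a hypothetical illustration, not the (currently unimplemented) EM option: the names, the linear-scale likelihood inputs, the use of each sample's MAP genotype to recount alleles, the assumption that the external counts are strictly positive, and the iteration cap are all assumptions made here.

```java
// Hypothetical sketch of the EM-style loop; not part of GATK.
final class PosteriorEmSketch {

    // Dirichlet-Multinomial prior over {ref/ref, ref/alt, alt/alt} for two draws.
    static double[] prior(double refCount, double altCount) {
        double total = refCount + altCount;
        double denom = total * (total + 1.0);
        return new double[] {
            refCount * (refCount + 1.0) / denom,
            2.0 * refCount * altCount / denom,
            altCount * (altCount + 1.0) / denom
        };
    }

    // Index of the genotype maximizing likelihood * prior. Normalizing the
    // posteriors is skipped because only the argmax is needed here.
    static int mapGenotype(double[] likelihoods, double[] prior) {
        int best = 0;
        for (int g = 1; g < 3; g++) {
            if (likelihoods[g] * prior[g] > likelihoods[best] * prior[best]) {
                best = g;
            }
        }
        return best;
    }

    /**
     * externalAC = {ref, alt} panel counts, assumed strictly positive.
     * sampleGLs[s] = linear-scale likelihoods of sample s for 0, 1, 2 alt alleles.
     * Returns the MAP genotype (number of alt alleles) for each sample.
     */
    static int[] run(double[] externalAC, double[][] sampleGLs, int maxIterations) {
        int[] genotypes = new int[sampleGLs.length];
        double sampleRef = 0.0;
        double sampleAlt = 0.0;
        for (int iter = 0; iter < maxIterations; iter++) {
            // AC = [external AC] + [sample AC]; Prior = DirichletMultinomial[AC]
            double[] p = prior(externalAC[0] + sampleRef, externalAC[1] + sampleAlt);
            int alt = 0;
            for (int s = 0; s < sampleGLs.length; s++) {
                // Posteriors = [sample GL + Prior] (a product in linear space)
                genotypes[s] = mapGenotype(sampleGLs[s], p);
                alt += genotypes[s];
            }
            // sample AC recomputed from the MAP genotypes
            double newAlt = alt;
            double newRef = 2.0 * sampleGLs.length - alt;
            boolean converged = newAlt == sampleAlt && newRef == sampleRef;
            sampleRef = newRef;
            sampleAlt = newAlt;
            if (converged) {
                break;
            }
        }
        return genotypes;
    }
}
```

The iteration cap guards against the recount oscillating between two allele counts; a real implementation might instead track expected counts from the full posteriors.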

This is more useful for large callsets with small panels than for small callsets with large panels -- the latter being the more common use case.

Fully unit tested.

Reviewer (Eric) jumped in to address many of his own comments and removed public->protected dependencies.
2013-11-27 13:00:45 -05:00
..
commandline Fixed the logic of the @Output annotation and its interaction with 'required'. 2013-03-14 11:58:51 -04:00
gatk Adaptations to accommodate Tribble API changes, comprising mostly of the following. 2013-08-19 15:52:47 -04:00
jna Fix pipeline tests that were not respecting the pipeline test dry run setting 2013-06-28 15:27:17 -04:00
utils Introducing the latest-and-greatest in genotyping: CalculatePosteriors. 2013-11-27 13:00:45 -05:00
BaseTest.java Adaptations to accommodate Tribble API changes, comprising mostly of the following. 2013-08-19 15:52:47 -04:00
ExampleToCopyUnitTest.java Replace org.broadinstitute.variant with jar built from the Picard repo 2013-02-05 17:24:25 -05:00
MD5DB.java Enable convenient display of diff engine output in Bamboo, plus misc. minor test-related improvements 2013-05-10 19:00:33 -04:00
MD5Mismatch.java Enable convenient display of diff engine output in Bamboo, plus misc. minor test-related improvements 2013-05-10 19:00:33 -04:00
StingTextReporter.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
TestNGTestTransformer.java Fix tests that were consistently or intermittently failing when run in parallel on the farm 2013-03-06 13:56:54 -05:00
WalkerTest.java Reduce number of leftover temp files in GATK runs 2013-06-14 15:56:03 -04:00