Commit Graph

6 Commits (b63d64bbbc73ab9ef2f8b60c0f63b35f9882c813)

Author SHA1 Message Date
depristo b63d64bbbc Beautiful labels, better choice of dimension ranges. Supports fast loading of just first N records for testing.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3964 348d0f76-0448-11de-a6fe-93d51630548a
2010-08-06 23:17:32 +00:00
depristo d3bebe0f2c Reasonable comment
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3963 348d0f76-0448-11de-a6fe-93d51630548a
2010-08-06 22:03:55 +00:00
depristo bb5dfd7e5e Slightly nicer plotting; not yet complete
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3961 348d0f76-0448-11de-a6fe-93d51630548a
2010-08-06 20:01:31 +00:00
depristo 70f492a6e8 Prints out trivial debugging info
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3957 348d0f76-0448-11de-a6fe-93d51630548a
2010-08-06 13:24:21 +00:00
kiran 1a36cb9296 Can now set the maximum number of variants to see in a cluster plot (useful when you don't need to see a billion points to get an idea of what's going on. Limit applies to known and novel variants separately.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3937 348d0f76-0448-11de-a6fe-93d51630548a
2010-08-04 18:45:24 +00:00
kiran bd27287fe7 An R module that takes in a Variant Recalibration cluster file (file with '@!CLUSTER' lines in it), a tabularized VCF, and optionally a set of loci that should be examined more carefully, and emits a tremendous number of plots. For every annotation used in clustering, the distributions and pair-wise comparison (with ellipses denoting the 2-sigma cluster boundaries) are shown. Each cluster is shaded with a color proportional to its mixture coefficient.
To use this module, you'll first have to take your VCF and create an R-readable table out of it with the following command:

python /path/to/Sting/trunk/python/vcf2table.py -f CHROM,POS,ID,AC,AF,AN,DB,DP,HRun,MQ,MQ0,MyHaplotypeScore,QD,SB my.vcf > my.vcf.table

Then, simply invoke this module with the command:

Rscript /path/to/Sting/trunk/R/VariantRecalibratorReport/VariantRecalibratorReport.R /path/to/output/prefix /path/to/my/my.clusters /path/to/my.vcf.table [/path/to/my.suspicious.loci]

This will create a number of plots all with the prefix "/path/to/output/prefix".  For instance, if you used QD, SB, HRun, and MyHaplotypeScore annotations during clustering, you should see output like this:

    /path/to/output/prefix.anndist.HRun.pdf
    /path/to/output/prefix.anndist.MyHaplotypeScore.pdf
    /path/to/output/prefix.anndist.QD.pdf
    /path/to/output/prefix.anndist.SB.pdf
    /path/to/output/prefix.cluster.HRun_vs_MyHaplotypeScore.pdf
    /path/to/output/prefix.cluster.HRun_vs_QD.pdf
    /path/to/output/prefix.cluster.HRun_vs_SB.pdf
    /path/to/output/prefix.cluster.MyHaplotypeScore_vs_HRun.pdf
    /path/to/output/prefix.cluster.MyHaplotypeScore_vs_QD.pdf
    /path/to/output/prefix.cluster.MyHaplotypeScore_vs_SB.pdf
    /path/to/output/prefix.cluster.QD_vs_HRun.pdf
    /path/to/output/prefix.cluster.QD_vs_MyHaplotypeScore.pdf
    /path/to/output/prefix.cluster.QD_vs_SB.pdf
    /path/to/output/prefix.cluster.SB_vs_HRun.pdf
    /path/to/output/prefix.cluster.SB_vs_MyHaplotypeScore.pdf
    /path/to/output/prefix.cluster.SB_vs_QD.pdf



git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3936 348d0f76-0448-11de-a6fe-93d51630548a
2010-08-04 18:35:14 +00:00