gatk-3.8/python
chartl 8a0e813b04 A helper script that will take a list of bams, a list of case sample IDs, and a list of control sample IDs, and generate a sample meta data yaml (which includes the bamfiles)
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@5482 348d0f76-0448-11de-a6fe-93d51630548a
2011-03-21 16:11:55 +00:00
..
genomicAnnotatorScripts Adding a script to do big table conversion. Removing Ben's script which is totally obsolete and busted. 2010-10-12 01:40:43 +00:00
1kgStatsForCalls.py Better reporting and now with a special mode for listing exceptions 2010-09-01 16:19:51 +00:00
AnnotateVCFwithMAF.py Restore "type" annotation (but not genomechange or cDNA change, which are already encoded in the VCF) 2010-07-13 17:33:15 +00:00
JobDispatcher.py Memory string added 2010-07-12 20:43:19 +00:00
JobDispatcherExample.py This is a python job dispatcher I've been using, which builds on Mark's FarmJob utility, and an example script of how I'm using it. Basically I wrote it to smartly break up analysis over an interval list, givin a maximum number of bases per job, a list of available queues, and a limit on each queue. It handles going over these limits in three ways: 2010-03-31 19:53:13 +00:00
MPGQueuePipelineStatus.py updated to work with the new tearsheet 2011-01-28 18:46:38 +00:00
ParseDCCSequenceData.py
RefseqLibrary.py Awesome: JobDispatcher can now dispatch jobs by gene from a target .design file found in /seq/references. 2010-04-14 18:17:41 +00:00
RunPilot2Pipeline.py
SyzygyCallsFileToVCF.py
VCFValidationAnalysis.py
Verify1KGArchiveBAMs.py Minor utility improvements 2011-02-26 15:36:26 +00:00
analyzeRunReports.py updates to handle only reporting on a specific SVN revision. Updated the R script to show the domain name of the runner, now that S3 logging is working 2011-02-01 12:02:12 +00:00
collectCalls.py
countCoverageWithSamtools.py
createCaseControlMetaData.py A helper script that will take a list of bams, a list of case sample IDs, and a list of control sample IDs, and generate a sample meta data yaml (which includes the bamfiles) 2011-03-21 16:11:55 +00:00
dataProcessingPaper.py Trival changes to data processing paper python 2010-12-01 14:57:14 +00:00
expandedSummaryToVCF.py
faiReader.py
farm_commands.py
farm_commands2.py Minor improvements to my crappy old python job management system. Mauricio's first task is to retire all of this code and move the DPP pipeline over to Queue 2010-12-09 04:44:16 +00:00
fasta.py
firehose_out_email.py updated to deal with new cleaning pipeline outputs and potentially infinity TI/TV 2010-08-24 16:01:09 +00:00
fixSoapSnp.py Simple pre-processing script for soapsnp files 2010-11-04 20:34:43 +00:00
gatherIndelsToVCF.py Adding this to get around lsf/csh issues (see recent help message). Also seems like a good time to reiterate http://www.faqs.org/faqs/unix-faq/shell/csh-whynot/ 2010-09-19 02:45:16 +00:00
gatherSampleSummaries.py If you scatter depth of coverage and need to do something more sophisticated than gathering up (e.g. concatenating) the interval summary file, and need to smartly gather up a full summary file, modify (stress on MODIFY) this script to do it 2011-02-25 01:23:53 +00:00
gatkConfigParser.py
generate1KGHapmapVCF.py Quick script that changes "chr#" to "#" and "chrM" to "MT" and moves mitochondria to the end of the vcf; in accordance with the 1KG reference. 2010-01-28 21:59:33 +00:00
getBamFilesFromSpreadsheet.py Moved CoverageStatistics to core. This will be (soon) renamed DepthOfCoverage; so please use CoverageStatistics 2010-03-29 13:32:00 +00:00
getLaneAndSequenceInfo.py Grabs average SNP calls, mismatch rate, aligned reads, and other important lane metrics from a SQUID export and summarizes them across multiple margins (lane numbers, flowcells, samples, libraries) 2010-04-17 03:09:05 +00:00
getTargetedGenes.py Just some code I want to freeze. If you ever need to estimate the % of bases covered by exon, given an interval list, give it to getTargetedGenes. Not the best name for this function, but I don't expect anyone to use it but me. 2010-04-01 20:21:50 +00:00
igvController.py Now can take a VCF file as input 2010-05-18 17:06:12 +00:00
indelVerboseStats.py Minor improvements to simple python code 2010-05-07 21:34:46 +00:00
lsf_post_touch.py Do this the right way 2010-09-19 04:30:48 +00:00
madPipelineUtils.py Minor improvements to my crappy old python job management system. Mauricio's first task is to retire all of this code and move the DPP pipeline over to Queue 2010-12-09 04:44:16 +00:00
makeIndelMask.py Removing unnecessary dependences that were causing problems for Sendu 2010-05-21 13:07:41 +00:00
makeMetricsFilesForFirehose.py Moved CoverageStatistics to core. This will be (soon) renamed DepthOfCoverage; so please use CoverageStatistics 2010-03-29 13:32:00 +00:00
memo.py
mergeVCFInfoFields.py Moved CoverageStatistics to core. This will be (soon) renamed DepthOfCoverage; so please use CoverageStatistics 2010-03-29 13:32:00 +00:00
picard_utils.py misc. changes to python scripts 2010-07-09 00:13:35 +00:00
privateMutationRates.py Private mutation simulator and analysis routines for EOMI paper 2011-01-07 21:23:29 +00:00
pushback_file.py
realignBamByChr.py Improvements to 1KG processing pipeline 2010-07-16 15:33:47 +00:00
recalAssociation.py One last little thing, I swear 2011-03-11 17:37:40 +00:00
sample_lister.py module for listing out samples for data processing and firehose reporting 2010-07-21 15:05:41 +00:00
setFilterGenotypesToRef.py Quick python scripts for going from genotype VCFs to site-only VCFs, and one to fix BC vcf files (which had "het" genotypes at non-variant sites) 2010-07-12 19:13:32 +00:00
splitIntervalsByContig.py Adding this to get around lsf/csh issues (see recent help message). Also seems like a good time to reiterate http://www.faqs.org/faqs/unix-faq/shell/csh-whynot/ 2010-09-19 02:45:16 +00:00
theoPost.py Modifications, bugfix to theoretical posteriors. (Bug fix: eliminated discontinuity in prior distribution) 2011-02-14 19:47:34 +00:00
ucscRepeatMaskToIntervalList.py better farm commands, and simple utility to convert ucsc repeat masks to interval lists 2010-03-19 13:11:06 +00:00
updateTribbleToCorrectSVNVersion.py clean-up the docs a little 2010-09-29 05:02:41 +00:00
validatePosterior.py Committing two pieces of code for exome analysis, in case they need be returned to 2011-01-31 14:13:09 +00:00
vcf2table.py now support -o output option, useful for pipelines 2010-08-06 14:57:04 +00:00
vcfGenotypeToSites.py Quick python scripts for going from genotype VCFs to site-only VCFs, and one to fix BC vcf files (which had "het" genotypes at non-variant sites) 2010-07-12 19:13:32 +00:00
vcfReader.py now support -o output option, useful for pipelines 2010-08-06 14:57:04 +00:00
vcf_b36_to_hg18.py Tired of writing vcf_hg18_to_b36 over and over again when necessary. Added a -r flag to this script that does it. 2010-09-28 14:51:57 +00:00