gatk-3.8/python
depristo cf910d9cc2 misc. useful updates to python library
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3683 348d0f76-0448-11de-a6fe-93d51630548a
2010-06-30 16:33:32 +00:00
..
1kgScripts
genomicAnnotatorScripts Fixed error message 2010-06-28 14:50:28 +00:00
1kgStatsForCalls.py misc. useful updates to python library 2010-06-30 16:33:32 +00:00
AlignBam.py
AlignBams.py
AnnotateVCFwithMAF.py Make annotated VCF not be broken 2010-03-01 23:22:41 +00:00
BarcodeAnalysis.py
CoverageEval.py Updated SNP calling power from coverage tools to work with new UnifiedGenotyper and DepthOfCoverage tools. 2009-12-16 20:44:30 +00:00
CoverageMeta.py Updated SNP calling power from coverage tools to work with new UnifiedGenotyper and DepthOfCoverage tools. 2009-12-16 20:44:30 +00:00
DOCParameter.py
EvalMapping.py
FastaQuals2Fastq.py
FlatFileTable.py Add ability for flat file table parsing module to skip ahead to first occurence of a regular expression (use case: consistently parsing DepthOfCoverage output for histogram section of file across file format changes) 2009-12-16 20:38:50 +00:00
Geli2GFF.py
Gelis2PopSNPs.py
JobDispatcher.py Awesome: JobDispatcher can now dispatch jobs by gene from a target .design file found in /seq/references. 2010-04-14 18:17:41 +00:00
JobDispatcherExample.py This is a python job dispatcher I've been using, which builds on Mark's FarmJob utility, and an example script of how I'm using it. Basically I wrote it to smartly break up analysis over an interval list, givin a maximum number of bases per job, a list of available queues, and a limit on each queue. It handles going over these limits in three ways: 2010-03-31 19:53:13 +00:00
LogRegression.py
LogisticRegressionByReadGroup.py
MergeBAMBatch.py
MergeBAMsUtils.py
MergeBamsByKey.py
MergeEvalMapTabs.py
ParseDCCSequenceData.py
RecalQual.py
RefseqLibrary.py Awesome: JobDispatcher can now dispatch jobs by gene from a target .design file found in /seq/references. 2010-04-14 18:17:41 +00:00
RunPilot2Pipeline.py
SAM.py
SamWalk.py
SamWalkTest.py
SimpleSAM.py
SimulateReads.py
SpawnMapperJobs.py
SpawnValidationJobs.py
StressTestGATK.py
SyzygyCallsFileToVCF.py Minor changes (additional info calculated) 2010-01-06 16:41:01 +00:00
VCFValidationAnalysis.py A collection of python objects that are useful for VCF validation. Use 'em or don't. 2010-01-25 18:44:10 +00:00
ValidateGATK.py
Verify1KGArchiveBAMs.py Marginally more useful output 2010-04-20 14:45:14 +00:00
WalkLociTest.py
Walker.py
aln_file.nocvs.py
aln_file.py
callingProgress.py
collectCalls.py Adding this to subversion so it's protected 2009-12-09 21:26:17 +00:00
compSNPCalls.py
countCoverageWithSamtools.py
create_venn_evals.py Script to split concordance files into their constituent sets and calculate summary stats from a concordance file - SNPs called and number in dbSNP 2010-03-12 22:20:44 +00:00
easyRecalQuals.py
expandedSummaryToVCF.py
faiReader.py Better snpSelector, plus VCFmerge tool 2009-11-11 22:02:57 +00:00
farm_commands.py Now supports strings in command line for farm submission 2010-01-06 13:15:40 +00:00
farm_commands2.py more powerful management routines for my pipeline 2010-05-12 13:37:39 +00:00
fasta.py
gatherIndelsToVCF.py Added - Q&D script to gather verbose bed files to a VCF. 2010-05-05 02:49:16 +00:00
gatkConfigParser.py
generate1KGHapmapVCF.py Quick script that changes "chr#" to "#" and "chrM" to "MT" and moves mitochondria to the end of the vcf; in accordance with the 1KG reference. 2010-01-28 21:59:33 +00:00
getBamFilesFromSpreadsheet.py Moved CoverageStatistics to core. This will be (soon) renamed DepthOfCoverage; so please use CoverageStatistics 2010-03-29 13:32:00 +00:00
getLaneAndSequenceInfo.py Grabs average SNP calls, mismatch rate, aligned reads, and other important lane metrics from a SQUID export and summarizes them across multiple margins (lane numbers, flowcells, samples, libraries) 2010-04-17 03:09:05 +00:00
getTargetedGenes.py Just some code I want to freeze. If you ever need to estimate the % of bases covered by exon, given an interval list, give it to getTargetedGenes. Not the best name for this function, but I don't expect anyone to use it but me. 2010-04-01 20:21:50 +00:00
igvController.py Now can take a VCF file as input 2010-05-18 17:06:12 +00:00
indelVerboseStats.py Minor improvements to simple python code 2010-05-07 21:34:46 +00:00
madPipelineUtils.py more powerful management routines for my pipeline 2010-05-12 13:37:39 +00:00
makeIGVCategory.py misc. useful updates to python library 2010-06-30 16:33:32 +00:00
makeIndelMask.py Removing unnecessary dependences that were causing problems for Sendu 2010-05-21 13:07:41 +00:00
makeMetricsFilesForFirehose.py Moved CoverageStatistics to core. This will be (soon) renamed DepthOfCoverage; so please use CoverageStatistics 2010-03-29 13:32:00 +00:00
memo.py
mergeVCFInfoFields.py Moved CoverageStatistics to core. This will be (soon) renamed DepthOfCoverage; so please use CoverageStatistics 2010-03-29 13:32:00 +00:00
mergeVCFs.py new -a option does fast merging of already sorted files 2010-01-05 13:55:39 +00:00
picard_utils.py misc. useful updates to python library 2010-06-30 16:33:32 +00:00
pilot2CallingPipeline.py keeping a backup 2010-01-31 15:36:25 +00:00
pushback_file.py
qltout.py
realignBamByChr.py misc. useful updates to python library 2010-06-30 16:33:32 +00:00
samtooltest.sh
snpSelector.py snpSelector now supports min and max q scores. 2010-01-31 19:38:34 +00:00
subsetDbSNPWithrsIDs.py simple tool that takes two dbSNP files and subsets the seconds to only include rsID SNPs present in the first. Used to make b129 against b37 by subsetting b131/b37 vs. b129/b36 2010-05-12 13:39:09 +00:00
tgtc2sam.py
ucscRepeatMaskToIntervalList.py better farm commands, and simple utility to convert ucsc repeat masks to interval lists 2010-03-19 13:11:06 +00:00
vcf2table.py Minor improvements to simple python code 2010-05-07 21:34:46 +00:00
vcfReader.py misc. useful updates to python library 2010-06-30 16:33:32 +00:00
vcf_b36_to_hg18.py some useful routines for working with project processing. madPipeline contains a bunch of useful routines for building pipelines that I finally put into one file. Let's just say that I'm really looking forward to the new pipeline system... 2010-04-26 12:34:04 +00:00