f5a301fb63Bugfix for AlignmentUtils.trimCigarByBases
Mark DePristo
2013-04-23 18:26:20 -0400
2bcbdd469fleftAlignCigarSequentially now supports haplotypes with insertions and deletions where the deletion allele was previously removed by the leftAlignSingleIndel during it's cleanup phase.
Mark DePristo
2013-04-19 16:11:29 -0400
aefddaa219Merge pull request #210 from broadinstitute/gda_QUAL_fix_GSA-910
Eric Banks
2013-05-03 06:28:10 -0700
4f627e96c0Merge pull request #211 from broadinstitute/md_kb_script
Mark DePristo
2013-05-03 04:30:23 -0700
b89d97cb9cStarting NA12878 KB requires use Java-1.7
Mark DePristo
2013-05-03 07:29:29 -0400
0c30a5ebc6Rev'd up Picard to get PL fix: PLs were saturated to 32767 (Short.MAX_VALUE) when converting from GL to integers. Increase capping to Integer.MAX_VALUE (2^31-1) which should be enough for reasonable sites now. Integration tests change because some tests have some hyper-deep pileups where this case was hit
Guillermo del Angel
2013-05-01 15:25:23 -0400
c6df20cff5Merge pull request #209 from broadinstitute/eb_more_fixes_to_bundle_script
Eric Banks
2013-05-02 12:23:41 -0700
d981fd01b8Now that we don't generate dict and fai files, the resource script needs to copy them to the bundle.
Eric Banks
2013-05-02 15:18:13 -0400
13bfa963daRevert changes to exampleFASTA.fasta.fai for now to get tests passing again
David Roazen
2013-05-02 12:59:20 -0400
dfdc0df4f1Merge pull request #207 from broadinstitute/eb_fixing_bundle_script
Mark DePristo
2013-05-02 08:18:19 -0700
f88a964e2cAdding .fai file to example fasta since we don't generate it anymore
Eric Banks
2013-05-02 10:54:32 -0400
6d0e383a60Fixing the bundle script
Eric Banks
2013-05-02 09:28:40 -0400
4b8b411b92- Fixed a small bug in the printout of molten data in GenotypeConcordance Output didn't "mix-up" the genotypes, it outputed the same HET vs HET (e.g.) 3 times rather than the combinations of HET vs {HET, HOM, HOM_REF}, etc. This was only a problem in the text, _not_ the actual numbers, which were outputted correctly.
Yossi Farjoun
2013-05-01 12:45:03 -0400
803f666fd5Merge pull request #204 from broadinstitute/rp_cgl_force_sample_name
Mark DePristo
2013-05-01 13:59:20 -0700
ad84f15572Adding argument to CGL to not stratify by sample name. This is useful when running with 1000G so that you don't get 1000s of lines on the plots.
Ryan Poplin
2013-05-01 12:22:27 -0400
19b59f0fb9Merge pull request #206 from broadinstitute/dr_update_tests_for_java7
droazen
2013-05-01 13:23:25 -0700
f3c94a3c87Update expected test output for Java 7
David Roazen
2013-05-01 15:48:05 -0400
d0980e236aMerged bug fix from Stable into Unstable
David Roazen
2013-05-01 01:08:29 -0400
f57256b6c2Delete unused FastaSequenceIndexBuilder class and accompanying test
David Roazen
2013-05-01 01:02:01 -0400
2edb286d1cMerged bug fix from Stable into Unstable
David Roazen
2013-04-30 22:33:36 -0400
3390fc7d67Include cofoja jar in classpath when testing release jars
David Roazen
2013-04-30 22:16:58 -0400
e29b52b9a5Merge remote-tracking branch 'unstable/master'
Eric Banks
2013-04-30 15:31:33 -0400
a0a1a366e3Merge pull request #201 from broadinstitute/eb_fix_reduced_count_tagging
Mark DePristo
2013-04-30 12:14:40 -0700
58424e56beSetting the reduce reads count tag was all wrong in a previous commit; fixing.
Eric Banks
2013-04-29 11:17:38 -0400
15266da51cMerge pull request #203 from broadinstitute/gda_poolcaller_paper
delangel
2013-04-30 07:12:02 -0700
95637e03a0Updates to pool caller scala script due to new paths and cleanup, hopefully with final changes for paper. Added also R script used to process everything into a couple of ggplot-friendly data frames. Functionality is basically the same. Enhancements: -- Add annotation to log axiom and Exome Chip AC along with LOF results for concordance comparisons. -- General Cleanup. -- Used base path for files as a variable in case directory structure in gsa-hpprojects changes again. -- Output also per-pool data by subsetting genotypes per pool and comparing with corresponding genotypes from Axiom, exome chip and omni. -- Commit R scripts that load all tables and crunch them to analyze them.
Guillermo del Angel
2013-02-15 12:28:13 -0500
6ea2bceb55Merge pull request #202 from broadinstitute/yf_remove_getlength_from_every_GATKBAMIndex_read
Mark DePristo
2013-04-30 06:24:34 -0700
73fcacbf1bChange Long to long
Mark DePristo
2013-04-30 09:21:10 -0400
a3a2ec5a1cMerge pull request #200 from broadinstitute/gda_ug_rr_bug_48742591
Eric Banks
2013-04-29 17:54:43 -0700
20d3137928Fix for indel calling with UG in presence of reduced reads: When a read is long enough so that there's no reference context available, the reads gets clipped so that it falls again within the reference context range. However, the clipping is incorrect, as it makes the read end precisely at the end of the reference context coordinates. This might lead to a case where a read might span beyond the haplotype if one of the candidate haplotypes is shorter than the reference context (As in the case e.g. with deletions). In this case, the HMM will not work properly and the likelihood will be bad, since "insertions" at end of reads when haplotype is done will be penalized and likelihood will be much lower than it should. -- Added check to see if read spans beyond reference window MINUS padding and event length. This guarantees that read will always be contained in haplotype. -- Changed md5's that happen when long reads from old 454 data have their likelihoods changed because of the extra base clipping.
Guillermo del Angel
2013-04-28 09:14:35 -0400
0e7e6d35d8GATKBAMIndex calls buffer.length() on every read. This is causing much pain. Optimized by getting the read of the file upon opening the index-file and using that instead.
Yossi Farjoun
2013-04-29 12:49:02 -0400
c5701a9adeMerge pull request #199 from broadinstitute/md_clipped_reduced_reads
Eric Banks
2013-04-29 09:14:43 -0700
0387ea8df9Bugfix for ReadClipper with ReducedReads
Mark DePristo
2013-04-29 10:05:18 -0400
5dd73ba2d1Merge pull request #198 from broadinstitute/mc_reduce_reads_ds_doc
Mark DePristo
2013-04-27 05:49:47 -0700
651e1f23b1Merge pull request #194 from broadinstitute/gda_ancient_dna_newPipeline
delangel
2013-04-27 04:59:09 -0700
76e997895eUpdates GATKDocs for ReduceReads downsampling
Mauricio Carneiro
2013-04-26 23:33:09 -0400
4168aaf280Add feature to specify Allele frequency priors by command line when calling variants.
Guillermo del Angel
2013-04-26 15:49:31 -0400
759c531d1bMerge pull request #197 from broadinstitute/dr_disable_snpeff_version_check
Mark DePristo
2013-04-26 13:55:14 -0700
7d90bbab08Add support for snpEff "GATK compatibility mode" (-o gatk)
David Roazen
2013-04-26 13:42:01 -0400
ec8fb9860aMerge pull request #196 from broadinstitute/rp_cgl_allele_matching
Mark DePristo
2013-04-26 12:38:59 -0700
93fc48739aIn CGL ensure that the alleles match exactly between the comp track and the external likelihoods track.
Ryan Poplin
2013-04-26 14:20:33 -0400
071fd67d55Merge pull request #193 from broadinstitute/eb_contamination_fixing_for_reduced_reads
Mark DePristo
2013-04-26 09:48:45 -0700
92a6c7b561Merge pull request #195 from broadinstitute/eb_exclude_sample_file_bug_in_select_variants
Mark DePristo
2013-04-26 09:47:38 -0700
360e2ba87eFixed bug reported on the forum where using the --exclude_sample_file argument in SV was giving bad results.
Eric Banks
2013-04-26 12:23:11 -0400
021adf4220WTF - I thought we had disabled the randomized dithering of rank sum tests for integration tests?!
Eric Banks
2013-04-25 15:39:42 -0400
ba2c3b57edExtended the allele-biased down-sampling functionality to handle reduced reads.
Eric Banks
2013-04-24 11:52:26 -0400
b749f06ba6Merge pull request #192 from broadinstitute/dr_rev_picard_for_2.5_release
droazen
2013-04-25 17:59:27 -0700
7cb1247164Rev picard, sam-jdk, tribble, and variant jars to version 1.90.1442
David Roazen
2013-04-25 13:53:32 -0400
528c3d083aMerge pull request #191 from broadinstitute/dr_fix_rod_system_locking
Mark DePristo
2013-04-25 09:32:54 -0700
3c7db87527Merge pull request #189 from broadinstitute/md_hc_clip_before_merging
Ryan Poplin
2013-04-25 08:42:03 -0700
d20be41feeBugfix for FragmentUtils.mergeOverlappingPairedFragments
Mark DePristo
2013-04-24 17:22:07 -0400
4d56142163Detect stuck lock-acquisition calls, and disable file locking for tests
David Roazen
2013-04-11 13:04:29 -0400
43f1746eb9Merge pull request #190 from broadinstitute/mc_processing_pipeline
Mark DePristo
2013-04-24 17:04:46 -0700
95ac9b6a33TechDev version of the Data Processing Pipeline
Mauricio Carneiro
2013-04-24 19:51:34 -0400
55ead98d3dMerge pull request #188 from broadinstitute/gda_cgl_fix
Mark DePristo
2013-04-24 16:17:29 -0700
3d49f524eeSmall fixes for CalibrateGenotypeLikelihoods. -- If we are using an external vcf, do not consider filtered out records when argument -ignoreFiltered is set. -- Fix for R script: it uses ddply but some default R installations don't include plyr library by default.
Guillermo del Angel
2013-04-24 18:41:40 -0400
27bb699e8bMerge pull request #181 from broadinstitute/eb_yet_more_rr_improvements_GSA-930
MauricioCarneiro
2013-04-24 15:40:03 -0700
379a9841ceVarious bug fixes for recent Reduce Reads additions plus solution implemented for low MQ reads.
Eric Banks
2013-04-17 12:45:09 -0400
45fec382e7Merge pull request #180 from broadinstitute/mc_diagnosetargets_missing_targets
MauricioCarneiro
2013-04-24 14:54:55 -0700
3f52f55c55Merge pull request #186 from broadinstitute/md_libs_canonical_cigar
Eric Banks
2013-04-24 12:58:32 -0700
367f0c0ac1Split class names into stratification and metrics
Mauricio Carneiro
2013-04-24 14:15:49 -0400
91d5674cc5Merge pull request #187 from broadinstitute/rp_new_bundle_for_release
Mark DePristo
2013-04-24 08:44:38 -0700
80131ac996Adding the 1000G_phase1.snps.high_confidence callset to the GATK resource bundle for use in the April 2013 updated best practices.
Ryan Poplin
2013-04-24 11:41:32 -0400
df90597bfcPerformance optimizations and caliper benchmarking code for consolidateCigar
Mark DePristo
2013-04-19 17:31:59 -0400
df6ba74395Merge pull request #185 from broadinstitute/gda_poolcaller_fix_47921867
Mark DePristo
2013-04-24 04:14:05 -0700
2ab270cf3fCorner case fix to General Ploidy SNP likelihood model. -- In case there are no informative bases in a pileup but pileup isn't empty (like when all bases have Q < min base quality) the GLs were still computed (but were all zeros) and fed to the exact model. Now, mimic case of diploid Gl computation where GLs are only added if # good bases > 0 -- I believe general case where only non-informative GLs are fed into AF calc model is broken and yields bogus QUAL, will investigate separately.
Guillermo del Angel
2013-04-23 17:51:46 -0400
8f8f339e4bAbstract class for the statistics
Mauricio Carneiro
2013-04-23 18:02:27 -0400
e83d9bef59Merge pull request #182 from broadinstitute/md_hc_vqsr_best_practices
Ryan Poplin
2013-04-23 12:49:10 -0700
90fc249c8dMerge pull request #184 from broadinstitute/rp_revert_sw_params
Mark DePristo
2013-04-23 12:32:13 -0700
cb2a8f83deMerge pull request #183 from jsilter/master
Mark DePristo
2013-04-23 11:59:52 -0700
75184614c6Add additional necessary class files to na12878kb.jar target
Jacob Silterra
2013-04-23 14:03:48 -0400
38662f1d47Limiting access to the DT classes
Mauricio Carneiro
2013-04-23 14:01:43 -0400
d5d87c50e6Updates to GeneralCallingPipeline
Mark DePristo
2013-04-04 17:57:20 -0400
cb4ec3437aAfter debate reverting SW parameter changes temporarily while we explore global SW plans.
Ryan Poplin
2013-04-23 13:32:06 -0400
ff430c821eMerge pull request #178 from broadinstitute/md_common_suffix_bugfix
Ryan Poplin
2013-04-22 06:53:01 -0700
be66049a6fBugfix for CommonSuffixSplitter
Mark DePristo
2013-04-21 15:42:26 -0400
fc29a66a63Merge pull request #177 from broadinstitute/md_gatklog_cleanup
Mark DePristo
2013-04-20 11:08:06 -0700
7e88a12bf1Delete progress log when script is done for downloadGATKReportsFromS3.csh
Mark DePristo
2013-04-20 14:06:02 -0400
12ac60ac3aMerge pull request #175 from broadinstitute/eb_increase_log10_cache_size
Eric Banks
2013-04-19 05:43:07 -0700
3477e092eaMinor: bump up the amount of cached log10 data in MathUtils so that Monkol can actually call 50K samples.
Eric Banks
2013-04-19 08:39:08 -0400
ef8679c0a0Merge pull request #174 from broadinstitute/md_hc_parameters
Ryan Poplin
2013-04-17 12:33:42 -0700
f0e64850daTwo sensitivity / specificity improvements to the haplotype caller
Mark DePristo
2013-04-17 10:30:37 -0400
b27706859aMerge pull request #172 from broadinstitute/eb_reducereads_het_improvements
MauricioCarneiro
2013-04-16 16:48:10 -0700
5bce0e086eRefactored binomial probability code in MathUtils.
Eric Banks
2013-04-09 15:37:08 -0400
df189293ceImprove compression in Reduce Reads by incorporating probabilistic model and global het compression
Eric Banks
2013-04-08 14:01:57 -0400
8e6309d56eMerge pull request #171 from broadinstitute/rp_hc_restore_filter_function
Mark DePristo
2013-04-16 11:21:32 -0700
e0dfe5ca14Restore the read filter function in the HaplotypeCaller.
Ryan Poplin
2013-04-16 09:37:04 -0400
e176fc3af1Merge pull request #159 from broadinstitute/md_bqsr_ion
Geraldine Van der Auwera
2013-04-16 08:54:47 -0700
936f4da1f6Merge pull request #166 from broadinstitute/md_hc_persample_haplotypes
Ryan Poplin
2013-04-16 08:46:56 -0700
bfbc23e99eMerge pull request #173 from broadinstitute/md_fix_vqsr
Mark DePristo
2013-04-16 08:46:26 -0700
17982bcbf8Update MD5s for VQSR header change
Mark DePristo
2013-04-16 11:45:45 -0400
067d24957bSelect the haplotypes we move forward for genotyping per sample, not pooled
Mark DePristo
2013-04-16 09:30:06 -0400
0ee21e58c3Merge pull request #165 from broadinstitute/md_vqsr_improvements
Ryan Poplin
2013-04-16 06:26:38 -0700
5a74a3190cImprovements to the VariantRecalibrator R plots
Mark DePristo
2013-04-11 10:52:59 -0400
564fe36d22VariantRecalibrator's VQSR.vcf now contains NEG/POS labels
Mark DePristo
2013-04-05 09:28:46 -0400