gatk-3.8/perl/sync1000Genomes
ebanks 63f40215b3 2 more scripts I found helpful in syncing (and cleaning up) the 1000G mirror
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@5287 348d0f76-0448-11de-a6fe-93d51630548a
2011-02-22 04:17:36 +00:00
..
README.sync Checking in the scripts I'm using for syncing the 1000G mirror. Note that very few people can actually use them because you most likely don't have perission to write to /humgen/1kg/DCC, but these should be used as a resource if anyone ever needs to do this in the future. These scripts are very naive and consist of just the actual pulling down of data. Currently aspera and wget are supported, but Mark should feel free to add lftp if he wants. :) Also, while I'm here, I'm removing obsolete scripts for running an obsolete pipeline. 2011-02-20 03:59:26 +00:00
checkMD5s.pl 2 more scripts I found helpful in syncing (and cleaning up) the 1000G mirror 2011-02-22 04:17:36 +00:00
findFilesNotInAlignmentIndex.sh 2 more scripts I found helpful in syncing (and cleaning up) the 1000G mirror 2011-02-22 04:17:36 +00:00
runAspera.pl Checking in the scripts I'm using for syncing the 1000G mirror. Note that very few people can actually use them because you most likely don't have perission to write to /humgen/1kg/DCC, but these should be used as a resource if anyone ever needs to do this in the future. These scripts are very naive and consist of just the actual pulling down of data. Currently aspera and wget are supported, but Mark should feel free to add lftp if he wants. :) Also, while I'm here, I'm removing obsolete scripts for running an obsolete pipeline. 2011-02-20 03:59:26 +00:00
runWget.pl Checking in the scripts I'm using for syncing the 1000G mirror. Note that very few people can actually use them because you most likely don't have perission to write to /humgen/1kg/DCC, but these should be used as a resource if anyone ever needs to do this in the future. These scripts are very naive and consist of just the actual pulling down of data. Currently aspera and wget are supported, but Mark should feel free to add lftp if he wants. :) Also, while I'm here, I'm removing obsolete scripts for running an obsolete pipeline. 2011-02-20 03:59:26 +00:00
syncFilesInList.pl Checking in the scripts I'm using for syncing the 1000G mirror. Note that very few people can actually use them because you most likely don't have perission to write to /humgen/1kg/DCC, but these should be used as a resource if anyone ever needs to do this in the future. These scripts are very naive and consist of just the actual pulling down of data. Currently aspera and wget are supported, but Mark should feel free to add lftp if he wants. :) Also, while I'm here, I'm removing obsolete scripts for running an obsolete pipeline. 2011-02-20 03:59:26 +00:00

README.sync

Before sync'ing 1000 Genomes, you need to be logged in as gsa-dev:
% sudo -s -u gsa-dev
[Note that this step needs to be done before subsequent steps for stability]

To use Aspera, you'll need to ssh into one of the appropriate machines:
% ssh vbigtube or mirror

[The NCBI Aspera source is: anonftp@ftp-private.ncbi.nih.gov:/1000genomes/ftp/]