Last updated: 2020-12-22
Checks: 1 1
Knit directory: popstruct_scripts/
This reproducible R Markdown analysis was created with workflowr (version 1.6.2). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.
The R Markdown file has unstaged changes. To know which version of the R Markdown file created these results, you’ll want to first commit it to the Git repo. If you’re still working on the analysis, you can ignore this warning. When you’re finished, you can run wflow_publish
to commit the R Markdown file and build the HTML.
Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.
The results in this page were generated with repository version 5f827bc. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.
Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish
or wflow_git_commit
). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:
Ignored files:
Ignored: .DS_Store
Ignored: .Rproj.user/
Ignored: analysis/.DS_Store
Ignored: code/.DS_Store
Ignored: code/burden_msprime/.DS_Store
Ignored: code/burden_msprime/generate_burden/.DS_Store
Ignored: code/burden_msprime/notebooks/.DS_Store
Ignored: code/fine_mapping/.DS_Store
Ignored: code/fine_mapping/notebooks/.DS_Store
Ignored: code/gwas/.DS_Store
Ignored: code/gwas/grid/.DS_Store
Ignored: code/gwas/grid/tau-9/.DS_Store
Ignored: code/gwas/grid/tau-9/scripts/.DS_Store
Ignored: code/gwas/grid/tau-9/scripts/generate_genotypes/.DS_Store
Ignored: code/gwas/grid/tau-9/scripts/gwas/.DS_Store
Ignored: code/gwas/grid/tau100/.DS_Store
Ignored: code/gwas/grid/tau100/scripts/.DS_Store
Ignored: code/gwas/grid/tau100/scripts/generate_genotypes/.DS_Store
Ignored: code/gwas/grid/tau100/scripts/gwas/.DS_Store
Ignored: code/gwas/notebooks/.DS_Store
Ignored: code/gwas/ukb/.DS_Store
Ignored: code/gwas/ukb/scripts/.DS_Store
Ignored: code/imputation/.DS_Store
Ignored: code/imputation/notebooks/.DS_Store
Ignored: code/optimize_migration_rate/.DS_Store
Ignored: code/optimize_migration_rate/complex_dem/.DS_Store
Ignored: code/optimize_migration_rate/grid/.DS_Store
Ignored: code/pca_plots/.DS_Store
Ignored: code/pca_plots/notebooks/.DS_Store
Ignored: code/prs/.DS_Store
Ignored: code/prs/complex_dem/.DS_Store
Ignored: code/prs/grid/.DS_Store
Ignored: code/prs/notebooks/.DS_Store
Ignored: code/prs/ukb/.DS_Store
Ignored: code/qqplots/.DS_Store
Ignored: code/qqplots/fixed_effects/.DS_Store
Ignored: code/qqplots/lmms/.DS_Store
Ignored: code/revisions/.DS_Store
Ignored: code/shared_scripts/gwas/
Ignored: code/shared_scripts/prs/
Ignored: code/shared_scripts/simphenotype/
Ignored: code/sib_analysis/.DS_Store
Ignored: code/sib_analysis/complex_dem/.DS_Store
Ignored: code/sib_analysis/grid/.DS_Store
Ignored: code/sib_analysis/grid/tau100/.DS_Store
Ignored: code/sib_analysis/grid/tau100/mates/
Ignored: code/simulating_genotypes/.DS_Store
Ignored: code/simulating_genotypes/grid/.DS_Store
Ignored: code/simulating_phenotypes/.DS_Store
Ignored: data/.DS_Store
Ignored: data/burden_msprime/
Ignored: data/burden_msprime2/
Ignored: data/gwas/
Ignored: data/ukmap/
Ignored: output/plots/
Unstaged changes:
Modified: analysis/index.Rmd
Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.
These are the previous versions of the repository in which changes were made to the R Markdown (analysis/index.Rmd
) and HTML (docs/index.html
) files. If you’ve configured a remote Git repository (see ?wflow_git_remote
), click on the hyperlinks in the table below to view the files as they were in that past version.
File | Version | Author | Date | Message |
---|---|---|---|---|
Rmd | 6b2564f | Arslan-Zaidi | 2020-12-22 | reorganized for deployment |
html | 6b2564f | Arslan-Zaidi | 2020-12-22 | reorganized for deployment |
html | 5bce003 | Arslan-Zaidi | 2020-12-22 | added wflow builds |
html | 99813ec | Arslan-Zaidi | 2020-12-03 | organizing workflowr project |
Rmd | 31a95ef | Arslan Zaidi | 2020-11-23 | Start workflowr project. |
This repository contains scripts and code to carry out the analyses in:
Even though the code should be annotated and I’ve included descriptions where necessary, I would not use this as a self-sustaining resource. I would read the paper first, where things are described in detail and come here if you need to find code for a specific set of analyses. If you use any of the code here, please cite the paper above.
This site describes the code used for most of the post-simulation analyses.
Code for the plot of PCA on genotype data simulated under the recent and perpetual structure model is detailed here:
Simulation of phenotypes (both non-heritable and heritable) are described here.
Code for QQ-plot of GWAS on non-heritable phenotypes under different demographic histories is detailed here
Code for QQ-plot of gene burden tests is detailed here
Code for Gini curves showing the geographically clustered common and rare variants and gene burden can be found here
Code for studying the population structure in polygenic scores simulated under the ‘grid’ model of population structure is here
Code for studying the population structure in the more complex model of population structure can be found here
Code for studying the population structure in polygenic scores generated from sibling GWAS can be found here
Code for studying the effect of fine-mapping vs clumping+threshold on stratification and prediction accuracy of polygenic scores can be found here
Code for studying the effect of variant ascertainment schemes and effect size (re-)estimation can be found here
The wiki page contains additional information on how scripts were chained for upstream activities (e.g. simulation, data wrangling etc.).
code/optimize_migration_rate : scripts used to choose migration rate between demes such that the Fst and genomic inflation (on birthplace) matches that observed in the UK Biobank.
code/simulating_genotypes : scripts to carry out msprime simulations, output the genotypes from these simulations, and run PCA on those genotypes. The folder is organized further by different demographic models used: - /grid : Simulations carried out for individuals sampled in a 6x6 square grid. - tau-9 : Individuals sampled in a 6x6 grid where all demes remain structured back in time (i.e. perpetual model) - tau100 : Individuals sampled in a 6x6 grid where all demes become panmictic 100 generations in the past. - /ukb : Simulations carried out for individuals sampled from 35 demes arranged according to the geographic structure in England and Wales.
code/pca_plots : scripts used to generate scatter plots of eigenvectors from PCA carried out on simulated genotype data
code/gwas : scripts used to simulate phenotypes, carry out GWAS, and construct polygenic scores.
code/burden_msprime: scripts used to carry out burden tests