Thursday, July 27, 2017

Genomes_dnj Notebooks


The results of the genomes_dnj project are contained in a tree of 100 Jupyter notebooks.

The top layer of the tree contains several notebooks that summarize the patterns of human genetic history revealed by the project's study of the 1000 Genomes Phase 3 data.

The next layer is divided into the main hierarchies of human genetic history observed in the study.

For each of the major hierarchies, the bottom layer documents the SNPs in each of the series associated with the hierarchy.  In total, 76 different series of 4 or more SNPs are documented.  Each series notebook provides distributions of the expression of the series' SNPs among the 5008 chromosome samples in the 1000 Genomes Phase 3 data.  It also provides regional population data for the expression of the series and for all of its series associations within the studied interval of chromosome 2.
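As a rough illustration of what a "distribution of expression" can look like, the sketch below counts how many chromosome samples carry 0, 1, 2, ... of the SNPs in one series. The 5008 chromosome samples correspond to two chromosome copies for each of the 2504 individuals in the phase 3 data. The function name `expression_distribution` and the toy data are made up for this example; this is not the code used in the project notebooks, only one plausible way to tabulate such counts.

```python
from collections import Counter


def expression_distribution(haplotypes):
    """Count chromosome samples by how many of a series' SNPs they carry.

    haplotypes: list of rows, one per chromosome sample. Each row is a list
    of 0/1 flags, one per SNP in the series (1 = the sample carries the SNP).
    Returns a list where entry k is the number of samples carrying k SNPs.
    """
    counts = Counter(sum(row) for row in haplotypes)
    n_snps = len(haplotypes[0]) if haplotypes else 0
    return [counts.get(k, 0) for k in range(n_snps + 1)]


# Made-up toy data (NOT project data): 6 chromosome samples, a series of 4 SNPs.
toy = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 1, 0, 0],
]
distribution = expression_distribution(toy)
print(distribution)
```

With real data the input would have 5008 rows, and the entries of the returned list would sum to 5008.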

The easiest way to access the notebooks is to download the whole tree in HTML format from the notebooks html folder on Google Drive.

Another possibility is to download the notebooks in native format.  One method is to clone the master branch of the genomes_dnj repository on GitHub.  An alternative is to download the whole tree from the notebooks folder on Google Drive.

Viewing native notebooks requires a Python installation.  Anaconda2-4.1.1 was used for all of the project work.
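Once the native tree has been downloaded, a short script can confirm that the notebooks arrived intact before opening them in Jupyter. The sketch below walks a local folder, lists every .ipynb file, and reads each one as JSON to report its notebook format version and cell count. The folder name `genomes_dnj_notebooks` is a hypothetical local path, not something defined by the project, and the helper names are invented for this example. Only the standard library is used, so it should run under the Python 2 that ships with Anaconda2 as well as under Python 3.

```python
import io
import json
import os


def find_notebooks(root):
    """Return sorted paths (relative to root) of all .ipynb files under root."""
    found = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Skip Jupyter's autosave copies.
        dirnames[:] = [d for d in dirnames if d != ".ipynb_checkpoints"]
        for name in filenames:
            if name.endswith(".ipynb"):
                full = os.path.join(dirpath, name)
                found.append(os.path.relpath(full, root))
    return sorted(found)


def notebook_summary(path):
    """Return (nbformat major version, number of cells) for one notebook file."""
    with io.open(path, encoding="utf-8") as handle:
        nb = json.load(handle)
    # nbformat 4 keeps cells at the top level; nbformat 3 nests them in worksheets.
    cells = nb.get("cells")
    if cells is None:
        cells = [c for ws in nb.get("worksheets", []) for c in ws.get("cells", [])]
    return nb.get("nbformat"), len(cells)


if __name__ == "__main__":
    root = "genomes_dnj_notebooks"  # hypothetical local folder name
    for rel in find_notebooks(root):
        print("%s %s" % (rel, notebook_summary(os.path.join(root, rel))))
```

If the folder does not exist, the script simply prints nothing, since `os.walk` yields no entries for a missing directory.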

The top-level notebooks can be viewed directly online from Anaconda Cloud.

