Analysis of phenotypic data from the Child Mind Institute's Healthy Brain Network Initiative
- Clone Repo
Clone the repo to your own path on OpenMind at /om2/user/<username>:
git clone git@github.com:maedbhk/healthy_brain_network.git
- Activate Virtual Environment
You can use either pipenv or conda for virtual environment and Python package management;
see OpenMind Setup for more detailed instructions on setting up virtual environments on OpenMind
- Install editable package (make sure virtual env is activated)
pip install -e .
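To verify the editable install, a minimal check (assumes the virtual env is active; `hbn` is the package installed by setup.py):

```python
# quick sanity check that the editable install worked (run inside the activated env)
import hbn
from hbn.constants import DATA_DIR

print(hbn.__file__)  # should point into your cloned repo, not site-packages
print(DATA_DIR)      # the data path set in constants.py (see below)
```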
- Activate jupyter notebook kernel
To run Jupyter notebooks using the modules installed in the virtual env, run the following command in the top-level directory of the repo:
ipython kernel install --name "hbn" --user
- Setting Paths and Accessing Data
see OpenMind Setup for more detailed instructions on setting paths on OpenMind
- Data are stored on OpenMind here: /om2/user/maedbh/hbn_data
- Create symlinks from this folder (or copy over the hbn_data/raw folder) to your directory so you can read/write new files in your own path (a symlink sketch in Python follows the commands below). Example commands:
cd /om2/user/<username>
mkdir hbn_data
cp -R /om2/user/maedbh/hbn_data/raw /om2/user/<username>/hbn_data/
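If you would rather symlink than copy, a minimal pathlib sketch (equivalent to `ln -s`; replace <username> with your own OpenMind username):

```python
# sketch: symlink the shared raw data instead of copying it
from pathlib import Path

src = Path("/om2/user/maedbh/hbn_data/raw")      # shared raw data (read from here)
dst = Path("/om2/user/<username>/hbn_data/raw")  # your own hbn_data directory

dst.parent.mkdir(parents=True, exist_ok=True)    # create hbn_data/ if needed
if not dst.exists():
    dst.symlink_to(src, target_is_directory=True)
```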
- Go to constants.py and set DATA_DIR to the full path of your top-level hbn_data directory
For example: DATA_DIR = PosixPath("/om2/user/<username>/hbn_data")
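For orientation, the relevant lines of constants.py might look like this (a sketch; only DATA_DIR is described here, the other path constants are listed at the end of this README):

```python
# hbn/constants.py (sketch; replace <username> with your own)
from pathlib import PosixPath

DATA_DIR = PosixPath("/om2/user/<username>/hbn_data")  # top-level hbn_data directory
```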
- You can explore the HBN data dictionary (Release9_DataDic), which is located on OpenMind at hbn_data/raw/phenotype
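To browse the data dictionary programmatically, a sketch assuming it ships as a spreadsheet (check hbn_data/raw/phenotype for the actual filename and format):

```python
import pandas as pd
from hbn.constants import DATA_DIR

# hypothetical filename; look in hbn_data/raw/phenotype for the real one
data_dic = pd.read_excel(DATA_DIR / "raw" / "phenotype" / "Release9_DataDic.xlsx")
print(data_dic.head())
```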
- Feature specs are created using the following commands:
cd /om2/user/<username>/healthy_brain_network/hbn/scripts

# preprocess phenotypic data
python3 preprocess_phenotype.py

# make features for modeling
python3 make_phenotype_specs.py
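Once these scripts have run, the generated files can be inspected directly; a sketch (the filenames here are hypothetical, see FEATURE_DIR in constants.py for the actual location):

```python
import json
import pandas as pd
from hbn.constants import FEATURE_DIR

# feature specs are .json, features/targets are .csv (per the repo layout below)
with open(FEATURE_DIR / "example_spec.json") as f:            # hypothetical filename
    spec = json.load(f)

features = pd.read_csv(FEATURE_DIR / "example_features.csv")  # hypothetical filename
print(spec)
print(features.shape)
```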
- Model spec files (.json) are created using the following commands:
# make model specs
python3 make_phenotype_models.py

# run model pipeline
python3 run_phenotype_models.py --cachedir=/om2/user/<username>/bin/.cache/pydra-ml/cache-wf/
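Model outputs are pickled to MODEL_DIR (see constants.py); a sketch for loading one (the filename is hypothetical):

```python
import pickle
from hbn.constants import MODEL_DIR

# hypothetical filename; run_phenotype_models.py writes .pkl outputs to MODEL_DIR
with open(MODEL_DIR / "example_model_output.pkl", "rb") as f:
    results = pickle.load(f)

print(type(results))
```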
- To run a predictive modeling script on OpenMind:
cd /om2/user/<username>/healthy_brain_network/hpc_scripts
- Open test_phenotype_workflow.sh (vim test_phenotype_workflow.sh) and change the username
- Run the bash script:
sbatch test_phenotype_workflow.sh

The bash script executes the Python script hbn/tests/test_workflow.py
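For quick iteration you can also execute that script directly (a sketch; this mirrors what the bash script runs, minus the SLURM resource allocation, and assumes you are in the top-level repo directory):

```python
# run the workflow test directly instead of via sbatch
import subprocess

subprocess.run(["python3", "hbn/tests/test_workflow.py"], check=True)
```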
Note: the hbn_data folder is stored on OpenMind at /om2/user/maedbh/hbn_data/
├── hbn_data
│   ├── interim        <- Intermediate data that has been transformed (model outputs are stored here)
│   ├── processed      <- The final, canonical data sets for modeling
│   └── raw            <- The original, immutable data dump
PATHS are stored in constants.py:
├── constants.py
│   ├── DATA_DIR       <- top-level directory where **phenotype** folder is stored
│   ├── FEATURE_DIR    <- where feature spec (.json) and features (.csv) files are stored
│   ├── MODEL_SPEC_DIR <- where model specs (.json) are stored
│   ├── MODEL_DIR      <- where model outputs (.pkl) are stored
│   ├── BASH_SCRIPTS   <- where bash scripts (.sh) are stored
│   └── TEST_DIR       <- where test scripts (.py) are stored
├── LICENSE
├── Makefile <- Makefile with commands like `make data` or `make train`
├── README.md <- The top-level README for developers using this project.
│
├── docs <- A default Sphinx project; see sphinx-doc.org for details
│
├── model_specs <- Model Spec files
│
├── features <- Feature spec files and csv files containing features (X) and target (y)
│
├── hpc_scripts <- Bash scripts for running jobs on OpenMind. See `run_phenotype_workflow_openmind.sh` as an example
│
├── notebooks <- Jupyter notebooks. Naming convention is a number (for ordering),
│ the creator's initials, and a short `-` delimited description, e.g.
│ `1.0-jqp-initial-data-exploration`.
│
├── references <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports <- Generated analysis as HTML, PDF, LaTeX, etc.
│ └── figures <- Generated graphics and figures to be used in reporting
│
├── Pipfile <- The file for reproducing the analysis environment, e.g.
│ generated with `$ pipenv install` (to install environment) and `$ pipenv shell` (to activate environment)
│
├── setup.py <- makes project pip installable (pip install -e .) so src can be imported
├── hbn                <- Source code for use in this project.
│   ├── __init__.py    <- Makes src a Python module
│   │
│   ├── constants.py   <- Directories are set here
│   │
│   ├── data           <- Scripts to download or generate data
│   │   └── make_dataset.py
│   │
│   ├── features       <- Scripts to turn raw data into features for modeling
│   │   └── build_features.py
│   │
│   ├── models         <- Scripts to train models and then use trained models to make predictions
│   │   ├── test_models.py
│   │   └── second_level_modeling.py
│   │
│   ├── visualization  <- Scripts to create exploratory and results-oriented visualizations
│   │   └── visualize.py
│   │
│   └── scripts        <- Scripts to run workflow for phenotypic assessment
│       ├── run_phenotype_workflow.py
│       └── feature_embeddings.py
│
└── tox.ini <- tox file with settings for running tox; see tox.readthedocs.io
Project based on the cookiecutter data science project template. #cookiecutterdatascience