Skip to content

Commit 03ef787

Browse files
authored
Merge pull request #109 from UBC-MDS/improve_reproducib
update installation and usage instructions
2 parents b521a9c + 42aad41 commit 03ef787

File tree

1 file changed

+19
-4
lines changed

1 file changed

+19
-4
lines changed

README.md

Lines changed: 19 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,22 @@ Motivation behind this project: http://third-bit.com/2017/09/30/git-graphs-and-e
2222
- [Blog](https://ubc-mds.github.io/RStudio-GitHub-Analysis/)
2323

2424
## Usage
25-
PLACEHOLDER
25+
Run the following commands to reproduce this analysis:
26+
```{bash}
27+
snakemake get_ght_data # Downloads GH Torrent data from figshare. Be aware that the file is quite large, and downloading can take 1-2 hours.
28+
29+
snakemake run_analysis # Run our pipeline; generate embeddings, clusters, tsne graph, motif report, etc.
30+
31+
snakemake generate_images # Generate images of our most important findings.
32+
```
33+
34+
To change parameters from the command line, simply put `--config param=value` after your snakemake call. For a full list of configurable parameters, see the `config.json` file in the root directory of this project.
35+
For example, if you wanted to run the analysis with 5 workers instead of the default, run:
36+
37+
```{bash}
38+
snakemake run_analysis --config n_workers=5
39+
```
40+
2641

2742
### Example
2843
From the root directory, run:
@@ -32,10 +47,10 @@ python src/python src/github_analysis/main.py -dp "/home/rayce/Assignments/Capst
3247
```
3348

3449
## Installation instructions
35-
To get credentials file for GitHub Torrent Google Cloud (necessary for re-running the pipeline to generate images):
50+
First, to get credentials file neccessary for pulling the GitHub Torrent from Google Cloud (necessary for re-generating images for our analysis):
3651

37-
- Follow the instructions here to create and download a credentials file: https://developers.google.com/adwords/api/docs/guides/authentication#generate_oauth2_credentials
38-
- Change the name of the file to `credentials_file.json` and put it in the root directory of the project (a sample file with the same name is included as a reference).
52+
- Follow the instructions under 'Set up a service account' to create and download a credentials file: https://cloud.google.com/video-intelligence/docs/common/auth
53+
- Change the name of the file to `credentials_file.json` and put it in the root directory of the project (a sample file with the name `credentials_file_EXAMPLE.json` is included as a reference).
3954

4055

4156
## Data Repository

0 commit comments

Comments
 (0)