
Commit 76139f1

Merge pull request #116 from UBC-MDS/rzitomer-patch-1
Update README.md
2 parents e3d78a6 + 5cd4bcc commit 76139f1

1 file changed

+7
-14
lines changed

README.md

Lines changed: 7 additions & 14 deletions
@@ -21,6 +21,12 @@ Motivation behind this project: http://third-bit.com/2017/09/30/git-graphs-and-e
2121

2222
- [Blog](https://ubc-mds.github.io/RStudio-GitHub-Analysis/)
2323

24+
## Installation instructions
25+
First, to get the credentials file necessary for pulling the GitHub Torrent from Google Cloud (necessary for re-generating images for our analysis):
26+
27+
- Follow the instructions under 'Set up a service account' to create and download a credentials file: https://cloud.google.com/video-intelligence/docs/common/auth
28+
- Change the name of the file to `credentials_file.json` and put it in the root directory of the project (a sample file with the name `credentials_file_EXAMPLE.json` is included as a reference).
29+
2430
## Usage
2531
Run the following commands to reproduce this analysis:
2632
```{bash}
@@ -43,7 +49,7 @@ snakemake run_analysis --config n_workers=5
4349
|Short Name|Long Name|Description|Default|Type|
4450
|-|-|-|-|-|
4551
| -rp | --results_path | The folder to output results of the analysis. e.g. embeddings and plots| ./results/|String|
46-
| -nw | --n_workers | The number of workers to use when running the analysis.| 8| int|
52+
| -nw | --n_workers | The number of workers to use when running the analysis.| 1| int|
4753
| -dp | --data_path | The path to the commits.feather file. e.g. /home/user/RStudio-Data-Repository/clean_data/commits_by_org.feather| /home/user/RStudio-Data-Repository/clean_data/commits_by_org.feather|String|
4854
| -np | --n_projects | The number of projects to sample from the dataset.| 1000| int|
4955
| -mc | --min_commits | The minimum number of commits for a project to be included in the sample.| None | none_or_int|
@@ -54,19 +60,6 @@ snakemake run_analysis --config n_workers=5
5460
| -rs | --random_state | The random state to initialize all random states.| 1| int|
5561

5662

57-
### Example
58-
From the root directory, run:
59-
60-
```{bash}
61-
python src/python src/github_analysis/main.py -dp "/home/rayce/Assignments/Capstone/RStudio-Data-Repository/clean_data/commits_by_org.feather".py
62-
```
63-
64-
## Installation instructions
65-
First, to get the credentials file necessary for pulling the GitHub Torrent from Google Cloud (necessary for re-generating images for our analysis):
66-
67-
- Follow the instructions under 'Set up a service account' to create and download a credentials file: https://cloud.google.com/video-intelligence/docs/common/auth
68-
- Change the name of the file to `credentials_file.json` and put it in the root directory of the project (a sample file with the name `credentials_file_EXAMPLE.json` is included as a reference).
69-
7063

7164
## Data Repositories
7265
[RStudio-Data-Repository](https://github.com/UBC-MDS/RStudio-Data-Repository)
