Commit 4d9753b

add updated config file with snakemake
1 parent d1853f3 commit 4d9753b

2 files changed: 21 additions and 10 deletions

Snakefile

Lines changed: 12 additions & 10 deletions
@@ -1,18 +1,20 @@
+configfile: "config.json"
+
 rule run_analysis:
     input:
-        data_path = "/Users/richiezitomer/Documents/RStudio-Data-Repository/clean_data/commits.feather"
+        data_path = "/Users/richiezitomer/Documents/RStudio-Data-Repository/clean_data/commits_by_org.feather"
     output:
         results_path = directory("results/")
     params:
-        python_hash_seed = 0,
-        n_workers = 1,
-        n_projects = 1000,
-        min_commits = None,
-        min_count = 5,
-        n_personas = 5,
-        n_neurons = 128,
-        n_iter = 10,
-        random_state = 1
+        python_hash_seed = config["python_hash_seed"],
+        n_workers = config["n_workers"],
+        n_projects = config["n_projects"],
+        min_commits = config["min_commits"],
+        min_count = config["min_count"],
+        n_personas = config["n_personas"],
+        n_neurons = config["n_neurons"],
+        n_iter = config["n_iter"],
+        random_state = config["random_state"]
     shell:
         "PYTHONHASHSEED={params.python_hash_seed} python src/github_analysis/main.py -dp {input.data_path} -rp {output.results_path} -nw {params.n_workers} -np {params.n_projects} -mc {params.min_commits} -mcount {params.min_count} -nps {params.n_personas} -nn {params.n_neurons} -ni {params.n_iter} -rs {params.random_state}"
 

config.json

Lines changed: 9 additions & 0 deletions
@@ -0,0 +1,9 @@
+{"python_hash_seed": 0,
+"n_workers": 1,
+"n_projects": 1000,
+"min_commits": "None",
+"min_count": 5,
+"n_personas": 5,
+"n_neurons": 128,
+"n_iter": 10,
+"random_state": 1}
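
Note on the config values: the old Snakefile set min_commits = None (Python's None), while config.json stores the string "None". The sketch below assumes the config file is parsed as ordinary JSON into a Python dict and shows what each spelling becomes. CONFIG_TEXT and load_config are hypothetical names used only for illustration; they are not part of the commit.

```python
import json

# Contents of config.json as added in this commit.
CONFIG_TEXT = """
{"python_hash_seed": 0,
"n_workers": 1,
"n_projects": 1000,
"min_commits": "None",
"min_count": 5,
"n_personas": 5,
"n_neurons": 128,
"n_iter": 10,
"random_state": 1}
"""


def load_config(text):
    """Hypothetical helper: parse the JSON text into a plain dict."""
    return json.loads(text)


config = load_config(CONFIG_TEXT)

# The JSON string "None" stays a str; only JSON null maps to Python's None.
min_commits_is_string = isinstance(config["min_commits"], str)
null_becomes_none = json.loads('{"min_commits": null}')["min_commits"] is None

# With plain Python string formatting, both spellings render identically,
# so a command-line flag built this way would read "-mc None" either way.
flag_from_string = "-mc {}".format(config["min_commits"])
flag_from_null = "-mc {}".format(None)
```

Under these assumptions the rendered shell argument is the same before and after the commit. The difference only matters to code that reads config["min_commits"] directly and compares it against None.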
