-
Notifications
You must be signed in to change notification settings - Fork 3
enfascination/wiki_policy_dev_ecosystem
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
BIGSSS project on how wiki language editions evolve policies and influence each other 1. Run SPARQL query to get the list of policy pages in the Wikidata category "Wikimedia policy pages and guidelines" (Q4656150): https://www.wikidata.org/wiki/Q4656150 Script: 01-sparql_query.txt Query: https://w.wiki/5RCg Output: generated_data/policy_page_sitelinks-sparql.csv (symlink) 2. Get the full list of policy pages for each of the policies listed in Wikidata collected from the query in #1. Script: 02-collect_sitelinks_for_all_policy_articles.py Inputs: generated_data/policy_page_sitelinks-sparql.csv (#1) Outputs: edge_list.csv policy_page_links.csv 3. Enrich the metadata Script: 03-collect_policy_level_metadata.py Inputs: generated_data/policy_page_links.csv Outputs: generated_data/policy_page_links-post03.csv 4. Test the hypothesis that there are more than one policy development path, representing the problem as a sequence clustering. Script is is a stochastic numerical optimizer performing many fits for an optimal allocation of sequences to k groups. This can be run for k={1,2,3,4}, possibly higher, without real edits (perhaps turning some blocks of code on and off. Script: `04-cluster_fit.r` Inputs: some version of `generated_data/policy_page_links-post03.csv` Outputs: `.Rdata` files of arrays of the population of fits rather, ac[k]_results.Rdata` 5. Generate analytics on sequence results, for robustness and presentation. to perform analyses to understand the characteristics of the population of clusters. important for descriptives and robustness checks. Latter for now produces a few simple descriptive plots. Scripts: `05-analyze_clusters.r`, `06-figures.r` Inputs: The output of `04-cluster_fit.r`, `.Rdata` files Outputs: textual outputs to shell and some images
About
BIGSSS projecy on policy evolution on online communities
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published