This repository represents a manubot version of the green paper, whose prose is included in the content folder. The fully executed green paper can be found at:
- HTML manuscript: https://Sage-Bionetworks.github.io/governanceGreenPaper/
- PDF manuscript: https://Sage-Bionetworks.github.io/governanceGreenPaper/manuscript.pdf
This report provides a landscape analysis of of models of governance for open data sharing based on our observations in the biomedical sciences. We offer an overview of those observations and show areas where we think this work can expand to supply further support for open data sharing outside the sciences.
The central argument of this paper is that the “right” system of governance is determined by first understanding the nature of the collaborative activities intended. These activities map to types of governance structures, which in turn can be built out of standardized parts — what we call governance design patterns. In this way, governance for data science can be easy to build, follow key laws and ethics regimes, and enable innovative models of collaboration. We provide an initial survey of structures and design patterns, as well as examples of how we leverage this approach to rapidly build out ethics-centered governance in biomedical research.
As this paper itself will be deposited in GitHub, we also envision a contributory process whereby this inventory can be extended with more resources and links over time. We can envision communities using these design resources to create clearly governed networks. We can also imagine small private collectives amongst corporations and their partners, both academic and smaller businesses, using these designs as a “stack” to govern data science beyond biomedical research.
While there is no one-size-fits-all solution, we argue for learning from ongoing data science collaborations and building on from existing standards and tools. And in so doing, we argue for data governance as a discipline worthy of expertise, attention, standards, and innovation.
Except when noted otherwise, the entirety of this repository is licensed under a CC BY 4.0 License (LICENSE.md
), which allows reuse with attribution.
Please attribute by linking to https://github.com/Sage-Bionetworks/governanceGreenPaper.
Since CC BY is not ideal for code and data, certain repository components are also released under the CC0 1.0 public domain dedication (LICENSE-CC0.md
).
All files matched by the following glob patterns are dual licensed under CC BY 4.0 and CC0 1.0:
*.sh
*.py
*.yml
/*.yaml
*.json
*.bib
*.tsv
.gitignore
All other files are only available under CC BY 4.0, including:
*.md
*.html
*.pdf
*.docx