I would like to add some sort of categorization of the PhD projects that live on GitHub. e.g. - What is the purpose of the repository? - Code & data for the research project - Thesis - High-level description and documentation of progress? - File format? - .md, .tex, .docx, etc. What kind of information should we collect and track?