-
Notifications
You must be signed in to change notification settings - Fork 29
Description
Chapter 0 - Intro
Link 0.A
dime-data-handbook/chapters/0-introduction.tex
Lines 321 to 324 in e31c9f3
\textbf{Stata},\sidenote{\citet{statacorp2019stata}} | |
as a proprietary software, is the notable exception here | |
due to its persistent popularity in development economics and econometrics.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Software_Tools}} |
This points to https://dimewiki.worldbank.org/Software_Tools which is a page that lists some of the software that we use at DIME (Ex. Stata, R, ArcGis), but it lists some programs that I think we do not use (Ex. Eviews) and some important ones that we do not mention (Ex. Python). I am not sure what the main purpose of this wiki page is. It is too brief to be a guide to these software, and if it is just a list of examples commonly used at DIME we should make sure the list is up to date and have links to installation pages or other resources for the software
Chapter 1
Link 1.A
dime-data-handbook/chapters/1-reproducibility.tex
Lines 96 to 103 in e31c9f3
Registration of research studies is an increasingly common practice, | |
and more journals are beginning to require | |
the registration of studies they publish.\sidenote{\citet{vilhuber2020report}} | |
Study registration intended to ensure that a complete record of research inquiry is easily available.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Study_Registration}} | |
Registering research studies ensures that future scholars can quickly | |
find out what work has been carried out on a given question, | |
even if some or all of the work done never results in formal publication. |
This wiki page https://dimewiki.worldbank.org/Study_Registration is just a stub, we need to write this article before we can describe it in the link in the book
Link 1.B
dime-data-handbook/chapters/1-reproducibility.tex
Lines 173 to 177 in e31c9f3
\textbf{Registered reports} | |
take the process of pre-specifying a complex research design | |
to the level of a formal publication.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Registered_Reports}}\index{registered reports} | |
In a registered report, a journal or other publisher |
This wiki page https://dimewiki.worldbank.org/Registered_Reports is just a stub, we need to write this article before we can describe it in the link in the book
Link 1.C
dime-data-handbook/chapters/1-reproducibility.tex
Lines 208 to 212 in e31c9f3
This means that readers are able to judge for themselves whether the research was done well | |
and the decision-making process was sound. | |
If the research is well-structured, and all of the relevant documentation\sidenote{ | |
\url{https://dimewiki.worldbank.org/Research\_Documentation}}\index{research documentation} | |
is shared, it is easy for the reader to understand the analysis fully. |
This wiki page https://dimewiki.worldbank.org/Research_Documentation is just a stub, we need to write this article before we can describe it in the link in the book
Chapter 2
Link 2.A
dime-data-handbook/chapters/2-collaboration.tex
Lines 73 to 77 in e31c9f3
Follow the \textbf{3-2-1 rule}: maintain 3 copies of all original or irreplaceable data, | |
on at least 2 different hardware devices you have access to, | |
with 1 offsite storage method.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Data_Storage}} | |
Chapter 4 provides a protocol for implementing this. |
This wiki page https://dimewiki.worldbank.org/Data_Storage is just a stub, we need to write this article before we can describe it in the link in the book
Link 2.B
dime-data-handbook/chapters/2-collaboration.tex
Lines 98 to 102 in e31c9f3
When you are working with others, you will most likely be using | |
some kind of \textbf{file sharing} software.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Collaboration_Tools}} | |
\index{file sharing} | |
The exact services you use will depend on your tasks, |
This wikipage https://dimewiki.worldbank.org/wiki/Collaboration_Tools is quite thin and does include other collaboration tools, but nothing about file sharing software.
Link 2.C
dime-data-handbook/chapters/2-collaboration.tex
Lines 248 to 250 in e31c9f3
You might also consider using an external editor for your R or Stata code.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Code_Editors}} | |
These editors offer great accessibility and quality features. |
This wiki page https://dimewiki.worldbank.org/wiki/Code_Editors is just a stub, we need to write this article before we can describe it in the link in the book
Link 2.D
dime-data-handbook/chapters/2-collaboration.tex
Lines 564 to 570 in e31c9f3
\textbf{personally-identifying information (PII)}.\index{personally-identifying information (PII)}\sidenote{ | |
\textbf{Personally-identifying information:} any piece or set of information | |
that can be used to identify an individual research subject. | |
\url{https://dimewiki.worldbank.org/Protecting_Human_Research_Subjects}} | |
PII is information which can, without any transformation or linkage, | |
be used to identify individual people, households, firms, | |
(or other units) in your data. |
Two sections in this wiki article https://dimewiki.worldbank.org/wiki/Protecting_Human_Research_Subjects are just stubs
Link 2.E
dime-data-handbook/chapters/2-collaboration.tex
Lines 583 to 588 in e31c9f3
There is no one-size-fits-all solution to determine what is PII, | |
research teams have to use careful judgment in each case to avoid statistical disclosure.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Data_Release}}\index{statistical disclosure}\index{data release} | |
It is important to keep in mind that data privacy principles apply | |
not only for the respondent giving you the information | |
but also for their household members or other individuals who are included in the data. |
This wiki page is incomplete https://dimewiki.worldbank.org/wiki/Data_Release. Can't this be combined with https://dimewiki.worldbank.org/wiki/Publishing_Data?
Chapter 3
Link 3.A
dime-data-handbook/chapters/3-measurement.tex
Lines 116 to 122 in e31c9f3
and the name of the \textbf{project ID}\sidenote{ | |
\textbf{Project ID:} The main ID used in your project to identify | |
observations. | |
You should never have multiple project IDs for the same unit of observation. | |
The project ID must uniquely and fully identify all observations in the project. | |
See \url{https://dimewiki.worldbank.org/Project\_ID} for more details.} | |
variable for that unit of observation. |
This wiki page https://dimewiki.worldbank.org/Project_ID is just a stub, we need to write this article before we can describe it in the link in the book
Link 3.B
dime-data-handbook/chapters/3-measurement.tex
Lines 498 to 502 in e31c9f3
For any study with an ex-ante design, | |
\textbf{monitoring data}\index{monitoring data}\sidenote{ | |
\url{https://dimewiki.worldbank.org/Monitoring\_Data}} | |
is very important for understanding if the | |
research design corresponds to reality. |
This page has been deleted https://dimewiki.worldbank.org/Monitoring_Data
Chapter 4
Link 4.A
dime-data-handbook/chapters/4-acquisition.tex
Lines 125 to 128 in e31c9f3
Data ownership\sidenote{\url{https://dimewiki.worldbank.org/Data_Ownership}}\index{data ownership} | |
can sometimes be challenging to establish, | |
as various jurisdictions have differing laws regarding data and information, | |
and the research team may have their own information regulations. |
This wiki page https://dimewiki.worldbank.org/Data_Ownership is just a stub, we need to write this article before we can describe it in the link in the book
Link 4.B
dime-data-handbook/chapters/4-acquisition.tex
Lines 184 to 188 in e31c9f3
Data licensing\sidenote{ | |
\url{https://dimewiki.worldbank.org/Data_License_Agreement}} | |
is the formal act of the dataset owner | |
giving some data rights to a specific user, | |
while retaining ownership of the dataset. |
This wiki page https://dimewiki.worldbank.org/Data_License_Agreement is just a stub, we need to write this article before we can describe it in the link in the book
Link 4.C
dime-data-handbook/chapters/4-acquisition.tex
Lines 298 to 303 in e31c9f3
It is essential to start with a clear understanding of the | |
\textbf{theory of change}\sidenote{ | |
\url{https://dimewiki.worldbank.org/Theory\_of\_Change}}\index{theory of change} | |
and \textbf{research design}\sidenote{ | |
\url{https://dimewiki.worldbank.org/Research_Design}}\index{research design} | |
for your project. |
This wiki page https://dimewiki.worldbank.org/wiki/Research_Design is just a stub, we need to write this article before we can describe it in the link in the book
Link 4.D
dime-data-handbook/chapters/4-acquisition.tex
Lines 340 to 342 in e31c9f3
The survey pilot includes three steps: | |
a \textbf{pre-pilot}, a \textbf{content-focused pilot}, and a \textbf{data-focused pilot}.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Structuring\_a\_Survey\_Pilot}} |
This wiki page https://dimewiki.worldbank.org/wiki/Structuring_a_Survey_Pilot is linked to to describe the three steps of a pilot pre-pilot, content-focused pilot, and data-focused pilot. That is not mentioned in this wiki article. That is mentioned in the https://dimewiki.worldbank.org/Survey_Pilot page linked to a few rows above. Other people are better on this than me. Is there something I am missing?
Link 4.E
dime-data-handbook/chapters/4-acquisition.tex
Lines 438 to 442 in e31c9f3
We recommend using descriptive names with clear prefixes so that variables | |
within a module stay together when sorted alphabetically.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Variable_Names}}\index{variable naming} | |
Variable names should never include spaces or mixed cases | |
(we prefer all-lowercase naming). |
This page https://dimewiki.worldbank.org/wiki/Variable_Names is a stub but should probably be able to be combined with https://dimewiki.worldbank.org/wiki/Naming_Conventions. Or is it something specific to a variables names in survey that justify a separate page? Doesn't seem like it to me, but someone must have created this stub.
Chapter 5
Link 5.A
dime-data-handbook/chapters/5-processing.tex
Lines 104 to 109 in a6b91ce
It is also possible for a raw dataset to not include an unique identifier, | |
or that the identifier is not a suitable \textbf{project ID}.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Project\_ID}} | |
Suitable project IDs should, for example, not involve long strings | |
that are difficult to work with, such as a name, | |
or be an ID that is known outside the research team. |
This wiki page https://dimewiki.worldbank.org/Project_ID is just a stub, we need to write this article before we can describe it in the link in the book
Link 5.B
dime-data-handbook/chapters/5-processing.tex
Lines 454 to 458 in a6b91ce
Some electronic surveys surveys also provide a unique opportunity | |
to do audits through audio recordings of the interview, | |
typically short recordings triggered at random throughout the questionnaire. | |
\textbf{Audio audits}\sidenote{\url{https://dimewiki.worldbank.org/Random\_Audio\_Audits}} | |
are a useful means to assess whether enumerators are conducting interviews as expected. |
This wiki page https://dimewiki.worldbank.org/Random_Audio_Audits is not much more than a stub, we need to complete this article before we can describe it in the link in the book
Link 5.C
dime-data-handbook/chapters/5-processing.tex
Lines 481 to 483 in a6b91ce
Most development data involves human subjects.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Human\_Subjects\_Approval}} | |
\index{human subjects} |
Two sections in this wiki article https://dimewiki.worldbank.org/wiki/Protecting_Human_Research_Subjects are just stubs
Chapter 6
Link 6.A
dime-data-handbook/chapters/6-analysis.tex
Lines 139 to 142 in a6b91ce
To create the analysis dataset, | |
it is typically necessary to combine information | |
from different data sources.\sidenote{\url{ | |
https://dimewiki.worldbank.org/Data\_Integration}} |
This page https://dimewiki.worldbank.org/Data_Integration is not much more than a stub
Link 6.B
dime-data-handbook/chapters/6-analysis.tex
Lines 238 to 244 in a6b91ce
At this point, you will also need to decide | |
how to handle any outliers or unusual values identified during data cleaning. | |
How to treat outliers is a research question.\sidenote{ | |
\url{https://dimewiki.worldbank.org/Variable_Construction\#Dealing_with_outliers}}\index{outliers} | |
There are multiple possible approaches, | |
and the best choice for a particular case | |
will depend on the objectives of the analysis. |
This wiki page https://dimewiki.worldbank.org/Variable_Construction is just a stub, we need to write this article before we can describe it in the link in the book