diff --git a/.gitignore b/.gitignore index 974dc1146..bd8ec03d7 100644 --- a/.gitignore +++ b/.gitignore @@ -4,3 +4,4 @@ .Rproj.user .RData +*.db diff --git a/02_RProgramming/DataTypes/index.Rmd b/02_RProgramming/DataTypes/index.Rmd index 65eb1ce54..694d14494 100644 --- a/02_RProgramming/DataTypes/index.Rmd +++ b/02_RProgramming/DataTypes/index.Rmd @@ -200,7 +200,9 @@ NAs introduced by coercion > as.logical(x) [1] NA NA NA > as.complex(x) -[1] 0+0i 1+0i 2+0i 3+0i 4+0i 5+0i 6+0i +[1] NA NA NA NA +Warning message: +NAs introduced by coercion ``` --- @@ -472,4 +474,4 @@ Data Types - data frames -- names \ No newline at end of file +- names diff --git a/02_RProgramming/DataTypes/index.html b/02_RProgramming/DataTypes/index.html index 9b50617cb..60f66d06a 100644 --- a/02_RProgramming/DataTypes/index.html +++ b/02_RProgramming/DataTypes/index.html @@ -263,7 +263,9 @@

Explicit Coercion

> as.logical(x) [1] NA NA NA > as.complex(x) -[1] 0+0i 1+0i 2+0i 3+0i 4+0i 5+0i 6+0i +[1] NA NA NA NA +Warning message: +NAs introduced by coercion @@ -636,4 +638,4 @@

Summary

- \ No newline at end of file + diff --git a/02_RProgramming/DataTypes/index.md b/02_RProgramming/DataTypes/index.md index ccd9ff364..694d14494 100644 --- a/02_RProgramming/DataTypes/index.md +++ b/02_RProgramming/DataTypes/index.md @@ -200,7 +200,9 @@ NAs introduced by coercion > as.logical(x) [1] NA NA NA > as.complex(x) -[1] 0+0i 1+0i 2+0i 3+0i 4+0i 5+0i 6+0i +[1] NA NA NA NA +Warning message: +NAs introduced by coercion ``` --- diff --git a/02_RProgramming/assets/img/Thumbs.db b/02_RProgramming/assets/img/Thumbs.db new file mode 100644 index 000000000..cdea17aff Binary files /dev/null and b/02_RProgramming/assets/img/Thumbs.db differ diff --git a/04_ExploratoryAnalysis/assets/img/Thumbs.db b/04_ExploratoryAnalysis/assets/img/Thumbs.db new file mode 100644 index 000000000..966dbaaf7 Binary files /dev/null and b/04_ExploratoryAnalysis/assets/img/Thumbs.db differ diff --git a/06_StatisticalInference/01_01_Introduction/index.pdf b/06_StatisticalInference/01_01_Introduction/index.pdf deleted file mode 100644 index 70d9be1bc..000000000 Binary files a/06_StatisticalInference/01_01_Introduction/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/01_02_Probability/index.pdf b/06_StatisticalInference/01_02_Probability/index.pdf deleted file mode 100644 index b431ce394..000000000 Binary files a/06_StatisticalInference/01_02_Probability/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/01_03_Expectations/index.pdf b/06_StatisticalInference/01_03_Expectations/index.pdf deleted file mode 100644 index c9c43b5a3..000000000 Binary files a/06_StatisticalInference/01_03_Expectations/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/01_04_Independence/index.pdf b/06_StatisticalInference/01_04_Independence/index.pdf deleted file mode 100644 index ba92e4d8e..000000000 Binary files a/06_StatisticalInference/01_04_Independence/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/01_05_ConditionalProbability/index.pdf 
b/06_StatisticalInference/01_05_ConditionalProbability/index.pdf deleted file mode 100644 index 7cbac08c4..000000000 Binary files a/06_StatisticalInference/01_05_ConditionalProbability/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/01_Introduction/fig/fmri-salmon.jpg b/06_StatisticalInference/01_Introduction/fig/fmri-salmon.jpg new file mode 100644 index 000000000..41bb6154b Binary files /dev/null and b/06_StatisticalInference/01_Introduction/fig/fmri-salmon.jpg differ diff --git a/06_StatisticalInference/01_Introduction/index.Rmd b/06_StatisticalInference/01_Introduction/index.Rmd new file mode 100644 index 000000000..dde2d8720 --- /dev/null +++ b/06_StatisticalInference/01_Introduction/index.Rmd @@ -0,0 +1,161 @@ +--- +title : Introduction to statistical inference +subtitle : Statistical inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Statistical inference defined + +Statistical inference is the process of drawing formal conclusions from +data. + +In our class, we will define formal statistical inference as settings where one wants to infer facts about a population using noisy +statistical data where uncertainty must be accounted for. + +--- + +## Motivating example: who's going to win the election? + +In every major election, pollsters would like to know, ahead of the +actual election, who's going to win. Here, the target of +estimation (the estimand) is clear: the percentage of people in +a particular group (city, state, county, country or other electoral +grouping) who will vote for each candidate. + +We cannot poll everyone.
Even if we could, some polled +may change their vote by the time the election occurs. +How do we collect a reasonable subset of data and quantify the +uncertainty in the process to produce a good guess at who will win? + +--- + +## Motivating example: is hormone replacement therapy effective? + +A large clinical trial (the Women’s Health Initiative) published results in 2002 that contradicted prior evidence on the efficacy of hormone replacement therapy for postmenopausal women and suggested a negative impact of HRT for several key health outcomes. **Based on a prespecified statistical protocol, the study was stopped early due to an excess number of negative events.** + +Here there are two inferential problems. + +1. Is HRT effective? +2. How long should we continue the trial in the presence of contrary +evidence? + +See the WHI writing group paper, JAMA 2002, Vol 288:321-333, and Steinkellner et al., Menopause 2012, Vol 19:616-621, for a discussion of the long-term impacts. + +--- + +## Motivating example +### Brain activation + +![fMRI salmon study](fig/fmri-salmon.jpg 'fMRI salmon study') + +http://www.wired.com/2009/09/fmrisalmon/ + + +--- + +## Summary + +- These examples illustrate many of the difficulties of trying +to use data to create general conclusions about a population. +- Paramount among our concerns are: + - Is the sample representative of the population that we'd like to draw inferences about? + - Are there known and observed, known and unobserved or unknown and unobserved variables that contaminate our conclusions? + - Is there systematic bias created by missing data or the design or conduct of the study? + - What randomness exists in the data and how do we use or adjust for it? Here randomness can either be explicit via randomization +or random sampling, or implicit as the aggregation of many complex unknown processes. + - Are we trying to estimate an underlying mechanistic model of phenomena under study?
+- Statistical inference requires navigating the set of assumptions and +tools and subsequently thinking about how to draw conclusions from data. + +--- +## Example goals of inference + +1. Estimate and quantify the uncertainty of an estimate of +a population quantity (the proportion of people who will + vote for a candidate). +2. Determine whether a population quantity + is a benchmark value ("is the treatment effective?"). +3. Infer a mechanistic relationship when quantities are measured with + noise ("What is the slope for Hooke's law?") +4. Determine the impact of a policy ("If we reduce pollution levels, + will asthma rates decline?") +5. Talk about the probability that something occurs. + +--- +## Example tools of the trade + +1. Randomization: concerned with balancing unobserved variables that may confound inferences of interest +2. Random sampling: concerned with obtaining data that is representative +of the population of interest +3. Sampling models: concerned with creating a model for the sampling +process, the most common being the so-called "iid" model. +4. Hypothesis testing: concerned with decision making in the presence of uncertainty +5. Confidence intervals: concerned with quantifying uncertainty in +estimation +6. Probability models: a formal connection between the data and a population of interest. Often probability models are assumed or are +approximated. +7. Study design: the process of designing an experiment to minimize biases and variability. +8. Nonparametric bootstrapping: the process of using the data to, + with minimal probability model assumptions, create inferences. +9. Permutation, randomization and exchangeability testing: the process +of using data permutations to perform inferences. + +--- +## Different thinking about probability leads to different styles of inference + +We won't spend too much time talking about this, but there are several different +styles of inference. Two broad categories that get discussed a lot are: + +1.
Frequency probability: the long run proportion of + times an event occurs in independent, identically distributed + repetitions. +2. Frequency inference: uses frequency interpretations of probabilities +to control error rates. Answers questions like "What should I decide +given my data, controlling the long run proportion of mistakes I make at +a tolerable level?" +3. Bayesian probability: the probability calculus of beliefs, given that beliefs follow certain rules. +4. Bayesian inference: the use of Bayesian probability representation +of beliefs to perform inference. Answers questions like "Given my subjective beliefs and the objective information from the data, what +should I believe now?" + +Data scientists tend to fall within shades of gray of these and various other schools of inference. + +--- +## In this class + +* In this class, we will primarily focus on basic sampling models, +basic probability models and frequency style analyses +to create standard inferences. +* Being data scientists, we will also consider some inferential strategies that rely heavily on the observed data, such as permutation testing +and bootstrapping. +* As probability modeling will be our starting point, we first build +up basic probability. + +--- +## Where to learn more on the topics not covered + +1. Explicit use of random sampling in inferences: look in references +on "finite population statistics". Used heavily in polling and +sample surveys. +2. Explicit use of randomization in inferences: look in references +on "causal inference" especially in clinical trials. +3. Bayesian probability and Bayesian statistics: look for basic introductory books (there are many). +4. Missing data: well covered in biostatistics and econometric +references; look for references to "multiple imputation", a popular tool for +addressing missing data. +5. Study design: consider looking in the subject matter area that + you are interested in; some examples with rich histories in design: + 1.
The epidemiological literature is very focused on using study design to investigate public health. + 2. The classical development of study design in agriculture broadly covers design and design principles. + 3. The industrial quality control literature covers design thoroughly. + diff --git a/06_StatisticalInference/01_Introduction/index.html b/06_StatisticalInference/01_Introduction/index.html new file mode 100644 index 000000000..2772c7646 --- /dev/null +++ b/06_StatisticalInference/01_Introduction/index.html @@ -0,0 +1,361 @@ + + + + Introduction to statistical inference + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

Introduction to statistical inference

+

Statistical inference

+

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

Statistical inference defined

+
+
+

Statistical inference is the process of drawing formal conclusions from +data.

+ +

In our class, we will define formal statistical inference as settings where one wants to infer facts about a population using noisy +statistical data where uncertainty must be accounted for.

+ +
+ +
+ + +
+

Motivating example: who's going to win the election?

+
+
+

In every major election, pollsters would like to know, ahead of the +actual election, who's going to win. Here, the target of +estimation (the estimand) is clear: the percentage of people in +a particular group (city, state, county, country or other electoral +grouping) who will vote for each candidate.

+ +

We cannot poll everyone. Even if we could, some polled +may change their vote by the time the election occurs. +How do we collect a reasonable subset of data and quantify the +uncertainty in the process to produce a good guess at who will win?

+ +
+ +
+ + +
+

Motivating example: is hormone replacement therapy effective?

+
+
+

A large clinical trial (the Women’s Health Initiative) published results in 2002 that contradicted prior evidence on the efficacy of hormone replacement therapy for postmenopausal women and suggested a negative impact of HRT for several key health outcomes. Based on a prespecified statistical protocol, the study was stopped early due to an excess number of negative events.

+ +

Here there are two inferential problems.

+ +
    +
  1. Is HRT effective?
  2. +
  3. How long should we continue the trial in the presence of contrary +evidence?
  4. +
+ +

See the WHI writing group paper, JAMA 2002, Vol 288:321-333, and Steinkellner et al., Menopause 2012, Vol 19:616-621, for a discussion of the long-term impacts.

+ +
+ +
+ + +
+

Motivating example

+
+ + +
+ + +
+

Summary

+
+
+
    +
  • These examples illustrate many of the difficulties of trying +to use data to create general conclusions about a population.
  • +
  • Paramount among our concerns are: + +
      +
    • Is the sample representative of the population that we'd like to draw inferences about?
    • +
    • Are there known and observed, known and unobserved or unknown and unobserved variables that contaminate our conclusions?
    • +
    • Is there systematic bias created by missing data or the design or conduct of the study?
    • +
    • What randomness exists in the data and how do we use or adjust for it? Here randomness can either be explicit via randomization +or random sampling, or implicit as the aggregation of many complex unknown processes.
    • +
    • Are we trying to estimate an underlying mechanistic model of phenomena under study?
    • +
  • +
  • Statistical inference requires navigating the set of assumptions and +tools and subsequently thinking about how to draw conclusions from data.
  • +
+ +
+ +
+ + +
+

Example goals of inference

+
+
+
    +
  1. Estimate and quantify the uncertainty of an estimate of +a population quantity (the proportion of people who will +vote for a candidate).
  2. +
  3. Determine whether a population quantity +is a benchmark value ("is the treatment effective?").
  4. +
  5. Infer a mechanistic relationship when quantities are measured with +noise ("What is the slope for Hooke's law?")
  6. +
  7. Determine the impact of a policy ("If we reduce pollution levels, +will asthma rates decline?")
  8. +
  9. Talk about the probability that something occurs.
  10. +
+ +
+ +
+ + +
+

Example tools of the trade

+
+
+
    +
  1. Randomization: concerned with balancing unobserved variables that may confound inferences of interest
  2. +
  3. Random sampling: concerned with obtaining data that is representative +of the population of interest
  4. +
  5. Sampling models: concerned with creating a model for the sampling +process, the most common being the so-called "iid" model.
  6. +
  7. Hypothesis testing: concerned with decision making in the presence of uncertainty
  8. +
  9. Confidence intervals: concerned with quantifying uncertainty in +estimation
  10. +
  11. Probability models: a formal connection between the data and a population of interest. Often probability models are assumed or are +approximated.
  12. +
  13. Study design: the process of designing an experiment to minimize biases and variability.
  14. +
  15. Nonparametric bootstrapping: the process of using the data to, +with minimal probability model assumptions, create inferences.
  16. +
  17. Permutation, randomization and exchangeability testing: the process +of using data permutations to perform inferences.
  18. +
+ +
+ +
+ + +
+

Different thinking about probability leads to different styles of inference

+
+
+

We won't spend too much time talking about this, but there are several different +styles of inference. Two broad categories that get discussed a lot are:

+ +
    +
  1. Frequency probability: the long run proportion of +times an event occurs in independent, identically distributed +repetitions.
  2. +
  3. Frequency inference: uses frequency interpretations of probabilities +to control error rates. Answers questions like "What should I decide +given my data, controlling the long run proportion of mistakes I make at +a tolerable level?"
  4. +
  5. Bayesian probability: the probability calculus of beliefs, given that beliefs follow certain rules.
  6. +
  7. Bayesian inference: the use of Bayesian probability representation +of beliefs to perform inference. Answers questions like "Given my subjective beliefs and the objective information from the data, what +should I believe now?"
  8. +
+ +

Data scientists tend to fall within shades of gray of these and various other schools of inference.

+ +
+ +
+ + +
+

In this class

+
+
+
    +
  • In this class, we will primarily focus on basic sampling models, +basic probability models and frequency style analyses +to create standard inferences.
  • +
  • Being data scientists, we will also consider some inferential strategies that rely heavily on the observed data, such as permutation testing +and bootstrapping.
  • +
  • As probability modeling will be our starting point, we first build +up basic probability.
  • +
+ +
+ +
+ + +
+

Where to learn more on the topics not covered

+
+
+
    +
  1. Explicit use of random sampling in inferences: look in references +on "finite population statistics". Used heavily in polling and +sample surveys.
  2. +
  3. Explicit use of randomization in inferences: look in references +on "causal inference" especially in clinical trials.
  4. +
  3. Bayesian probability and Bayesian statistics: look for basic introductory books (there are many).
  6. +
  7. Missing data: well covered in biostatistics and econometric +references; look for references to "multiple imputation", a popular tool for +addressing missing data.
  8. +
  9. Study design: consider looking in the subject matter area that +you are interested in; some examples with rich histories in design: + +
      +
    1. The epidemiological literature is very focused on using study design to investigate public health.
    2. +
    3. The classical development of study design in agriculture broadly covers design and design principles.
    4. +
    5. The industrial quality control literature covers design thoroughly.
    6. +
  10. +
+ +
+ +
+ + +
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/01_Introduction/index.md b/06_StatisticalInference/01_Introduction/index.md new file mode 100644 index 000000000..dde2d8720 --- /dev/null +++ b/06_StatisticalInference/01_Introduction/index.md @@ -0,0 +1,161 @@ +--- +title : Introduction to statistical inference +subtitle : Statistical inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Statistical inference defined + +Statistical inference is the process of drawing formal conclusions from +data. + +In our class, we will define formal statistical inference as settings where one wants to infer facts about a population using noisy +statistical data where uncertainty must be accounted for. + +--- + +## Motivating example: who's going to win the election? + +In every major election, pollsters would like to know, ahead of the +actual election, who's going to win. Here, the target of +estimation (the estimand) is clear: the percentage of people in +a particular group (city, state, county, country or other electoral +grouping) who will vote for each candidate. + +We cannot poll everyone. Even if we could, some polled +may change their vote by the time the election occurs. +How do we collect a reasonable subset of data and quantify the +uncertainty in the process to produce a good guess at who will win? + +--- + +## Motivating example: is hormone replacement therapy effective?
+ +A large clinical trial (the Women’s Health Initiative) published results in 2002 that contradicted prior evidence on the efficacy of hormone replacement therapy for postmenopausal women and suggested a negative impact of HRT for several key health outcomes. **Based on a prespecified statistical protocol, the study was stopped early due to an excess number of negative events.** + +Here there are two inferential problems. + +1. Is HRT effective? +2. How long should we continue the trial in the presence of contrary +evidence? + +See the WHI writing group paper, JAMA 2002, Vol 288:321-333, and Steinkellner et al., Menopause 2012, Vol 19:616-621, for a discussion of the long-term impacts. + +--- + +## Motivating example +### Brain activation + +![fMRI salmon study](fig/fmri-salmon.jpg 'fMRI salmon study') + +http://www.wired.com/2009/09/fmrisalmon/ + + +--- + +## Summary + +- These examples illustrate many of the difficulties of trying +to use data to create general conclusions about a population. +- Paramount among our concerns are: + - Is the sample representative of the population that we'd like to draw inferences about? + - Are there known and observed, known and unobserved or unknown and unobserved variables that contaminate our conclusions? + - Is there systematic bias created by missing data or the design or conduct of the study? + - What randomness exists in the data and how do we use or adjust for it? Here randomness can either be explicit via randomization +or random sampling, or implicit as the aggregation of many complex unknown processes. + - Are we trying to estimate an underlying mechanistic model of phenomena under study? +- Statistical inference requires navigating the set of assumptions and +tools and subsequently thinking about how to draw conclusions from data. + +--- +## Example goals of inference + +1. Estimate and quantify the uncertainty of an estimate of +a population quantity (the proportion of people who will + vote for a candidate). +2.
Determine whether a population quantity + is a benchmark value ("is the treatment effective?"). +3. Infer a mechanistic relationship when quantities are measured with + noise ("What is the slope for Hooke's law?") +4. Determine the impact of a policy ("If we reduce pollution levels, + will asthma rates decline?") +5. Talk about the probability that something occurs. + +--- +## Example tools of the trade + +1. Randomization: concerned with balancing unobserved variables that may confound inferences of interest +2. Random sampling: concerned with obtaining data that is representative +of the population of interest +3. Sampling models: concerned with creating a model for the sampling +process, the most common being the so-called "iid" model. +4. Hypothesis testing: concerned with decision making in the presence of uncertainty +5. Confidence intervals: concerned with quantifying uncertainty in +estimation +6. Probability models: a formal connection between the data and a population of interest. Often probability models are assumed or are +approximated. +7. Study design: the process of designing an experiment to minimize biases and variability. +8. Nonparametric bootstrapping: the process of using the data to, + with minimal probability model assumptions, create inferences. +9. Permutation, randomization and exchangeability testing: the process +of using data permutations to perform inferences. + +--- +## Different thinking about probability leads to different styles of inference + +We won't spend too much time talking about this, but there are several different +styles of inference. Two broad categories that get discussed a lot are: + +1. Frequency probability: the long run proportion of + times an event occurs in independent, identically distributed + repetitions. +2. Frequency inference: uses frequency interpretations of probabilities +to control error rates.
Answers questions like "What should I decide +given my data, controlling the long run proportion of mistakes I make at +a tolerable level?" +3. Bayesian probability: the probability calculus of beliefs, given that beliefs follow certain rules. +4. Bayesian inference: the use of Bayesian probability representation +of beliefs to perform inference. Answers questions like "Given my subjective beliefs and the objective information from the data, what +should I believe now?" + +Data scientists tend to fall within shades of gray of these and various other schools of inference. + +--- +## In this class + +* In this class, we will primarily focus on basic sampling models, +basic probability models and frequency style analyses +to create standard inferences. +* Being data scientists, we will also consider some inferential strategies that rely heavily on the observed data, such as permutation testing +and bootstrapping. +* As probability modeling will be our starting point, we first build +up basic probability. + +--- +## Where to learn more on the topics not covered + +1. Explicit use of random sampling in inferences: look in references +on "finite population statistics". Used heavily in polling and +sample surveys. +2. Explicit use of randomization in inferences: look in references +on "causal inference" especially in clinical trials. +3. Bayesian probability and Bayesian statistics: look for basic introductory books (there are many). +4. Missing data: well covered in biostatistics and econometric +references; look for references to "multiple imputation", a popular tool for +addressing missing data. +5. Study design: consider looking in the subject matter area that + you are interested in; some examples with rich histories in design: + 1. The epidemiological literature is very focused on using study design to investigate public health. + 2. The classical development of study design in agriculture broadly covers design and design principles. + 3.
The industrial quality control literature covers design thoroughly. + diff --git a/06_StatisticalInference/01_Introduction/index.pdf b/06_StatisticalInference/01_Introduction/index.pdf new file mode 100644 index 000000000..ba632a641 Binary files /dev/null and b/06_StatisticalInference/01_Introduction/index.pdf differ diff --git a/06_StatisticalInference/02_01_CommonDistributions/index.pdf b/06_StatisticalInference/02_01_CommonDistributions/index.pdf deleted file mode 100644 index 633936fe6..000000000 Binary files a/06_StatisticalInference/02_01_CommonDistributions/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/02_02_Asymptopia/index.pdf b/06_StatisticalInference/02_02_Asymptopia/index.pdf deleted file mode 100644 index 7fcb65b90..000000000 Binary files a/06_StatisticalInference/02_02_Asymptopia/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/02_03_tCIs/index.pdf b/06_StatisticalInference/02_03_tCIs/index.pdf deleted file mode 100644 index 855c3502b..000000000 Binary files a/06_StatisticalInference/02_03_tCIs/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/02_04_Likeklihood/index.pdf b/06_StatisticalInference/02_04_Likeklihood/index.pdf deleted file mode 100644 index 85787788c..000000000 Binary files a/06_StatisticalInference/02_04_Likeklihood/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/02_05_Bayes/index.pdf b/06_StatisticalInference/02_05_Bayes/index.pdf deleted file mode 100644 index c8043e4a1..000000000 Binary files a/06_StatisticalInference/02_05_Bayes/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/02_Probability/assets/fig/unnamed-chunk-1.png b/06_StatisticalInference/02_Probability/assets/fig/unnamed-chunk-1.png new file mode 100644 index 000000000..b4fb0bd35 Binary files /dev/null and b/06_StatisticalInference/02_Probability/assets/fig/unnamed-chunk-1.png differ diff --git a/06_StatisticalInference/02_Probability/assets/fig/unnamed-chunk-2.png 
b/06_StatisticalInference/02_Probability/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..ff974fda8 Binary files /dev/null and b/06_StatisticalInference/02_Probability/assets/fig/unnamed-chunk-2.png differ diff --git a/06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-1.png b/06_StatisticalInference/02_Probability/figure/unnamed-chunk-1.png similarity index 100% rename from 06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-1.png rename to 06_StatisticalInference/02_Probability/figure/unnamed-chunk-1.png diff --git a/06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-2.png b/06_StatisticalInference/02_Probability/figure/unnamed-chunk-2.png similarity index 100% rename from 06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-2.png rename to 06_StatisticalInference/02_Probability/figure/unnamed-chunk-2.png diff --git a/06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-4.png b/06_StatisticalInference/02_Probability/figure/unnamed-chunk-4.png similarity index 100% rename from 06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-4.png rename to 06_StatisticalInference/02_Probability/figure/unnamed-chunk-4.png diff --git a/06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-6.png b/06_StatisticalInference/02_Probability/figure/unnamed-chunk-6.png similarity index 100% rename from 06_StatisticalInference/01_02_Probability/figure/unnamed-chunk-6.png rename to 06_StatisticalInference/02_Probability/figure/unnamed-chunk-6.png diff --git a/06_StatisticalInference/02_Probability/index.Rmd b/06_StatisticalInference/02_Probability/index.Rmd new file mode 100644 index 000000000..9f81bb399 --- /dev/null +++ b/06_StatisticalInference/02_Probability/index.Rmd @@ -0,0 +1,586 @@ +<<<<<<< HEAD:06_StatisticalInference/01_02_Probability/index.Rmd +--- +title : Probability +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School 
of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Notation + +- The **sample space**, $\Omega$, is the collection of possible outcomes of an experiment + - Example: die roll $\Omega = \{1,2,3,4,5,6\}$ +- An **event**, say $E$, is a subset of $\Omega$ + - Example: die roll is even $E = \{2,4,6\}$ +- An **elementary** or **simple** event is a particular result + of an experiment + - Example: die roll is a four, $\omega = 4$ +- $\emptyset$ is called the **null event** or the **empty set** + +--- + +## Interpretation of set operations + +Normal set operations have particular interpretations in this setting + +1. $\omega \in E$ implies that $E$ occurs when $\omega$ occurs +2. $\omega \not\in E$ implies that $E$ does not occur when $\omega$ occurs +3. $E \subset F$ implies that the occurrence of $E$ implies the occurrence of $F$ +4. $E \cap F$ implies the event that both $E$ and $F$ occur +5. $E \cup F$ implies the event that at least one of $E$ or $F$ occur +6. $E \cap F=\emptyset$ means that $E$ and $F$ are **mutually exclusive**, or cannot both occur +7. $E^c$ or $\bar E$ is the event that $E$ does not occur + +--- + +## Probability + +A **probability measure**, $P$, is a function from the collection of possible events so that the following hold + +1. For an event $E\subset \Omega$, $0 \leq P(E) \leq 1$ +2. $P(\Omega) = 1$ +3. If $E_1$ and $E_2$ are mutually exclusive events + $P(E_1 \cup E_2) = P(E_1) + P(E_2)$. + +Part 3 of the definition implies **finite additivity** + +$$ +P(\cup_{i=1}^n A_i) = \sum_{i=1}^n P(A_i) +$$ +where the $\{A_i\}$ are mutually exclusive. (Note a more general version of +additivity is used in advanced classes.) 
+ + +--- + + +## Example consequences + +- $P(\emptyset) = 0$ +- $P(E) = 1 - P(E^c)$ +- $P(A \cup B) = P(A) + P(B) - P(A \cap B)$ +- if $A \subset B$ then $P(A) \leq P(B)$ +- $P\left(A \cup B\right) = 1 - P(A^c \cap B^c)$ +- $P(A \cap B^c) = P(A) - P(A \cap B)$ +- $P(\cup_{i=1}^n E_i) \leq \sum_{i=1}^n P(E_i)$ +- $P(\cup_{i=1}^n E_i) \geq \max_i P(E_i)$ + +--- + +## Example + +The National Sleep Foundation ([www.sleepfoundation.org](http://www.sleepfoundation.org/)) reports that around 3% of the American population has sleep apnea. They also report that around 10% of the North American and European population has restless leg syndrome. Does this imply that 13% of people will have at least one of these sorts of sleep problems? + +--- + +## Example continued + +Answer: No, the events are not mutually exclusive. To elaborate let: + +$$ +\begin{eqnarray*} + A_1 & = & \{\mbox{Person has sleep apnea}\} \\ + A_2 & = & \{\mbox{Person has RLS}\} + \end{eqnarray*} +$$ + +Then + +$$ +\begin{eqnarray*} + P(A_1 \cup A_2 ) & = & P(A_1) + P(A_2) - P(A_1 \cap A_2) \\ + & = & 0.13 - \mbox{Probability of having both} + \end{eqnarray*} +$$ +Likely, some fraction of the population has both. + +--- + +## Random variables + +- A **random variable** is a numerical outcome of an experiment. +- The random variables that we study will come in two varieties, + **discrete** or **continuous**. +- Discrete random variables are random variables that take on only a +countable number of possibilities. + * $P(X = k)$ +- Continuous random variables can take any value on the real line or some subset of the real line.
+ * $P(X \in A)$ + +--- + +## Examples of variables that can be thought of as random variables + +- The $(0-1)$ outcome of the flip of a coin +- The outcome from the roll of a die +- The BMI of a subject four years after a baseline measurement +- The hypertension status of a subject randomly drawn from a population + +--- + +## PMF + +A probability mass function evaluated at a value corresponds to the +probability that a random variable takes that value. To be a valid +pmf, a function $p$ must satisfy + + 1. $p(x) \geq 0$ for all $x$ + 2. $\sum_{x} p(x) = 1$ + +The sum is taken over all of the possible values for $x$. + +--- + +## Example + +Let $X$ be the result of a coin flip where $X=0$ represents +tails and $X = 1$ represents heads. +$$ +p(x) = (1/2)^{x} (1/2)^{1-x} ~~\mbox{ for }~~x = 0,1 +$$ +Suppose that we do not know whether or not the coin is fair; let +$\theta$ be the probability of a head expressed as a proportion +(between 0 and 1). +$$ +p(x) = \theta^{x} (1 - \theta)^{1-x} ~~\mbox{ for }~~x = 0,1 +$$ + +--- + +## PDF + +A probability density function (pdf) is a function associated with +a continuous random variable + + *Areas under pdfs correspond to probabilities for that random variable* + +To be a valid pdf, a function $f$ must satisfy + +1. $f(x) \geq 0$ for all $x$ + +2. The area under $f(x)$ is one. + +--- +## Example + +Suppose that the proportion of help calls that get addressed in +a random day by a help line is given by +$$ +f(x) = \left\{\begin{array}{ll} + 2 x & \mbox{ for } 1 > x > 0 \\ + 0 & \mbox{ otherwise} +\end{array} \right. +$$ + +Is this a mathematically valid density? + +--- + +```{r, fig.height = 5, fig.width = 5, echo = TRUE, fig.align='center'} +x <- c(-0.5, 0, 1, 1, 1.5); y <- c( 0, 0, 2, 0, 0) +plot(x, y, lwd = 3, frame = FALSE, type = "l") +``` + +--- + +## Example continued + +What is the probability that 75% or fewer of calls get addressed?
+ +```{r, fig.height = 5, fig.width = 5, echo = FALSE, fig.align='center'} +plot(x, y, lwd = 3, frame = FALSE, type = "l") +polygon(c(0, .75, .75, 0), c(0, 0, 1.5, 0), lwd = 3, col = "lightblue") +``` + +--- +```{r} +1.5 * .75 / 2 +pbeta(.75, 2, 1) +``` +--- + +## CDF and survival function + +- The **cumulative distribution function** (CDF) of a random variable $X$ is defined as the function +$$ +F(x) = P(X \leq x) +$$ +- This definition applies regardless of whether $X$ is discrete or continuous. +- The **survival function** of a random variable $X$ is defined as +$$ +S(x) = P(X > x) +$$ +- Notice that $S(x) = 1 - F(x)$ +- For continuous random variables, the PDF is the derivative of the CDF + +--- + +## Example + +What are the survival function and CDF from the density considered before? + +For $1 \geq x \geq 0$ +$$ +F(x) = P(X \leq x) = \frac{1}{2} Base \times Height = \frac{1}{2} (x) \times (2 x) = x^2 +$$ + +$$ +S(x) = 1 - x^2 +$$ + +```{r} +pbeta(c(0.4, 0.5, 0.6), 2, 1) +``` + +--- + +## Quantiles + +- The $\alpha^{th}$ **quantile** of a distribution with distribution function $F$ is the point $x_\alpha$ so that +$$ +F(x_\alpha) = \alpha +$$ +- A **percentile** is simply a quantile with $\alpha$ expressed as a percent +- The **median** is the $50^{th}$ percentile + +--- +## Example +- We want to solve $0.5 = F(x) = x^2$ +- Resulting in the solution +```{r, echo = TRUE} +sqrt(0.5) +``` +- Therefore, on a random day the median proportion of calls answered is about `r sqrt(0.5)`. +- R can approximate quantiles for you for common distributions + +```{r} +qbeta(0.5, 2, 1) +``` + +--- + +## Summary + +- You might be wondering at this point "I've heard of a median before, it didn't require integration. Where's the data?" +- What we're referring to are **population quantities**. Therefore, the median being + discussed is the **population median**. +- A probability model connects the data to the population using assumptions.
+- Therefore the median we're discussing is the **estimand**; the sample median will be the **estimator** +======= +--- +title : Probability +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Probability + +- In these slides we will cover the basics of probability at a low enough level +to have a basic understanding for the rest of the series +- For a more complete treatment see the class Mathematical Biostatistics Boot Camp 1 + - Youtube: www.youtube.com/playlist?list=PLpl-gQkQivXhk6qSyiNj51qamjAtZISJ- + - Coursera: www.coursera.org/course/biostats + - Git: http://github.com/bcaffo/Caffo-Coursera + + +--- + +## Probability + +Given a random experiment (say rolling a die), a probability measure is a population quantity +that summarizes the randomness. + +Specifically, probability takes a possible outcome from the experiment and: + +- assigns it a number between 0 and 1 +- so that the probability that something occurs is 1 (the die must be rolled) +and +- so that the probability of the union of any two sets of outcomes that have nothing in common (mutually exclusive) +is the sum of their respective probabilities. + + +The Russian mathematician Kolmogorov formalized these rules.
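A minimal R sketch makes these rules concrete for the die roll (the fair-die probabilities below are an illustrative assumption, not part of the formalism):

```r
p <- rep(1/6, 6)        # assign each outcome a number between 0 and 1
all(p >= 0 & p <= 1)    # every assignment lies in [0, 1]
sum(p)                  # the die must be rolled: probabilities total 1
# mutually exclusive events add: P(odd) + P(even) covers everything
sum(p[c(1, 3, 5)]) + sum(p[c(2, 4, 6)])
```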
+ +--- + + +## Rules probability must follow + +- The probability that nothing occurs is 0 +- The probability that something occurs is 1 +- The probability of something is 1 minus the probability that the opposite occurs +- The probability of at least one of + two (or more) things that can not simultaneously occur (mutually exclusive) + is the sum of their + respective probabilities +- If an event A implies the occurrence of event B, then the probability of A +occurring is less than the probability that B occurs +- For any two events the probability that at least one occurs is the sum of their + probabilities minus their intersection. + +--- + +## Example + +The National Sleep Foundation ([www.sleepfoundation.org](http://www.sleepfoundation.org/)) reports that around 3% of the American population has sleep apnea. They also report that around 10% of the North American and European population has restless leg syndrome. Does this imply that 13% of people will have at least one sleep problems of these sorts? + +--- + +## Example continued + +Answer: No, the events can simultaneously occur and so +are not mutually exclusive. To elaborate let: + +--- +## If you want to see the mathematics + +$$ +\begin{eqnarray*} + A_1 & = & \{\mbox{Person has sleep apnea}\} \\ + A_2 & = & \{\mbox{Person has RLS}\} + \end{eqnarray*} +$$ + +Then + +$$ +\begin{eqnarray*} + P(A_1 \cup A_2 ) & = & P(A_1) + P(A_2) - P(A_1 \cap A_2) \\ + & = & 0.13 - \mbox{Probability of having both} + \end{eqnarray*} +$$ +Likely, some fraction of the population has both. + +--- +## Going further + +Probability calculus is useful for understanding the rules that probabilities +must follow. + +However, we need ways to model and think about probabilities for +numeric outcomes of experiments (broadly defined). + +Densities and mass functions for random variables are the best starting point for this. 
+ +Remember, everything we're talking about up to this point is a population quantity, +not a statement about what occurs in the data. +- Where we're going with this: use the data to estimate properties of the population. + +--- +## Random variables + +- A **random variable** is a numerical outcome of an experiment. +- The random variables that we study will come in two varieties, + **discrete** or **continuous**. +- Discrete random variables are random variables that take on only a +countable number of possibilities and we talk about the probability that they +take specific values +- Continuous random variables can conceptually take any value on the real line or some subset of the real line and we talk about the probability that they lie within +some range + +--- + +## Examples of variables that can be thought of as random variables + +Experiments that we use for intuition and building context +- The $(0-1)$ outcome of the flip of a coin +- The outcome from the roll of a die + +Specific instances of treating variables as if random +- The web site traffic on a given day +- The BMI of a subject four years after a baseline measurement +- The hypertension status of a subject randomly drawn from a population +- The number of people who click on an ad +- Intelligence quotients for a sample of children + +--- + +## PMF + +A probability mass function evaluated at a value corresponds to the +probability that a random variable takes that value. To be a valid +pmf, a function $p$ must satisfy + + 1. It must always be larger than or equal to 0. + 2. The probabilities of all the possible values that the random variable can take have to add up to one. + +--- + +## Example + +Let $X$ be the result of a coin flip where $X=0$ represents +tails and $X = 1$ represents heads. +$$ +p(x) = (1/2)^{x} (1/2)^{1-x} ~~\mbox{ for }~~x = 0,1 +$$ +Suppose that we do not know whether or not the coin is fair; let +$\theta$ be the probability of a head expressed as a proportion +(between 0 and 1).
+$$ +p(x) = \theta^{x} (1 - \theta)^{1-x} ~~\mbox{ for }~~x = 0,1 +$$ + +--- + +## PDF + +A probability density function (pdf) is a function associated with +a continuous random variable + + *Areas under pdfs correspond to probabilities for that random variable* + +To be a valid pdf, a function must satisfy + +1. It must be larger than or equal to zero everywhere. + +2. The total area under it must be one. + +--- +## Example + +Suppose that the proportion of help calls that get addressed in +a random day by a help line is given by +$$ +f(x) = \left\{\begin{array}{ll} + 2 x & \mbox{ for } 0 < x < 1 \\ + 0 & \mbox{ otherwise} +\end{array} \right. +$$ + +Is this a mathematically valid density? + +--- + +```{r, fig.height = 5, fig.width = 5, echo = TRUE, fig.align='center'} +x <- c(-0.5, 0, 1, 1, 1.5); y <- c( 0, 0, 2, 0, 0) +plot(x, y, lwd = 3, frame = FALSE, type = "l") +``` + +--- + +## Example continued + +What is the probability that 75% or fewer of calls get addressed? + +```{r, fig.height = 5, fig.width = 5, echo = FALSE, fig.align='center'} +plot(x, y, lwd = 3, frame = FALSE, type = "l") +polygon(c(0, .75, .75, 0), c(0, 0, 1.5, 0), lwd = 3, col = "lightblue") +``` + +--- +```{r} +1.5 * .75 / 2 +pbeta(.75, 2, 1) +``` +--- + +## CDF and survival function +### Certain areas are so useful, we give them names + +- The **cumulative distribution function** (CDF) of a random variable, $X$, returns the probability that the random variable is less than or equal to the value $x$ +$$ +F(x) = P(X \leq x) +$$ +(This definition applies regardless of whether $X$ is discrete or continuous.) +- The **survival function** of a random variable $X$ is defined as the probability +that the random variable is greater than the value $x$ +$$ +S(x) = P(X > x) +$$ +- Notice that $S(x) = 1 - F(x)$ + +--- + +## Example + +What are the survival function and CDF from the density considered before?
+ +For $1 \geq x \geq 0$ +$$ +F(x) = P(X \leq x) = \frac{1}{2} Base \times Height = \frac{1}{2} (x) \times (2 x) = x^2 +$$ + +$$ +S(x) = 1 - x^2 +$$ + +```{r} +pbeta(c(0.4, 0.5, 0.6), 2, 1) +``` + +--- + +## Quantiles + +You've heard of sample quantiles. If you scored at the 95th percentile on an exam, you know +that 95% of people scored worse than you and 5% scored better. +These are sample quantities. Here we define their population analogs. + + +--- +## Definition + +The $\alpha^{th}$ **quantile** of a distribution with distribution function $F$ is the point $x_\alpha$ so that +$$ +F(x_\alpha) = \alpha +$$ +- A **percentile** is simply a quantile with $\alpha$ expressed as a percent +- The **median** is the $50^{th}$ percentile + +--- +## For example + +The $95^{th}$ percentile of a distribution is the point so that: +- the probability that a random variable drawn from the population is less than it is 95% +- the probability that a random variable drawn from the population is more than it is 5% + +--- +## Example +What is the median of the distribution that we were working with before? +- We want to solve $0.5 = F(x) = x^2$ +- Resulting in the solution +```{r, echo = TRUE} +sqrt(0.5) +``` +- Therefore, on a random day the median proportion of calls answered is about `r sqrt(0.5)`. + +--- +## Example continued +R can approximate quantiles for you for common distributions + +```{r} +qbeta(0.5, 2, 1) +``` + +--- + +## Summary + +- You might be wondering at this point "I've heard of a median before, it didn't require integration. Where's the data?" +- What we're referring to are **population quantities**. Therefore, the median being + discussed is the **population median**. +- A probability model connects the data to the population using assumptions.
+- Therefore the median we're discussing is the **estimand**; the sample median will be the **estimator** + + + +>>>>>>> devel:06_StatisticalInference/02_Probability/index.Rmd diff --git a/06_StatisticalInference/02_Probability/index.html b/06_StatisticalInference/02_Probability/index.html new file mode 100644 index 000000000..7c3a70fe4 --- /dev/null +++ b/06_StatisticalInference/02_Probability/index.html @@ -0,0 +1,693 @@
diff --git a/06_StatisticalInference/02_Probability/index.md b/06_StatisticalInference/02_Probability/index.md new file mode 100644 index 000000000..aa9afeb16 --- /dev/null +++ b/06_StatisticalInference/02_Probability/index.md @@ -0,0 +1,341 @@ +--- +title : Probability +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Probability + +- In these slides we will cover the basics of probability at a low enough level +to have a basic understanding for the rest of the series +- For a more complete treatment see the class Mathematical Biostatistics Boot Camp 1 + - Youtube: www.youtube.com/playlist?list=PLpl-gQkQivXhk6qSyiNj51qamjAtZISJ- + - Coursera: www.coursera.org/course/biostats + - Git: http://github.com/bcaffo/Caffo-Coursera + + +--- + +## Probability + +Given a random experiment (say rolling a die), a probability measure is a population quantity +that summarizes the randomness. + +Specifically, probability takes a possible outcome from the experiment and: + +- assigns it a number between 0 and 1 +- so that the probability that something occurs is 1 (the die must be rolled) +and +- so that the probability of the union of any two sets of outcomes that have nothing in common (mutually exclusive) +is the sum of their respective probabilities. + + +The Russian mathematician Kolmogorov formalized these rules.
+ +--- + + +## Rules probability must follow + +- The probability that nothing occurs is 0 +- The probability that something occurs is 1 +- The probability of something is 1 minus the probability that the opposite occurs +- The probability of at least one of + two (or more) things that can not simultaneously occur (mutually exclusive) + is the sum of their + respective probabilities +- If an event A implies the occurrence of event B, then the probability of A +occurring is less than the probability that B occurs +- For any two events the probability that at least one occurs is the sum of their + probabilities minus their intersection. + +--- + +## Example + +The National Sleep Foundation ([www.sleepfoundation.org](http://www.sleepfoundation.org/)) reports that around 3% of the American population has sleep apnea. They also report that around 10% of the North American and European population has restless leg syndrome. Does this imply that 13% of people will have at least one sleep problems of these sorts? + +--- + +## Example continued + +Answer: No, the events can simultaneously occur and so +are not mutually exclusive. To elaborate let: + +--- +## If you want to see the mathematics + +$$ +\begin{eqnarray*} + A_1 & = & \{\mbox{Person has sleep apnea}\} \\ + A_2 & = & \{\mbox{Person has RLS}\} + \end{eqnarray*} +$$ + +Then + +$$ +\begin{eqnarray*} + P(A_1 \cup A_2 ) & = & P(A_1) + P(A_2) - P(A_1 \cap A_2) \\ + & = & 0.13 - \mbox{Probability of having both} + \end{eqnarray*} +$$ +Likely, some fraction of the population has both. + +--- +## Going further + +Probability calculus is useful for understanding the rules that probabilities +must follow. + +However, we need ways to model and think about probabilities for +numeric outcomes of experiments (broadly defined). + +Densities and mass functions for random variables are the best starting point for this. 
+ +Remember, everything we're talking about up to this point is a population quantity, +not a statement about what occurs in the data. +- Where we're going with this: use the data to estimate properties of the population. + +--- +## Random variables + +- A **random variable** is a numerical outcome of an experiment. +- The random variables that we study will come in two varieties, + **discrete** or **continuous**. +- Discrete random variables are random variables that take on only a +countable number of possibilities and we talk about the probability that they +take specific values +- Continuous random variables can conceptually take any value on the real line or some subset of the real line and we talk about the probability that they lie within +some range + +--- + +## Examples of variables that can be thought of as random variables + +Experiments that we use for intuition and building context +- The $(0-1)$ outcome of the flip of a coin +- The outcome from the roll of a die + +Specific instances of treating variables as if random +- The web site traffic on a given day +- The BMI of a subject four years after a baseline measurement +- The hypertension status of a subject randomly drawn from a population +- The number of people who click on an ad +- Intelligence quotients for a sample of children + +--- + +## PMF + +A probability mass function evaluated at a value corresponds to the +probability that a random variable takes that value. To be a valid +pmf, a function $p$ must satisfy + + 1. It must always be larger than or equal to 0. + 2. The probabilities of all the possible values that the random variable can take have to add up to one. + +--- + +## Example + +Let $X$ be the result of a coin flip where $X=0$ represents +tails and $X = 1$ represents heads. +$$ +p(x) = (1/2)^{x} (1/2)^{1-x} ~~\mbox{ for }~~x = 0,1 +$$ +Suppose that we do not know whether or not the coin is fair; let +$\theta$ be the probability of a head expressed as a proportion +(between 0 and 1).
+$$ +p(x) = \theta^{x} (1 - \theta)^{1-x} ~~\mbox{ for }~~x = 0,1 +$$ + +--- + +## PDF + +A probability density function (pdf) is a function associated with +a continuous random variable + + *Areas under pdfs correspond to probabilities for that random variable* + +To be a valid pdf, a function must satisfy + +1. It must be larger than or equal to zero everywhere. + +2. The total area under it must be one. + +--- +## Example + +Suppose that the proportion of help calls that get addressed in +a random day by a help line is given by +$$ +f(x) = \left\{\begin{array}{ll} + 2 x & \mbox{ for } 0 < x < 1 \\ + 0 & \mbox{ otherwise} +\end{array} \right. +$$ + +Is this a mathematically valid density? + +--- + + +```r +x <- c(-0.5, 0, 1, 1, 1.5) +y <- c(0, 0, 2, 0, 0) +plot(x, y, lwd = 3, frame = FALSE, type = "l") +``` + +plot of chunk unnamed-chunk-1 + + +--- + +## Example continued + +What is the probability that 75% or fewer of calls get addressed? + +plot of chunk unnamed-chunk-2 + + +--- + +```r +1.5 * 0.75/2 +``` + +``` +## [1] 0.5625 +``` + +```r +pbeta(0.75, 2, 1) +``` + +``` +## [1] 0.5625 +``` + +--- + +## CDF and survival function +### Certain areas are so useful, we give them names + +- The **cumulative distribution function** (CDF) of a random variable, $X$, returns the probability that the random variable is less than or equal to the value $x$ +$$ +F(x) = P(X \leq x) +$$ +(This definition applies regardless of whether $X$ is discrete or continuous.) +- The **survival function** of a random variable $X$ is defined as the probability +that the random variable is greater than the value $x$ +$$ +S(x) = P(X > x) +$$ +- Notice that $S(x) = 1 - F(x)$ + +--- + +## Example + +What are the survival function and CDF from the density considered before?
+ +For $1 \geq x \geq 0$ +$$ +F(x) = P(X \leq x) = \frac{1}{2} Base \times Height = \frac{1}{2} (x) \times (2 x) = x^2 +$$ + +$$ +S(x) = 1 - x^2 +$$ + + +```r +pbeta(c(0.4, 0.5, 0.6), 2, 1) +``` + +``` +## [1] 0.16 0.25 0.36 +``` + + +--- + +## Quantiles + +You've heard of sample quantiles. If you scored at the 95th percentile on an exam, you know +that 95% of people scored worse than you and 5% scored better. +These are sample quantities. Here we define their population analogs. + + +--- +## Definition + +The $\alpha^{th}$ **quantile** of a distribution with distribution function $F$ is the point $x_\alpha$ so that +$$ +F(x_\alpha) = \alpha +$$ +- A **percentile** is simply a quantile with $\alpha$ expressed as a percent +- The **median** is the $50^{th}$ percentile + +--- +## For example + +The $95^{th}$ percentile of a distribution is the point so that: +- the probability that a random variable drawn from the population is less than it is 95% +- the probability that a random variable drawn from the population is more than it is 5% + +--- +## Example +What is the median of the distribution that we were working with before? +- We want to solve $0.5 = F(x) = x^2$ +- Resulting in the solution + +```r +sqrt(0.5) +``` + +``` +## [1] 0.7071 +``` + +- Therefore, on a random day the median proportion of calls answered is about 0.7071. + +--- +## Example continued +R can approximate quantiles for you for common distributions + + +```r +qbeta(0.5, 2, 1) +``` + +``` +## [1] 0.7071 +``` + + +--- + +## Summary + +- You might be wondering at this point "I've heard of a median before, it didn't require integration. Where's the data?" +- What we're referring to are **population quantities**. Therefore, the median being + discussed is the **population median**. +- A probability model connects the data to the population using assumptions.
+- Therefore, the median we're discussing is the **estimand**; the sample median will be the **estimator**
+
+
 diff --git a/06_StatisticalInference/02_Probability/index.pdf b/06_StatisticalInference/02_Probability/index.pdf new file mode 100644 index 000000000..105568760 Binary files /dev/null and b/06_StatisticalInference/02_Probability/index.pdf differ diff --git a/06_StatisticalInference/03_01_TwoGroupIntervals/index.pdf b/06_StatisticalInference/03_01_TwoGroupIntervals/index.pdf deleted file mode 100644 index a15898d7a..000000000 Binary files a/06_StatisticalInference/03_01_TwoGroupIntervals/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/03_02_HypothesisTesting/index.pdf b/06_StatisticalInference/03_02_HypothesisTesting/index.pdf deleted file mode 100644 index 9e5f2ae42..000000000 Binary files a/06_StatisticalInference/03_02_HypothesisTesting/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/03_03_pValues/index.pdf b/06_StatisticalInference/03_03_pValues/index.pdf deleted file mode 100644 index 85d8bd9d1..000000000 Binary files a/06_StatisticalInference/03_03_pValues/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/03_04_Power/index.pdf b/06_StatisticalInference/03_04_Power/index.pdf deleted file mode 100644 index a5daf8abd..000000000 Binary files a/06_StatisticalInference/03_04_Power/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/03_05_MultipleTesting/index.pdf b/06_StatisticalInference/03_05_MultipleTesting/index.pdf deleted file mode 100644 index 190c24c34..000000000 Binary files a/06_StatisticalInference/03_05_MultipleTesting/index.pdf and /dev/null differ diff --git a/06_StatisticalInference/03_06_resampledInference/index.pdf b/06_StatisticalInference/03_06_resampledInference/index.pdf deleted file mode 100644 index df08fe3df..000000000 Binary files a/06_StatisticalInference/03_06_resampledInference/index.pdf and /dev/null differ diff --git
a/06_StatisticalInference/03_ConditionalProbability/index.Rmd b/06_StatisticalInference/03_ConditionalProbability/index.Rmd new file mode 100644 index 000000000..c33fdafa5 --- /dev/null +++ b/06_StatisticalInference/03_ConditionalProbability/index.Rmd @@ -0,0 +1,221 @@
+---
+title : Conditional Probability
+subtitle : Statistical Inference
+author : Brian Caffo, Jeff Leek, Roger Peng
+job : Johns Hopkins Bloomberg School of Public Health
+logo : bloomberg_shield.png
+framework : io2012 # {io2012, html5slides, shower, dzslides, ...}
+highlighter : highlight.js # {highlight.js, prettify, highlight}
+hitheme : tomorrow #
+url:
+  lib: ../../librariesNew
+  assets: ../../assets
+widgets : [mathjax] # {mathjax, quiz, bootstrap}
+mode : selfcontained # {standalone, draft}
+---
+
+## Conditional probability, motivation
+
+- The probability of getting a one when rolling a (standard) die
+  is usually assumed to be one sixth
+- Suppose you were given the extra information that the die roll
+  was an odd number (hence 1, 3 or 5)
+- *conditional on this new information*, the probability of a
+  one is now one third
+
+---
+
+## Conditional probability, definition
+
+- Let $B$ be an event so that $P(B) > 0$
+- Then the conditional probability of an event $A$ given that $B$ has occurred is
+  $$
+  P(A ~|~ B) = \frac{P(A \cap B)}{P(B)}
+  $$
+- Notice that if $A$ and $B$ are independent (defined later in the lecture), then
+  $$
+  P(A ~|~ B) = \frac{P(A) P(B)}{P(B)} = P(A)
+  $$
+
+---
+
+## Example
+
+- Consider our die roll example
+- $B = \{1, 3, 5\}$
+- $A = \{1\}$
+$$
+  \begin{eqnarray*}
+P(\mbox{one given that roll is odd}) & = & P(A ~|~ B) \\ \\
+  & = & \frac{P(A \cap B)}{P(B)} \\ \\
+  & = & \frac{P(A)}{P(B)} \\ \\
+  & = & \frac{1/6}{3/6} = \frac{1}{3}
+  \end{eqnarray*}
+$$
+
+
+
+---
+
+## Bayes' rule
+Bayes' rule allows us to reverse the conditioning set provided
+that we know some marginal probabilities
+$$
+P(B ~|~ A) = \frac{P(A ~|~ B) P(B)}{P(A ~|~ B) P(B) + P(A ~|~
B^c)P(B^c)}.
+$$
+
+
+---
+
+## Diagnostic tests
+
+- Let $+$ and $-$ be the events that the result of a diagnostic test is positive or negative respectively
+- Let $D$ and $D^c$ be the event that the subject of the test has or does not have the disease respectively
+- The **sensitivity** is the probability that the test is positive given that the subject actually has the disease, $P(+ ~|~ D)$
+- The **specificity** is the probability that the test is negative given that the subject does not have the disease, $P(- ~|~ D^c)$
+
+---
+
+## More definitions
+
+- The **positive predictive value** is the probability that the subject has the disease given that the test is positive, $P(D ~|~ +)$
+- The **negative predictive value** is the probability that the subject does not have the disease given that the test is negative, $P(D^c ~|~ -)$
+- The **prevalence of the disease** is the marginal probability of disease, $P(D)$
+
+---
+
+## More definitions
+
+- The **diagnostic likelihood ratio of a positive test**, labeled $DLR_+$, is $P(+ ~|~ D) / P(+ ~|~ D^c)$, which is the $$sensitivity / (1 - specificity)$$
+- The **diagnostic likelihood ratio of a negative test**, labeled $DLR_-$, is $P(- ~|~ D) / P(- ~|~ D^c)$, which is the $$(1 - sensitivity) / specificity$$
+
+---
+
+## Example
+
+- A study comparing the efficacy of HIV tests reports on an experiment which concluded that HIV antibody tests have a sensitivity of 99.7% and a specificity of 98.5%
+- Suppose that a subject, from a population with a .1% prevalence of HIV, receives a positive test result. What is the positive predictive value?
+- Mathematically, we want $P(D ~|~ +)$ given the sensitivity, $P(+ ~|~ D) = .997$, the specificity, $P(- ~|~ D^c) = .985$, and the prevalence $P(D) = .001$
+
+---
+
+## Using Bayes' formula
+
+$$
+\begin{eqnarray*}
+  P(D ~|~ +) & = &\frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}\\ \\
+ & = & \frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + \{1-P(-~|~D^c)\}\{1 - P(D)\}} \\ \\
+ & = & \frac{.997\times .001}{.997 \times .001 + .015 \times .999}\\ \\
+ & = & .062
+\end{eqnarray*}
+$$
+
+- In this population a positive test result only suggests a 6% probability that the subject has the disease
+- (The positive predictive value is 6% for this test)
+
+---
+
+## More on this example
+
+- The low positive predictive value is due to low prevalence of disease and the somewhat modest specificity
+- Suppose it was known that the subject was an intravenous drug user and routinely had intercourse with an HIV infected partner
+- Notice that the evidence implied by a positive test result does not change because of the prevalence of disease in the subject's population, only our interpretation of that evidence changes
+
+---
+
+## Likelihood ratios
+
+- Using Bayes' rule, we have
+  $$
+  P(D ~|~ +) = \frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}
+  $$
+  and
+  $$
+  P(D^c ~|~ +) = \frac{P(+~|~D^c)P(D^c)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}.
+  $$
+
+---
+
+## Likelihood ratios
+
+- Therefore
+$$
+\frac{P(D ~|~ +)}{P(D^c ~|~ +)} = \frac{P(+~|~D)}{P(+~|~D^c)}\times \frac{P(D)}{P(D^c)}
+$$
+i.e.
+$$
+\mbox{post-test odds of }D = DLR_+\times\mbox{pre-test odds of }D
+$$
+- Similarly, $DLR_-$ relates the decrease in the odds of the
+  disease after a negative test result to the odds of disease prior to
+  the test.
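+
+---
+
+## Bayes' formula in R
+
+The positive predictive value and $DLR_+$ computations above can be sketched in R. (This block is an illustration added in this revision; the helper name `ppv` and its arguments are not part of the original slides.)
+
+```r
+# Positive predictive value from sensitivity, specificity and prevalence,
+# via Bayes' rule: P(D | +) = se * p / (se * p + (1 - sp) * (1 - p))
+ppv <- function(sens, spec, prev) {
+  sens * prev / (sens * prev + (1 - spec) * (1 - prev))
+}
+ppv(.997, .985, .001)  # approximately 0.062, matching the slide
+
+# Diagnostic likelihood ratio of a positive test: sensitivity / (1 - specificity)
+.997 / (1 - .985)      # approximately 66
+```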
+
+---
+
+## HIV example revisited
+
+- Suppose a subject has a positive HIV test
+- $DLR_+ = .997 / (1 - .985) \approx 66$
+- The result of the positive test is that the odds of disease are now 66 times the pretest odds
+- Or, equivalently, the hypothesis of disease is 66 times more supported by the data than the hypothesis of no disease
+
+---
+
+## HIV example revisited
+
+- Suppose that a subject has a negative test result
+- $DLR_- = (1 - .997) / .985 \approx .003$
+- Therefore, the post-test odds of disease are now $.3\%$ of the pretest odds given the negative test.
+- Or, the hypothesis of disease is supported $.003$ times that of the hypothesis of absence of disease given the negative test result
+
+---
+
+## Independence
+
+- Two events $A$ and $B$ are **independent** if $$P(A \cap B) = P(A)P(B)$$
+- Equivalently if $P(A ~|~ B) = P(A)$
+- Two random variables, $X$ and $Y$ are independent if for any two sets $A$ and $B$ $$P([X \in A] \cap [Y \in B]) = P(X\in A)P(Y\in B)$$
+- If $A$ is independent of $B$ then
+  - $A^c$ is independent of $B$
+  - $A$ is independent of $B^c$
+  - $A^c$ is independent of $B^c$
+
+
+---
+
+## Example
+
+- What is the probability of getting two consecutive heads?
+- $A = \{\mbox{Head on flip 1}\}$ ~ $P(A) = .5$
+- $B = \{\mbox{Head on flip 2}\}$ ~ $P(B) = .5$
+- $A \cap B = \{\mbox{Head on flips 1 and 2}\}$
+- $P(A \cap B) = P(A)P(B) = .5 \times .5 = .25$
+
+---
+
+## Example
+
+- Volume 309 of Science reports on a physician who was on trial for expert testimony in a criminal trial
+- Based on an estimated prevalence of sudden infant death syndrome of $1$ out of $8,543$, the physician testified that the probability of a mother having two children with SIDS was $\left(\frac{1}{8,543}\right)^2$
+- The mother on trial was convicted of murder
+
+---
+
+## Example: continued
+
+- Relevant to this discussion, the principal mistake was to *assume* that the events of having SIDS within a family are independent
+- That is, $P(A_1 \cap A_2)$ is not necessarily equal to $P(A_1)P(A_2)$
+- Biological processes that have a believed genetic or familial environmental component, of course, tend to be dependent within families
+- (There are many other statistical points of discussion for this case.)
+
+
+---
+## IID random variables
+
+- Random variables are said to be iid if they are independent and identically distributed
+  - Independent: statistically unrelated to one another
+  - Identically distributed: all having been drawn from the same population distribution
+- iid random variables are the default model for random samples
+- Many of the important theories of statistics are founded on assuming that variables are iid
+- Assuming a random sample and iid will be the default starting point of inference for this class
+
diff --git a/06_StatisticalInference/03_ConditionalProbability/index.html b/06_StatisticalInference/03_ConditionalProbability/index.html new file mode 100644 index 000000000..524f79bf5 --- /dev/null +++ b/06_StatisticalInference/03_ConditionalProbability/index.html @@ -0,0 +1,534 @@ + + + + Conditional Probability + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

Conditional Probability

+

Statistical Inference

+

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

Conditional probability, motivation

+
+
+
    +
  • The probability of getting a one when rolling a (standard) die +is usually assumed to be one sixth
  • +
  • Suppose you were given the extra information that the die roll +was an odd number (hence 1, 3 or 5)
  • +
  • conditional on this new information, the probability of a +one is now one third
  • +
+ +
+ +
+ + +
+

Conditional probability, definition

+
+
+
    +
  • Let \(B\) be an event so that \(P(B) > 0\)
  • +
  • Then the conditional probability of an event \(A\) given that \(B\) has occurred is +\[ +P(A ~|~ B) = \frac{P(A \cap B)}{P(B)} +\]
  • +
  • Notice that if \(A\) and \(B\) are independent (defined later in the lecture), then +\[ +P(A ~|~ B) = \frac{P(A) P(B)}{P(B)} = P(A) +\]
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • Consider our die roll example
  • +
  • \(B = \{1, 3, 5\}\)
  • +
  • \(A = \{1\}\) +\[ +\begin{eqnarray*} +P(\mbox{one given that roll is odd}) & = & P(A ~|~ B) \\ \\ +& = & \frac{P(A \cap B)}{P(B)} \\ \\ +& = & \frac{P(A)}{P(B)} \\ \\ +& = & \frac{1/6}{3/6} = \frac{1}{3} +\end{eqnarray*} +\]
  • +
+ +
+ +
+ + +
+

Bayes' rule

+
+
+

Bayes' rule allows us to reverse the conditioning set provided +that we know some marginal probabilities +\[ +P(B ~|~ A) = \frac{P(A ~|~ B) P(B)}{P(A ~|~ B) P(B) + P(A ~|~ B^c)P(B^c)}. +\]

+ +
+ +
+ + +
+

Diagnostic tests

+
+
+
    +
  • Let \(+\) and \(-\) be the events that the result of a diagnostic test is positive or negative respectively
  • +
  • Let \(D\) and \(D^c\) be the event that the subject of the test has or does not have the disease respectively
  • +
  • The sensitivity is the probability that the test is positive given that the subject actually has the disease, \(P(+ ~|~ D)\)
  • +
  • The specificity is the probability that the test is negative given that the subject does not have the disease, \(P(- ~|~ D^c)\)
  • +
+ +
+ +
+ + +
+

More definitions

+
+
+
    +
  • The positive predictive value is the probability that the subject has the disease given that the test is positive, \(P(D ~|~ +)\)
  • +
  • The negative predictive value is the probability that the subject does not have the disease given that the test is negative, \(P(D^c ~|~ -)\)
  • +
  • The prevalence of the disease is the marginal probability of disease, \(P(D)\)
  • +
+ +
+ +
+ + +
+

More definitions

+
+
+
    +
  • The diagnostic likelihood ratio of a positive test, labeled \(DLR_+\), is \(P(+ ~|~ D) / P(+ ~|~ D^c)\), which is the \[sensitivity / (1 - specificity)\]
  • +
  • The diagnostic likelihood ratio of a negative test, labeled \(DLR_-\), is \(P(- ~|~ D) / P(- ~|~ D^c)\), which is the \[(1 - sensitivity) / specificity\]
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • A study comparing the efficacy of HIV tests reports on an experiment which concluded that HIV antibody tests have a sensitivity of 99.7% and a specificity of 98.5%
  • +
  • Suppose that a subject, from a population with a .1% prevalence of HIV, receives a positive test result. What is the positive predictive value?
  • +
  • Mathematically, we want \(P(D ~|~ +)\) given the sensitivity, \(P(+ ~|~ D) = .997\), the specificity, \(P(- ~|~ D^c) =.985\), and the prevalence \(P(D) = .001\)
  • +
+ +
+ +
+ + +
+

Using Bayes' formula

+
+
+

\[ +\begin{eqnarray*} + P(D ~|~ +) & = &\frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}\\ \\ + & = & \frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + \{1-P(-~|~D^c)\}\{1 - P(D)\}} \\ \\ + & = & \frac{.997\times .001}{.997 \times .001 + .015 \times .999}\\ \\ + & = & .062 +\end{eqnarray*} +\]

+ +
    +
  • In this population a positive test result only suggests a 6% probability that the subject has the disease
  • +
  • (The positive predictive value is 6% for this test)
  • +
+ +
+ +
+ + +
+

More on this example

+
+
+
    +
  • The low positive predictive value is due to low prevalence of disease and the somewhat modest specificity
  • +
  • Suppose it was known that the subject was an intravenous drug user and routinely had intercourse with an HIV infected partner
  • +
  • Notice that the evidence implied by a positive test result does not change because of the prevalence of disease in the subject's population, only our interpretation of that evidence changes
  • +
+ +
+ +
+ + +
+

Likelihood ratios

+
+
+
    +
  • Using Bayes' rule, we have +\[ +P(D ~|~ +) = \frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)} +\] +and +\[ +P(D^c ~|~ +) = \frac{P(+~|~D^c)P(D^c)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}. +\]
  • +
+ +
+ +
+ + +
+

Likelihood ratios

+
+
+
    +
  • Therefore +\[ +\frac{P(D ~|~ +)}{P(D^c ~|~ +)} = \frac{P(+~|~D)}{P(+~|~D^c)}\times \frac{P(D)}{P(D^c)} +\] +i.e. +\[ +\mbox{post-test odds of }D = DLR_+\times\mbox{pre-test odds of }D +\]
  • +
  • Similarly, \(DLR_-\) relates the decrease in the odds of the +disease after a negative test result to the odds of disease prior to +the test.
  • +
+ +
+ +
+ + +
+

HIV example revisited

+
+
+
    +
  • Suppose a subject has a positive HIV test
  • +
  • \(DLR_+ = .997 / (1 - .985) \approx 66\)
  • +
  • The result of the positive test is that the odds of disease are now 66 times the pretest odds
  • +
  • Or, equivalently, the hypothesis of disease is 66 times more supported by the data than the hypothesis of no disease
  • +
+ +
+ +
+ + +
+

HIV example revisited

+
+
+
    +
  • Suppose that a subject has a negative test result
  • +
  • \(DLR_- = (1 - .997) / .985 \approx .003\)
  • +
  • Therefore, the post-test odds of disease are now \(.3\%\) of the pretest odds given the negative test.
  • +
  • Or, the hypothesis of disease is supported \(.003\) times that of the hypothesis of absence of disease given the negative test result
  • +
+ +
+ +
+ + +
+

Independence

+
+
+
    +
  • Two events \(A\) and \(B\) are independent if \[P(A \cap B) = P(A)P(B)\]
  • +
  • Equivalently if \(P(A ~|~ B) = P(A)\)
  • +
  • Two random variables, \(X\) and \(Y\) are independent if for any two sets \(A\) and \(B\) \[P([X \in A] \cap [Y \in B]) = P(X\in A)P(Y\in B)\]
  • +
  • If \(A\) is independent of \(B\) then + +
      +
    • \(A^c\) is independent of \(B\)
    • +
    • \(A\) is independent of \(B^c\)
    • +
    • \(A^c\) is independent of \(B^c\)
    • +
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • What is the probability of getting two consecutive heads?
  • +
  • \(A = \{\mbox{Head on flip 1}\}\) ~ \(P(A) = .5\)
  • +
  • \(B = \{\mbox{Head on flip 2}\}\) ~ \(P(B) = .5\)
  • +
  • \(A \cap B = \{\mbox{Head on flips 1 and 2}\}\)
  • +
  • \(P(A \cap B) = P(A)P(B) = .5 \times .5 = .25\)
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • Volume 309 of Science reports on a physician who was on trial for expert testimony in a criminal trial
  • +
  • Based on an estimated prevalence of sudden infant death syndrome of \(1\) out of \(8,543\), the physician testified that the probability of a mother having two children with SIDS was \(\left(\frac{1}{8,543}\right)^2\)
  • +
  • The mother on trial was convicted of murder
  • +
+ +
+ +
+ + +
+

Example: continued

+
+
+
    +
  • Relevant to this discussion, the principal mistake was to assume that the events of having SIDS within a family are independent
  • +
  • That is, \(P(A_1 \cap A_2)\) is not necessarily equal to \(P(A_1)P(A_2)\)
  • +
  • Biological processes that have a believed genetic or familial environmental component, of course, tend to be dependent within families
  • +
  • (There are many other statistical points of discussion for this case.)
  • +
+ +
+ +
+ + +
+

IID random variables

+
+
+
    +
  • Random variables are said to be iid if they are independent and identically distributed + +
      +
    • Independent: statistically unrelated to one another
    • +
    • Identically distributed: all having been drawn from the same population distribution
    • +
  • +
  • iid random variables are the default model for random samples
  • +
  • Many of the important theories of statistics are founded on assuming that variables are iid
  • +
  • Assuming a random sample and iid will be the default starting point of inference for this class
  • +
+ +
+ +
+ + +
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/03_ConditionalProbability/index.md b/06_StatisticalInference/03_ConditionalProbability/index.md new file mode 100644 index 000000000..c33fdafa5 --- /dev/null +++ b/06_StatisticalInference/03_ConditionalProbability/index.md @@ -0,0 +1,221 @@
+---
+title : Conditional Probability
+subtitle : Statistical Inference
+author : Brian Caffo, Jeff Leek, Roger Peng
+job : Johns Hopkins Bloomberg School of Public Health
+logo : bloomberg_shield.png
+framework : io2012 # {io2012, html5slides, shower, dzslides, ...}
+highlighter : highlight.js # {highlight.js, prettify, highlight}
+hitheme : tomorrow #
+url:
+  lib: ../../librariesNew
+  assets: ../../assets
+widgets : [mathjax] # {mathjax, quiz, bootstrap}
+mode : selfcontained # {standalone, draft}
+---
+
+## Conditional probability, motivation
+
+- The probability of getting a one when rolling a (standard) die
+  is usually assumed to be one sixth
+- Suppose you were given the extra information that the die roll
+  was an odd number (hence 1, 3 or 5)
+- *conditional on this new information*, the probability of a
+  one is now one third
+
+---
+
+## Conditional probability, definition
+
+- Let $B$ be an event so that $P(B) > 0$
+- Then the conditional probability of an event $A$ given that $B$ has occurred is
+  $$
+  P(A ~|~ B) = \frac{P(A \cap B)}{P(B)}
+  $$
+- Notice that if $A$ and $B$ are independent (defined later in the lecture), then
+  $$
+  P(A ~|~ B) = \frac{P(A) P(B)}{P(B)} = P(A)
+  $$
+
+---
+
+## Example
+
+- Consider our die roll example
+- $B = \{1, 3, 5\}$
+- $A = \{1\}$
+$$
+  \begin{eqnarray*}
+P(\mbox{one given that roll is odd}) & = & P(A ~|~ B) \\ \\
+  & = & \frac{P(A \cap B)}{P(B)} \\ \\
+  & = & \frac{P(A)}{P(B)} \\ \\
+  & = & \frac{1/6}{3/6} = \frac{1}{3}
+  \end{eqnarray*}
+$$
+
+
+
+---
+
+## Bayes' rule
+Bayes' rule allows us to reverse the conditioning set provided
+that we know some marginal probabilities
+$$
+P(B
~|~ A) = \frac{P(A ~|~ B) P(B)}{P(A ~|~ B) P(B) + P(A ~|~ B^c)P(B^c)}.
+$$
+
+
+---
+
+## Diagnostic tests
+
+- Let $+$ and $-$ be the events that the result of a diagnostic test is positive or negative respectively
+- Let $D$ and $D^c$ be the event that the subject of the test has or does not have the disease respectively
+- The **sensitivity** is the probability that the test is positive given that the subject actually has the disease, $P(+ ~|~ D)$
+- The **specificity** is the probability that the test is negative given that the subject does not have the disease, $P(- ~|~ D^c)$
+
+---
+
+## More definitions
+
+- The **positive predictive value** is the probability that the subject has the disease given that the test is positive, $P(D ~|~ +)$
+- The **negative predictive value** is the probability that the subject does not have the disease given that the test is negative, $P(D^c ~|~ -)$
+- The **prevalence of the disease** is the marginal probability of disease, $P(D)$
+
+---
+
+## More definitions
+
+- The **diagnostic likelihood ratio of a positive test**, labeled $DLR_+$, is $P(+ ~|~ D) / P(+ ~|~ D^c)$, which is the $$sensitivity / (1 - specificity)$$
+- The **diagnostic likelihood ratio of a negative test**, labeled $DLR_-$, is $P(- ~|~ D) / P(- ~|~ D^c)$, which is the $$(1 - sensitivity) / specificity$$
+
+---
+
+## Example
+
+- A study comparing the efficacy of HIV tests reports on an experiment which concluded that HIV antibody tests have a sensitivity of 99.7% and a specificity of 98.5%
+- Suppose that a subject, from a population with a .1% prevalence of HIV, receives a positive test result. What is the positive predictive value?
+- Mathematically, we want $P(D ~|~ +)$ given the sensitivity, $P(+ ~|~ D) = .997$, the specificity, $P(- ~|~ D^c) = .985$, and the prevalence $P(D) = .001$
+
+---
+
+## Using Bayes' formula
+
+$$
+\begin{eqnarray*}
+  P(D ~|~ +) & = &\frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}\\ \\
+ & = & \frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + \{1-P(-~|~D^c)\}\{1 - P(D)\}} \\ \\
+ & = & \frac{.997\times .001}{.997 \times .001 + .015 \times .999}\\ \\
+ & = & .062
+\end{eqnarray*}
+$$
+
+- In this population a positive test result only suggests a 6% probability that the subject has the disease
+- (The positive predictive value is 6% for this test)
+
+---
+
+## More on this example
+
+- The low positive predictive value is due to low prevalence of disease and the somewhat modest specificity
+- Suppose it was known that the subject was an intravenous drug user and routinely had intercourse with an HIV infected partner
+- Notice that the evidence implied by a positive test result does not change because of the prevalence of disease in the subject's population, only our interpretation of that evidence changes
+
+---
+
+## Likelihood ratios
+
+- Using Bayes' rule, we have
+  $$
+  P(D ~|~ +) = \frac{P(+~|~D)P(D)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}
+  $$
+  and
+  $$
+  P(D^c ~|~ +) = \frac{P(+~|~D^c)P(D^c)}{P(+~|~D)P(D) + P(+~|~D^c)P(D^c)}.
+  $$
+
+---
+
+## Likelihood ratios
+
+- Therefore
+$$
+\frac{P(D ~|~ +)}{P(D^c ~|~ +)} = \frac{P(+~|~D)}{P(+~|~D^c)}\times \frac{P(D)}{P(D^c)}
+$$
+i.e.
+$$
+\mbox{post-test odds of }D = DLR_+\times\mbox{pre-test odds of }D
+$$
+- Similarly, $DLR_-$ relates the decrease in the odds of the
+  disease after a negative test result to the odds of disease prior to
+  the test.
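+
+---
+
+## Checking the PPV by simulation
+
+The Bayes computation above can also be checked by Monte Carlo. (This sketch is an illustration added in this revision, not part of the original slides; the seed and sample size are arbitrary.)
+
+```r
+set.seed(42)
+n <- 1e6
+disease <- rbinom(n, 1, .001)           # prevalence of .001
+test <- ifelse(disease == 1,
+               rbinom(n, 1, .997),      # sensitivity: P(+ | D)
+               rbinom(n, 1, 1 - .985))  # 1 - specificity: P(+ | D^c)
+mean(disease[test == 1])                # close to the .062 from Bayes' formula
+```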
+
+---
+
+## HIV example revisited
+
+- Suppose a subject has a positive HIV test
+- $DLR_+ = .997 / (1 - .985) \approx 66$
+- The result of the positive test is that the odds of disease are now 66 times the pretest odds
+- Or, equivalently, the hypothesis of disease is 66 times more supported by the data than the hypothesis of no disease
+
+---
+
+## HIV example revisited
+
+- Suppose that a subject has a negative test result
+- $DLR_- = (1 - .997) / .985 \approx .003$
+- Therefore, the post-test odds of disease are now $.3\%$ of the pretest odds given the negative test.
+- Or, the hypothesis of disease is supported $.003$ times that of the hypothesis of absence of disease given the negative test result
+
+---
+
+## Independence
+
+- Two events $A$ and $B$ are **independent** if $$P(A \cap B) = P(A)P(B)$$
+- Equivalently if $P(A ~|~ B) = P(A)$
+- Two random variables, $X$ and $Y$ are independent if for any two sets $A$ and $B$ $$P([X \in A] \cap [Y \in B]) = P(X\in A)P(Y\in B)$$
+- If $A$ is independent of $B$ then
+  - $A^c$ is independent of $B$
+  - $A$ is independent of $B^c$
+  - $A^c$ is independent of $B^c$
+
+
+---
+
+## Example
+
+- What is the probability of getting two consecutive heads?
+- $A = \{\mbox{Head on flip 1}\}$ ~ $P(A) = .5$
+- $B = \{\mbox{Head on flip 2}\}$ ~ $P(B) = .5$
+- $A \cap B = \{\mbox{Head on flips 1 and 2}\}$
+- $P(A \cap B) = P(A)P(B) = .5 \times .5 = .25$
+
+---
+
+## Example
+
+- Volume 309 of Science reports on a physician who was on trial for expert testimony in a criminal trial
+- Based on an estimated prevalence of sudden infant death syndrome of $1$ out of $8,543$, the physician testified that the probability of a mother having two children with SIDS was $\left(\frac{1}{8,543}\right)^2$
+- The mother on trial was convicted of murder
+
+---
+
+## Example: continued
+
+- Relevant to this discussion, the principal mistake was to *assume* that the events of having SIDS within a family are independent
+- That is, $P(A_1 \cap A_2)$ is not necessarily equal to $P(A_1)P(A_2)$
+- Biological processes that have a believed genetic or familial environmental component, of course, tend to be dependent within families
+- (There are many other statistical points of discussion for this case.)
+
+
+---
+## IID random variables
+
+- Random variables are said to be iid if they are independent and identically distributed
+  - Independent: statistically unrelated to one another
+  - Identically distributed: all having been drawn from the same population distribution
+- iid random variables are the default model for random samples
+- Many of the important theories of statistics are founded on assuming that variables are iid
+- Assuming a random sample and iid will be the default starting point of inference for this class
+
diff --git a/06_StatisticalInference/03_ConditionalProbability/index.pdf b/06_StatisticalInference/03_ConditionalProbability/index.pdf new file mode 100644 index 000000000..b91f495a9 Binary files /dev/null and b/06_StatisticalInference/03_ConditionalProbability/index.pdf differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/galton.png b/06_StatisticalInference/04_Expectations/assets/fig/galton.png new file mode 100644 index 000000000..19abb675a Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/galton.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/lsm.png b/06_StatisticalInference/04_Expectations/assets/fig/lsm.png new file mode 100644 index 000000000..9a33fef15 Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/lsm.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-1.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-1.png new file mode 100644 index 000000000..c8c6209b8 Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-1.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-11.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-11.png new file mode 100644 index 000000000..bdfdd22df Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-11.png
differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-12.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-12.png new file mode 100644 index 000000000..67d844343 Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-12.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-2.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..480b45d70 Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-2.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-3.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-3.png new file mode 100644 index 000000000..904f824bf Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-3.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-4.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-4.png new file mode 100644 index 000000000..6cb7c0dcc Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-4.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-5.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-5.png new file mode 100644 index 000000000..32e3f0c9b Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-5.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-6.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-6.png new file mode 100644 index 000000000..f574e8155 Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-6.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-7.png 
b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-7.png new file mode 100644 index 000000000..7c2834b0d Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-7.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-8.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-8.png new file mode 100644 index 000000000..60e61f6e8 Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-8.png differ diff --git a/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-9.png b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-9.png new file mode 100644 index 000000000..0dd06658f Binary files /dev/null and b/06_StatisticalInference/04_Expectations/assets/fig/unnamed-chunk-9.png differ diff --git a/06_StatisticalInference/01_03_Expectations/figure/galton.png b/06_StatisticalInference/04_Expectations/figure/galton.png similarity index 100% rename from 06_StatisticalInference/01_03_Expectations/figure/galton.png rename to 06_StatisticalInference/04_Expectations/figure/galton.png diff --git a/06_StatisticalInference/01_03_Expectations/figure/lsm.png b/06_StatisticalInference/04_Expectations/figure/lsm.png similarity index 100% rename from 06_StatisticalInference/01_03_Expectations/figure/lsm.png rename to 06_StatisticalInference/04_Expectations/figure/lsm.png diff --git a/06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-1.png b/06_StatisticalInference/04_Expectations/figure/unnamed-chunk-1.png similarity index 100% rename from 06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-1.png rename to 06_StatisticalInference/04_Expectations/figure/unnamed-chunk-1.png diff --git a/06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-2.png b/06_StatisticalInference/04_Expectations/figure/unnamed-chunk-2.png similarity index 100% rename from 
06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-2.png rename to 06_StatisticalInference/04_Expectations/figure/unnamed-chunk-2.png diff --git a/06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-3.png b/06_StatisticalInference/04_Expectations/figure/unnamed-chunk-3.png similarity index 100% rename from 06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-3.png rename to 06_StatisticalInference/04_Expectations/figure/unnamed-chunk-3.png diff --git a/06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-31.png b/06_StatisticalInference/04_Expectations/figure/unnamed-chunk-31.png similarity index 100% rename from 06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-31.png rename to 06_StatisticalInference/04_Expectations/figure/unnamed-chunk-31.png diff --git a/06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-32.png b/06_StatisticalInference/04_Expectations/figure/unnamed-chunk-32.png similarity index 100% rename from 06_StatisticalInference/01_03_Expectations/figure/unnamed-chunk-32.png rename to 06_StatisticalInference/04_Expectations/figure/unnamed-chunk-32.png diff --git a/06_StatisticalInference/04_Expectations/index.Rmd b/06_StatisticalInference/04_Expectations/index.Rmd new file mode 100644 index 000000000..40a2f598b --- /dev/null +++ b/06_StatisticalInference/04_Expectations/index.Rmd @@ -0,0 +1,226 @@ +--- +title : Expected values +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## Expected values +- Expected values are useful for characterizing a distribution +- The
mean is a characterization of its center +- The variance and standard deviation are characterizations of +how spread out it is +- Our sample expected values (the sample mean and variance) will +estimate the population versions + + +--- +## The population mean +- The **expected value** or **mean** of a random variable is the center of its distribution +- For discrete random variable $X$ with PMF $p(x)$, it is defined as follows + $$ + E[X] = \sum_x xp(x). + $$ + where the sum is taken over the possible values of $x$ +- $E[X]$ represents the center of mass of a collection of locations and weights, $\{x, p(x)\}$ + +--- +## The sample mean +- The sample mean estimates this population mean +- The center of mass of the data is the empirical mean +$$ +\bar X = \sum_{i=1}^n x_i p(x_i) +$$ +where $p(x_i) = 1/n$ + +--- + +## Example +### Find the center of mass of the bars +```{r galton, fig.height=6,fig.width=12, fig.align='center', echo = FALSE, message =FALSE, warning=FALSE} +library(UsingR); data(galton); library(ggplot2) +library(reshape2) +longGalton <- melt(galton, measure.vars = c("child", "parent")) +g <- ggplot(longGalton, aes(x = value)) + geom_histogram(aes(y = ..density.., fill = variable), binwidth=1, colour = "black") + geom_density(size = 2) +g <- g + facet_grid(. 
~ variable) +g +``` + +--- +## Using manipulate +``` +library(manipulate) +myHist <- function(mu){ + g <- ggplot(galton, aes(x = child)) + g <- g + geom_histogram(fill = "salmon", + binwidth=1, aes(y = ..density..), colour = "black") + g <- g + geom_density(size = 2) + g <- g + geom_vline(xintercept = mu, size = 2) + mse <- round(mean((galton$child - mu)^2), 3) + g <- g + labs(title = paste('mu = ', mu, ' MSE = ', mse)) + g +} +manipulate(myHist(mu), mu = slider(62, 74, step = 0.5)) +``` + +--- +## The center of mass is the empirical mean +```{r lsm, dependson="galton",fig.height=7,fig.width=7, fig.align='center', echo = FALSE} + g <- ggplot(galton, aes(x = child)) + g <- g + geom_histogram(fill = "salmon", + binwidth=1, aes(y = ..density..), colour = "black") + g <- g + geom_density(size = 2) + g <- g + geom_vline(xintercept = mean(galton$child), size = 2) + g +``` + + +--- +## Example of a population mean + +- Suppose a coin is flipped and $X$ is declared $0$ or $1$ corresponding to a head or a tail, respectively +- What is the expected value of $X$? + $$ + E[X] = .5 \times 0 + .5 \times 1 = .5 + $$ +- Note, if thought about geometrically, this answer is obvious; if two equal weights are spaced at 0 and 1, the center of mass will be $.5$ + +```{r, echo = FALSE, fig.height=4, fig.width = 6, fig.align='center'} +ggplot(data.frame(x = factor(0 : 1), y = c(.5, .5)), aes(x = x, y = y)) + geom_bar(stat = "identity", colour = 'black', fill = "lightblue") +``` + +--- +## What about a biased coin? + +- Suppose that a random variable, $X$, is so that +$P(X=1) = p$ and $P(X=0) = (1 - p)$ +- (This is a biased coin when $p\neq 0.5$) +- What is its expected value? +$$ +E[X] = 0 * (1 - p) + 1 * p = p +$$ + +--- + +## Example + +- Suppose that a die is rolled and $X$ is the number face up +- What is the expected value of $X$? 
+ $$ + E[X] = 1 \times \frac{1}{6} + 2 \times \frac{1}{6} + + 3 \times \frac{1}{6} + 4 \times \frac{1}{6} + + 5 \times \frac{1}{6} + 6 \times \frac{1}{6} = 3.5 + $$ +- Again, the geometric argument makes this answer obvious without calculation. + +```{r, fig.align='center', echo=FALSE, fig.height=4, fig.width=10} +ggplot(data.frame(x = factor(1 : 6), y = rep(1/6, 6)), aes(x = x, y = y)) + geom_bar(stat = "identity", colour = 'black', fill = "lightblue") +``` + +--- + +## Continuous random variables + +- For a continuous random variable, $X$, with density, $f$, the expected value is again exactly the center of mass of the density + + +--- + +## Example + +- Consider a density where $f(x) = 1$ for $x$ between zero and one +- (Is this a valid density?) +- Suppose that $X$ follows this density; what is its expected value? +```{r, fig.height=6, fig.width=6, echo=FALSE, fig.align='center'} +g <- ggplot(data.frame(x = c(-0.25, 0, 0, 1, 1, 1.25), + y = c(0, 0, 1, 1, 0, 0)), + aes(x = x, y = y)) +g <- g + geom_line(size = 2, colour = "black") +g <- g + labs(title = "Uniform density") +g + +``` + +--- + +## Facts about expected values + +- Recall that expected values are properties of distributions +- Note the average of random variables is itself a random variable +and its associated distribution has an expected value +- The center of this distribution is the same as that of the original distribution +- Therefore, the expected value of the **sample mean** is the population mean that it's trying to estimate +- When the expected value of an estimator is what it's trying to estimate, we say that the estimator is **unbiased** +- Let's try a simulation experiment + +--- +## Simulation experiment +Simulating normals with mean 0 and variance 1 versus averages +of 10 normals from the same population + +```{r, fig.height=6, fig.width=6, fig.align='center', echo = FALSE} +library(ggplot2) +nosim <- 10000; n <- 10 +dat <- data.frame( + x = c(rnorm(nosim), apply(matrix(rnorm(nosim *
n), nosim), 1, mean)), + what = factor(rep(c("Obs", "Mean"), c(nosim, nosim))) + ) +ggplot(dat, aes(x = x, fill = what)) + geom_density(size = 2, alpha = .2); + +``` + +--- +## Averages of x die rolls + +```{r, fig.align='center',fig.height=5, fig.width=10, echo = FALSE, warning=FALSE, error=FALSE, message=FALSE} +dat <- data.frame( + x = c(sample(1 : 6, nosim, replace = TRUE), + apply(matrix(sample(1 : 6, nosim * 2, replace = TRUE), + nosim), 1, mean), + apply(matrix(sample(1 : 6, nosim * 3, replace = TRUE), + nosim), 1, mean), + apply(matrix(sample(1 : 6, nosim * 4, replace = TRUE), + nosim), 1, mean) + ), + size = factor(rep(1 : 4, rep(nosim, 4)))) +g <- ggplot(dat, aes(x = x, fill = size)) + geom_histogram(alpha = .20, binwidth=.25, colour = "black") +g + facet_grid(. ~ size) +``` + + +--- +## Averages of x coin flips +```{r, fig.align='center',fig.height=5, fig.width=10, echo = FALSE, warning=FALSE, error=FALSE, message=FALSE} +dat <- data.frame( + x = c(sample(0 : 1, nosim, replace = TRUE), + apply(matrix(sample(0 : 1, nosim * 10, replace = TRUE), + nosim), 1, mean), + apply(matrix(sample(0 : 1, nosim * 20, replace = TRUE), + nosim), 1, mean), + apply(matrix(sample(0 : 1, nosim * 30, replace = TRUE), + nosim), 1, mean) + ), + size = factor(rep(c(1, 10, 20, 30), rep(nosim, 4)))) +g <- ggplot(dat, aes(x = x, fill = size)) + geom_histogram(alpha = .20, binwidth = 1 / 12, colour = "black"); +g + facet_grid(. 
~ size) +``` + +--- +## Sumarizing what we know +- Expected values are properties of distributions +- The population mean is the center of mass of population +- The sample mean is the center of mass of the observed data +- The sample mean is an estimate of the population mean +- The sample mean is unbiased + - The population mean of its distribution is the mean that it's + trying to estimate +- The more data that goes into the sample mean, the more +concentrated its density / mass function is around the population mean diff --git a/06_StatisticalInference/04_Expectations/index.html b/06_StatisticalInference/04_Expectations/index.html new file mode 100644 index 000000000..174080640 --- /dev/null +++ b/06_StatisticalInference/04_Expectations/index.html @@ -0,0 +1,446 @@ + + + + Expected values + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

Expected values

+

Statistical Inference

+

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

Expected values

+
+
+
    +
  • Expected values are useful for characterizing a distribution
  • +
  • The mean is a characterization of its center
  • +
  • The variance and standard deviation are characterizations of +how spread out it is
  • +
  • Our sample expected values (the sample mean and variance) will +estimate the population versions
  • +
+ +
+ +
+ + +
+

The population mean

+
+
+
    +
  • The expected value or mean of a random variable is the center of its distribution
  • +
  • For discrete random variable \(X\) with PMF \(p(x)\), it is defined as follows +\[ +E[X] = \sum_x xp(x). +\] +where the sum is taken over the possible values of \(x\)
  • +
  • \(E[X]\) represents the center of mass of a collection of locations and weights, \(\{x, p(x)\}\)
  • +
+ +
+ +
+ + +
+

The sample mean

+
+
+
    +
  • The sample mean estimates this population mean
  • +
  • The center of mass of the data is the empirical mean +\[ +\bar X = \sum_{i=1}^n x_i p(x_i) +\] +where \(p(x_i) = 1/n\)
  • +
+ +
+ +
+ + +
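The weighted-sum view of the sample mean can be checked directly in base R; a minimal sketch with made-up data (not part of the slides):

```r
# Sample mean as a center of mass: each observation gets weight 1/n
x <- c(61, 64, 66, 69, 70)            # made-up heights, for illustration only
w <- rep(1 / length(x), length(x))    # equal weights p(x_i) = 1/n
sum(x * w)                            # weighted sum: 66
mean(x)                               # the built-in sample mean agrees: 66
```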
+

Example

+
+
+

Find the center of mass of the bars

+ +

plot of chunk galton

+ +
+ +
+ + +
+

Using manipulate

+
+
+
library(manipulate)
+myHist <- function(mu){
+    g <- ggplot(galton, aes(x = child))
+    g <- g + geom_histogram(fill = "salmon", 
+      binwidth=1, aes(y = ..density..), colour = "black")
+    g <- g + geom_density(size = 2)
+    g <- g + geom_vline(xintercept = mu, size = 2)
+    mse <- round(mean((galton$child - mu)^2), 3)  
+    g <- g + labs(title = paste('mu = ', mu, ' MSE = ', mse))
+    g
+}
+manipulate(myHist(mu), mu = slider(62, 74, step = 0.5))
+
+ +
+ +
+ + +
+

The center of mass is the empirical mean

+
+
+

plot of chunk lsm

+ +
+ +
+ + +
+

Example of a population mean

+
+
+
    +
  • Suppose a coin is flipped and \(X\) is declared \(0\) or \(1\) corresponding to a head or a tail, respectively
  • +
  • What is the expected value of \(X\)? +\[ +E[X] = .5 \times 0 + .5 \times 1 = .5 +\]
  • +
  • Note, if thought about geometrically, this answer is obvious; if two equal weights are spaced at 0 and 1, the center of mass will be \(.5\)
  • +
+ +

plot of chunk unnamed-chunk-1

+ +
+ +
+ + +
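The center-of-mass calculation for the fair coin is a one-liner in R; a small sketch (not slide code):

```r
# E[X] for a fair coin: weights 1/2 at the locations 0 and 1
x <- c(0, 1)
p <- c(0.5, 0.5)
sum(x * p)   # 0.5, the center of mass of two equal weights
```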
+

What about a biased coin?

+
+
+
    +
  • Suppose that a random variable, \(X\), is such that +\(P(X=1) = p\) and \(P(X=0) = (1 - p)\)
  • +
  • (This is a biased coin when \(p\neq 0.5\))
  • +
  • What is its expected value? +\[ +E[X] = 0 * (1 - p) + 1 * p = p +\]
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • Suppose that a die is rolled and \(X\) is the number face up
  • +
  • What is the expected value of \(X\)? +\[ +E[X] = 1 \times \frac{1}{6} + 2 \times \frac{1}{6} + +3 \times \frac{1}{6} + 4 \times \frac{1}{6} + +5 \times \frac{1}{6} + 6 \times \frac{1}{6} = 3.5 +\]
  • +
  • Again, the geometric argument makes this answer obvious without calculation.
  • +
+ +

plot of chunk unnamed-chunk-2

+ +
+ +
+ + +
+

Continuous random variables

+
+
+
    +
  • For a continuous random variable, \(X\), with density, \(f\), the expected value is again exactly the center of mass of the density
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • Consider a density where \(f(x) = 1\) for \(x\) between zero and one
  • +
  • (Is this a valid density?)
  • +
  • Suppose that \(X\) follows this density; what is its expected value?
    +plot of chunk unnamed-chunk-3
  • +
+ +
+ +
+ + +
+

Facts about expected values

+
+
+
    +
  • Recall that expected values are properties of distributions
  • +
  • Note the average of random variables is itself a random variable +and its associated distribution has an expected value
  • +
  • The center of this distribution is the same as that of the original distribution
  • +
  • Therefore, the expected value of the sample mean is the population mean that it's trying to estimate
  • +
  • When the expected value of an estimator is what it's trying to estimate, we say that the estimator is unbiased
  • +
  • Let's try a simulation experiment
  • +
+ +
+ +
+ + +
+

Simulation experiment

+
+
+

Simulating normals with mean 0 and variance 1 versus averages +of 10 normals from the same population

+ +

plot of chunk unnamed-chunk-4

+ +
+ +
+ + +
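The simulation above can be sketched in a few lines of base R (assumed code with an assumed seed, not the slide's hidden chunk); both distributions are centered at the population mean, but the averages are much less spread out:

```r
# Unbiasedness of the sample mean, by simulation
set.seed(42)                # assumed seed, for reproducibility
nosim <- 10000; n <- 10
obs   <- rnorm(nosim)                                     # single standard normals
means <- apply(matrix(rnorm(nosim * n), nosim), 1, mean)  # averages of 10 each
c(mean(obs), mean(means))   # both close to the population mean, 0
c(sd(obs), sd(means))       # the averages are far more concentrated
```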
+

Averages of x die rolls

+
+
+

plot of chunk unnamed-chunk-5

+ +
+ +
+ + +
+

Averages of x coin flips

+
+
+

plot of chunk unnamed-chunk-6

+ +
+ +
+ + +
+

Summarizing what we know

+
+
+
    +
  • Expected values are properties of distributions
  • +
  • The population mean is the center of mass of the population
  • +
  • The sample mean is the center of mass of the observed data
  • +
  • The sample mean is an estimate of the population mean
  • +
  • The sample mean is unbiased + +
      +
    • The population mean of its distribution is the mean that it's +trying to estimate
    • +
  • +
  • The more data that goes into the sample mean, the more +concentrated its density / mass function is around the population mean
  • +
+ +
+ +
+ + +
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/04_Expectations/index.md b/06_StatisticalInference/04_Expectations/index.md new file mode 100644 index 000000000..89fea21a5 --- /dev/null +++ b/06_StatisticalInference/04_Expectations/index.md @@ -0,0 +1,165 @@ +--- +title : Expected values +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## Expected values +- Expected values are useful for characterizing a distribution +- The mean is a characterization of its center +- The variance and standard deviation are characterizations of +how spread out it is +- Our sample expected values (the sample mean and variance) will +estimate the population versions + + +--- +## The population mean +- The **expected value** or **mean** of a random variable is the center of its distribution +- For discrete random variable $X$ with PMF $p(x)$, it is defined as follows + $$ + E[X] = \sum_x xp(x).
+ $$ + where the sum is taken over the possible values of $x$ +- $E[X]$ represents the center of mass of a collection of locations and weights, $\{x, p(x)\}$ + +--- +## The sample mean +- The sample mean estimates this population mean +- The center of mass of the data is the empirical mean +$$ +\bar X = \sum_{i=1}^n x_i p(x_i) +$$ +where $p(x_i) = 1/n$ + +--- + +## Example +### Find the center of mass of the bars +plot of chunk galton + +--- +## Using manipulate +``` +library(manipulate) +myHist <- function(mu){ + g <- ggplot(galton, aes(x = child)) + g <- g + geom_histogram(fill = "salmon", + binwidth=1, aes(y = ..density..), colour = "black") + g <- g + geom_density(size = 2) + g <- g + geom_vline(xintercept = mu, size = 2) + mse <- round(mean((galton$child - mu)^2), 3) + g <- g + labs(title = paste('mu = ', mu, ' MSE = ', mse)) + g +} +manipulate(myHist(mu), mu = slider(62, 74, step = 0.5)) +``` + +--- +## The center of mass is the empirical mean +plot of chunk lsm + + +--- +## Example of a population mean + +- Suppose a coin is flipped and $X$ is declared $0$ or $1$ corresponding to a head or a tail, respectively +- What is the expected value of $X$? + $$ + E[X] = .5 \times 0 + .5 \times 1 = .5 + $$ +- Note, if thought about geometrically, this answer is obvious; if two equal weights are spaced at 0 and 1, the center of mass will be $.5$ + +plot of chunk unnamed-chunk-1 + +--- +## What about a biased coin? + +- Suppose that a random variable, $X$, is so that +$P(X=1) = p$ and $P(X=0) = (1 - p)$ +- (This is a biased coin when $p\neq 0.5$) +- What is its expected value? +$$ +E[X] = 0 * (1 - p) + 1 * p = p +$$ + +--- + +## Example + +- Suppose that a die is rolled and $X$ is the number face up +- What is the expected value of $X$? 
+ $$ + E[X] = 1 \times \frac{1}{6} + 2 \times \frac{1}{6} + + 3 \times \frac{1}{6} + 4 \times \frac{1}{6} + + 5 \times \frac{1}{6} + 6 \times \frac{1}{6} = 3.5 + $$ +- Again, the geometric argument makes this answer obvious without calculation. + +plot of chunk unnamed-chunk-2 + +--- + +## Continuous random variables + +- For a continuous random variable, $X$, with density, $f$, the expected value is again exactly the center of mass of the density + + +--- + +## Example + +- Consider a density where $f(x) = 1$ for $x$ between zero and one +- (Is this a valid density?) +- Suppose that $X$ follows this density; what is its expected value? +plot of chunk unnamed-chunk-3 + +--- + +## Facts about expected values + +- Recall that expected values are properties of distributions +- Note the average of random variables is itself a random variable +and its associated distribution has an expected value +- The center of this distribution is the same as that of the original distribution +- Therefore, the expected value of the **sample mean** is the population mean that it's trying to estimate +- When the expected value of an estimator is what it's trying to estimate, we say that the estimator is **unbiased** +- Let's try a simulation experiment + +--- +## Simulation experiment +Simulating normals with mean 0 and variance 1 versus averages +of 10 normals from the same population + +plot of chunk unnamed-chunk-4 + +--- +## Averages of x die rolls + +plot of chunk unnamed-chunk-5 + + +--- +## Averages of x coin flips +plot of chunk unnamed-chunk-6 + +--- +## Summarizing what we know +- Expected values are properties of distributions +- The population mean is the center of mass of the population +- The sample mean is the center of mass of the observed data +- The sample mean is an estimate of the population mean +- The sample mean is unbiased + - The population mean of its distribution is the mean that it's + trying to estimate +- The more data that goes into the sample mean, the more
+concentrated its density / mass function is around the population mean diff --git a/06_StatisticalInference/04_Expectations/index.pdf b/06_StatisticalInference/04_Expectations/index.pdf new file mode 100644 index 000000000..bade44b67 Binary files /dev/null and b/06_StatisticalInference/04_Expectations/index.pdf differ diff --git a/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-1.png b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-1.png new file mode 100644 index 000000000..7c2834b0d Binary files /dev/null and b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-1.png differ diff --git a/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-10.png b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-10.png new file mode 100644 index 000000000..f904f389c Binary files /dev/null and b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-10.png differ diff --git a/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-11.png b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-11.png new file mode 100644 index 000000000..f904f389c Binary files /dev/null and b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-11.png differ diff --git a/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-2.png b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..348f1f6ff Binary files /dev/null and b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-2.png differ diff --git a/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-3.png b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-3.png new file mode 100644 index 000000000..f22b9b90d Binary files /dev/null and b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-3.png differ diff --git a/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-9.png b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-9.png new file mode 100644 
index 000000000..43f87d1dd Binary files /dev/null and b/06_StatisticalInference/05_Variance/assets/fig/unnamed-chunk-9.png differ diff --git a/06_StatisticalInference/05_Variance/index.Rmd b/06_StatisticalInference/05_Variance/index.Rmd new file mode 100644 index 000000000..97678f49a --- /dev/null +++ b/06_StatisticalInference/05_Variance/index.Rmd @@ -0,0 +1,239 @@ +--- +title : The variance +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## The variance + +- The variance of a random variable is a measure of *spread* +- If $X$ is a random variable with mean $\mu$, the variance of $X$ is defined as + +$$ +Var(X) = E[(X - \mu)^2] = E[X^2] - E[X]^2 +$$ + +- The expected (squared) distance from the mean +- Densities with a higher variance are more spread out than densities with a lower variance +- The square root of the variance is called the **standard deviation** +- The standard deviation has the same units as $X$ + +--- + +## Example + +- What's the variance from the result of a toss of a die? + + - $E[X] = 3.5$ + - $E[X^2] = 1 ^ 2 \times \frac{1}{6} + 2 ^ 2 \times \frac{1}{6} + 3 ^ 2 \times \frac{1}{6} + 4 ^ 2 \times \frac{1}{6} + 5 ^ 2 \times \frac{1}{6} + 6 ^ 2 \times \frac{1}{6} = 15.17$ + +- $Var(X) = E[X^2] - E[X]^2 \approx 2.92$ + +--- + +## Example + +- What's the variance from the result of the toss of a coin with probability of heads (1) of $p$? 
+ + - $E[X] = 0 \times (1 - p) + 1 \times p = p$ + - $E[X^2] = E[X] = p$ + +$$Var(X) = E[X^2] - E[X]^2 = p - p^2 = p(1 - p)$$ + + +--- +## Distributions with increasing variance +```{r, echo = FALSE, fig.height = 6, fig.width = 8, fig.align='center'} +library(ggplot2) +xvals <- seq(-10, 10, by = .01) +dat <- data.frame( + y = c( + dnorm(xvals, mean = 0, sd = 1), + dnorm(xvals, mean = 0, sd = 2), + dnorm(xvals, mean = 0, sd = 3), + dnorm(xvals, mean = 0, sd = 4) + ), + x = rep(xvals, 4), + factor = factor(rep(1 : 4, rep(length(xvals), 4))) +) +ggplot(dat, aes(x = x, y = y, color = factor)) + geom_line(size = 2) +``` + +--- +## The sample variance +- The sample variance is +$$ +S^2 = \frac{\sum_{i=1}^n (X_i - \bar X)^2}{n-1} +$$ +(almost, but not quite, the average squared deviation from +the sample mean) +- It is also a random variable + - It has an associated population distribution + - Its expected value is the population variance + - Its distribution gets more concentrated around the population variance with more data +- Its square root is the sample standard deviation + + +--- +## Simulation experiment +### Simulating from a population with variance 1 + +```{r, fig.height=6, fig.width=6, fig.align='center', echo = FALSE} +library(ggplot2) +nosim <- 10000; +dat <- data.frame( + x = c(apply(matrix(rnorm(nosim * 10), nosim), 1, var), + apply(matrix(rnorm(nosim * 20), nosim), 1, var), + apply(matrix(rnorm(nosim * 30), nosim), 1, var)), + n = factor(rep(c("10", "20", "30"), c(nosim, nosim, nosim))) + ) +ggplot(dat, aes(x = x, fill = n)) + geom_density(size = 2, alpha = .2) + geom_vline(xintercept = 1, size = 2) + +``` + +--- +## Variances of x die rolls +```{r, fig.align='center',fig.height=5, fig.width=10, echo = FALSE, warning=FALSE, error=FALSE, message=FALSE} +dat <- data.frame( + x = c(apply(matrix(sample(1 : 6, nosim * 10, replace = TRUE), + nosim), 1, var), + apply(matrix(sample(1 : 6, nosim * 20, replace = TRUE), + nosim), 1, var), + apply(matrix(sample(1 : 6,
nosim * 30, replace = TRUE), + nosim), 1, var) + ), + size = factor(rep(c(10, 20, 30), rep(nosim, 3)))) +g <- ggplot(dat, aes(x = x, fill = size)) + geom_histogram(alpha = .20, binwidth=.3, colour = "black") +g <- g + geom_vline(xintercept = 2.92, size = 2) +g + facet_grid(. ~ size) +``` + + +--- + +## Recall the mean +- Recall that the average of a random sample from a population +is itself a random variable +- We know that this distribution is centered around the population +mean, $E[\bar X] = \mu$ +- We also know what its variance is $Var(\bar X) = \sigma^2 / n$ +- This is very useful, since we don't have repeat sample means +to get its variance; now we know how it relates to +the population variance +- We call the standard deviation of a statistic a standard error + +--- +## To summarize +- The sample variance, $S^2$, estimates the population variance, $\sigma^2$ +- The distribution of the sample variance is centered around $\sigma^2$ +- The variance of the sample mean is $\sigma^2 / n$ + - Its logical estimate is $s^2 / n$ + - The logical estimate of the standard error is $s / \sqrt{n}$ +- $s$, the standard deviation, talks about how variable the population is +- $s/\sqrt{n}$, the standard error, talks about how variable averages of random samples of size $n$ from the population are + +--- +## Simulation example +Standard normals have variance 1; means of $n$ standard normals +have standard deviation $1/\sqrt{n}$ + +```{r} +nosim <- 1000 +n <- 10 +sd(apply(matrix(rnorm(nosim * n), nosim), 1, mean)) +1 / sqrt(n) +``` + + +--- +## Simulation example +Standard uniforms have variance $1/12$; means of +random samples of $n$ uniforms have sd $1/\sqrt{12 \times n}$ + + +```{r} +nosim <- 1000 +n <- 10 +sd(apply(matrix(runif(nosim * n), nosim), 1, mean)) +1 / sqrt(12 * n) +``` + + +--- +## Simulation example +Poisson(4) have variance $4$; means of +random samples of $n$ Poisson(4) have sd $2/\sqrt{n}$ + + +```{r} +nosim <- 1000 +n <- 10 +sd(apply(matrix(rpois(nosim *
4), nosim), 1, mean)) +2 / sqrt(n) +``` + + +--- +## Simulation example +Fair coin flips have variance $0.25$; means of +random samples of $n$ coin flips have sd $1 / (2 \sqrt{n})$ + + +```{r} +nosim <- 1000 +n <- 10 +sd(apply(matrix(sample(0 : 1, nosim * n, replace = TRUE), + nosim), 1, mean)) +1 / (2 * sqrt(n)) +``` + +--- +## Data example +```{r} +library(UsingR); data(father.son); +x <- father.son$sheight +n<-length(x) +``` + +--- +## Plot of the son's heights +```{r, fig.height=6, fig.width=6, echo=FALSE, fig.align='center'} +g <- ggplot(data = father.son, aes(x = sheight)) +g <- g + geom_histogram(aes(y = ..density..), fill = "lightblue", binwidth=1, colour = "black") +g <- g + geom_density(size = 2, colour = "black") +g +``` + +--- +## Let's interpret these numbers +```{r} +round(c(var(x), var(x) / n, sd(x), sd(x) / sqrt(n)),2) +``` + +```{r, echo = FALSE, fig.height=4, fig.width=4,fig.align='center'} +g +``` + + +--- +## Summarizing what we know about variances +- The sample variance estimates the population variance +- The distribution of the sample variance is centered at +what its estimating +- It gets more concentrated around the population variance with larger sample sizes +- The variance of the sample mean is the population variance +divided by $n$ + - The square root is the standard error +- It turns out that we can say a lot about the distribution of +averages from random samples, +even though we only get one to look at in a given data set diff --git a/06_StatisticalInference/05_Variance/index.html b/06_StatisticalInference/05_Variance/index.html new file mode 100644 index 000000000..d002219ee --- /dev/null +++ b/06_StatisticalInference/05_Variance/index.html @@ -0,0 +1,521 @@ + + + + The variance + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

The variance

+

Statistical Inference

+

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

The variance

+
+
+
    +
  • The variance of a random variable is a measure of spread
  • +
  • If \(X\) is a random variable with mean \(\mu\), the variance of \(X\) is defined as
  • +
+ +

\[ +Var(X) = E[(X - \mu)^2] = E[X^2] - E[X]^2 +\]

+ +
    +
  • The expected (squared) distance from the mean
  • +
  • Densities with a higher variance are more spread out than densities with a lower variance
  • +
  • The square root of the variance is called the standard deviation
  • +
  • The standard deviation has the same units as \(X\)
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • What's the variance from the result of a toss of a die?

    + +
      +
    • \(E[X] = 3.5\)
    • +
    • \(E[X^2] = 1 ^ 2 \times \frac{1}{6} + 2 ^ 2 \times \frac{1}{6} + 3 ^ 2 \times \frac{1}{6} + 4 ^ 2 \times \frac{1}{6} + 5 ^ 2 \times \frac{1}{6} + 6 ^ 2 \times \frac{1}{6} = 15.17\)
    • +
  • +
  • \(Var(X) = E[X^2] - E[X]^2 \approx 2.92\)

  • +
+ +
+ +
+ + +
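The die-roll variance can be verified numerically in base R; a small sketch (not slide code):

```r
# Var(X) = E[X^2] - E[X]^2 for a fair die
x <- 1:6
p <- rep(1/6, 6)
EX  <- sum(x * p)      # 3.5
EX2 <- sum(x^2 * p)    # 91/6, about 15.17
EX2 - EX^2             # 35/12, about 2.92
```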
+

Example

+
+
+
    +
  • What's the variance from the result of the toss of a coin with probability of heads (1) of \(p\)?

    + +
      +
    • \(E[X] = 0 \times (1 - p) + 1 \times p = p\)
    • +
    • \(E[X^2] = E[X] = p\)
    • +
  • +
+ +

\[Var(X) = E[X^2] - E[X]^2 = p - p^2 = p(1 - p)\]

+ +
+ +
+ + +
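The p(1 - p) formula is easy to tabulate; a sketch using a hypothetical helper function (not from the slides):

```r
# Variance of a 0/1 coin with success probability p: since X^2 = X,
# E[X^2] = E[X] = p, so Var(X) = p - p^2 = p(1 - p)
coin_var <- function(p) p * (1 - p)   # hypothetical helper, for illustration
coin_var(0.5)   # 0.25, the maximum spread (fair coin)
coin_var(0.1)   # 0.09, a heavily biased coin varies less
```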
+

Distributions with increasing variance

+
+
+

plot of chunk unnamed-chunk-1

+ +
+ +
+ + +
+

The sample variance

+
+
+
    +
  • The sample variance is +\[ +S^2 = \frac{\sum_{i=1}^n (X_i - \bar X)^2}{n-1} +\] +(almost, but not quite, the average squared deviation from +the sample mean)
  • +
  • It is also a random variable + +
      +
    • It has an associated population distribution
    • +
    • Its expected value is the population variance
    • +
    • Its distribution gets more concentrated around the population variance with more data
    • +
  • +
  • Its square root is the sample standard deviation
  • +
+ +
+ +
+ + +
+

Simulation experiment

+
+
+

Simulating from a population with variance 1

+ +

plot of chunk unnamed-chunk-2

+ +
+ +
+ + +
+

Variances of x die rolls

+
+
+

plot of chunk unnamed-chunk-3

+ +
+ +
+ + +
+

Recall the mean

+
+
+
    +
  • Recall that the average of a random sample from a population +is itself a random variable
  • +
  • We know that this distribution is centered around the population +mean, \(E[\bar X] = \mu\)
  • +
  • We also know what its variance is \(Var(\bar X) = \sigma^2 / n\)
  • +
  • This is very useful, since we don't have repeat sample means +to get its variance; now we know how it relates to +the population variance
  • +
  • We call the standard deviation of a statistic a standard error
  • +

To summarize

  • The sample variance, \(S^2\), estimates the population variance, \(\sigma^2\)
  • The distribution of the sample variance is centered around \(\sigma^2\)
  • The variance of the sample mean is \(\sigma^2 / n\)
    • Its logical estimate is \(S^2 / n\)
    • The logical estimate of the standard error is \(S / \sqrt{n}\)
  • \(S\), the standard deviation, talks about how variable the population is
  • \(S/\sqrt{n}\), the standard error, talks about how variable averages of random samples of size \(n\) from the population are

Simulation example


Standard normals have variance 1; means of \(n\) standard normals have standard deviation \(1/\sqrt{n}\)

nosim <- 1000
n <- 10
sd(apply(matrix(rnorm(nosim * n), nosim), 1, mean))

## [1] 0.3156

1 / sqrt(n)

## [1] 0.3162

Simulation example


Standard uniforms have variance \(1/12\); means of random samples of \(n\) uniforms have sd \(1/\sqrt{12 \times n}\)

nosim <- 1000
n <- 10
sd(apply(matrix(runif(nosim * n), nosim), 1, mean))

## [1] 0.09017

1 / sqrt(12 * n)

## [1] 0.09129

Simulation example


Poisson(4) random variables have variance \(4\); means of random samples of \(n\) Poisson(4) variables have sd \(2/\sqrt{n}\)

nosim <- 1000
n <- 10
sd(apply(matrix(rpois(nosim * n, 4), nosim), 1, mean))

## [1] 0.6219

2 / sqrt(n)

## [1] 0.6325

Simulation example


Fair coin flips have variance \(0.25\); means of random samples of \(n\) coin flips have sd \(1 / (2 \sqrt{n})\)

nosim <- 1000
n <- 10
sd(apply(matrix(sample(0 : 1, nosim * n, replace = TRUE),
                nosim), 1, mean))

## [1] 0.1587

1 / (2 * sqrt(n))

## [1] 0.1581

Data example

library(UsingR)
data(father.son)
x <- father.son$sheight
n <- length(x)

Plot of the son's heights


plot of chunk unnamed-chunk-9


Let's interpret these numbers

round(c(var(x), var(x) / n, sd(x), sd(x) / sqrt(n)), 2)

## [1] 7.92 0.01 2.81 0.09

plot of chunk unnamed-chunk-11


Summarizing what we know about variances

  • The sample variance estimates the population variance
  • The distribution of the sample variance is centered at what it's estimating
  • It gets more concentrated around the population variance with larger sample sizes
  • The variance of the sample mean is the population variance divided by \(n\)
    • The square root is the standard error
  • It turns out that we can say a lot about the distribution of averages from random samples, even though we only get one to look at in a given data set
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/05_Variance/index.md b/06_StatisticalInference/05_Variance/index.md new file mode 100644 index 000000000..ac2361fe3 --- /dev/null +++ b/06_StatisticalInference/05_Variance/index.md @@ -0,0 +1,248 @@ +--- +title : The variance +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## The variance + +- The variance of a random variable is a measure of *spread* +- If $X$ is a random variable with mean $\mu$, the variance of $X$ is defined as + +$$ +Var(X) = E[(X - \mu)^2] = E[X^2] - E[X]^2 +$$ + +- The expected (squared) distance from the mean +- Densities with a higher variance are more spread out than densities with a lower variance +- The square root of the variance is called the **standard deviation** +- The standard deviation has the same units as $X$ + +--- + +## Example + +- What's the variance from the result of a toss of a die? + + - $E[X] = 3.5$ + - $E[X^2] = 1 ^ 2 \times \frac{1}{6} + 2 ^ 2 \times \frac{1}{6} + 3 ^ 2 \times \frac{1}{6} + 4 ^ 2 \times \frac{1}{6} + 5 ^ 2 \times \frac{1}{6} + 6 ^ 2 \times \frac{1}{6} = 15.17$ + +- $Var(X) = E[X^2] - E[X]^2 \approx 2.92$ + +--- + +## Example + +- What's the variance from the result of the toss of a coin with probability of heads (1) of $p$? 
+ + - $E[X] = 0 \times (1 - p) + 1 \times p = p$ + - $E[X^2] = E[X] = p$ + +$$Var(X) = E[X^2] - E[X]^2 = p - p^2 = p(1 - p)$$ + + +--- +## Distributions with increasing variance +plot of chunk unnamed-chunk-1 + +--- +## The sample variance +- The sample variance is +$$ +S^2 = \frac{\sum_{i=1} (X_i - \bar X)^2}{n-1} +$$ +(almost, but not quite, the average squared deviation from +the sample mean) +- It is also a random variable + - It has an associate population distribution + - Its expected value is the population variance + - Its distribution gets more concentrated around the population variance with mroe data +- Its square root is the sample standard deviation + + +--- +## Simulation experiment +### Simulating from a population with variance 1 + +plot of chunk unnamed-chunk-2 + +--- +## Variances of x die rolls +plot of chunk unnamed-chunk-3 + + +--- + +## Recall the mean +- Recall that the average of random sample from a population +is itself a random variable +- We know that this distribution is centered around the population +mean, $E[\bar X] = \mu$ +- We also know what its variance is $Var(\bar X) = \sigma^2 / n$ +- This is very useful, since we don't have repeat sample means +to get its variance; now we know how it relates to +the population variance +- We call the standard deviation of a statistic a standard error + +--- +## To summarize +- The sample variance, $S^2$, estimates the population variance, $\sigma^2$ +- The distribution of the sample variance is centered around $\sigma^2$ +- The variance of the sample mean is $\sigma^2 / n$ + - Its logical estimate is $s^2 / n$ + - The logical estimate of the standard error is $S / \sqrt{n}$ +- $S$, the standard deviation, talks about how variable the population is +- $S/\sqrt{n}$, the standard error, talks about how variable averages of random samples of size $n$ from the population are + +--- +## Simulation example +Standard normals have variance 1; means of $n$ standard normals +have standard deviation 
$1/\sqrt{n}$ + + +```r +nosim <- 1000 +n <- 10 +sd(apply(matrix(rnorm(nosim * n), nosim), 1, mean)) +``` + +``` +## [1] 0.3156 +``` + +```r +1 / sqrt(n) +``` + +``` +## [1] 0.3162 +``` + + +--- +## Simulation example +Standard uniforms have variance $1/12$; means of +random samples of $n$ uniforms have sd $1/\sqrt{12 \times n}$ + + + +```r +nosim <- 1000 +n <- 10 +sd(apply(matrix(runif(nosim * n), nosim), 1, mean)) +``` + +``` +## [1] 0.09017 +``` + +```r +1 / sqrt(12 * n) +``` + +``` +## [1] 0.09129 +``` + + +--- +## Simulation example +Poisson(4) have variance $4$; means of +random samples of $n$ Poisson(4) have sd $2/\sqrt{n}$ + + + +```r +nosim <- 1000 +n <- 10 +sd(apply(matrix(rpois(nosim * n, 4), nosim), 1, mean)) +``` + +``` +## [1] 0.6219 +``` + +```r +2 / sqrt(n) +``` + +``` +## [1] 0.6325 +``` + + +--- +## Simulation example +Fair coin flips have variance $0.25$; means of +random samples of $n$ coin flips have sd $1 / (2 \sqrt{n})$ + + + +```r +nosim <- 1000 +n <- 10 +sd(apply(matrix(sample(0 : 1, nosim * n, replace = TRUE), + nosim), 1, mean)) +``` + +``` +## [1] 0.1587 +``` + +```r +1 / (2 * sqrt(n)) +``` + +``` +## [1] 0.1581 +``` + +--- +## Data example + +```r +library(UsingR); data(father.son); +x <- father.son$sheight +n<-length(x) +``` + +--- +## Plot of the son's heights +plot of chunk unnamed-chunk-9 + +--- +## Let's interpret these numbers + +```r +round(c(var(x), var(x) / n, sd(x), sd(x) / sqrt(n)),2) +``` + +``` +## [1] 7.92 0.01 2.81 0.09 +``` + +plot of chunk unnamed-chunk-11 + + +--- +## Summarizing what we know about variances +- The sample variance estimates the population variance +- The distribution of the sample variance is centered at +what its estimating +- It gets more concentrated around the population variance with larger sample sizes +- The variance of the sample mean is the population variance +divided by $n$ + - The square root is the standard error +- It turns out that we can say a lot about the distribution of +averages from 
random samples, +even though we only get one to look at in a given data set diff --git a/06_StatisticalInference/05_Variance/index.pdf b/06_StatisticalInference/05_Variance/index.pdf new file mode 100644 index 000000000..9fdc1ed32 Binary files /dev/null and b/06_StatisticalInference/05_Variance/index.pdf differ diff --git a/06_StatisticalInference/06_CommonDistros/assets/fig/unnamed-chunk-2.png b/06_StatisticalInference/06_CommonDistros/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..0822baa05 Binary files /dev/null and b/06_StatisticalInference/06_CommonDistros/assets/fig/unnamed-chunk-2.png differ diff --git a/06_StatisticalInference/06_CommonDistros/assets/fig/unnamed-chunk-3.png b/06_StatisticalInference/06_CommonDistros/assets/fig/unnamed-chunk-3.png new file mode 100644 index 000000000..0822baa05 Binary files /dev/null and b/06_StatisticalInference/06_CommonDistros/assets/fig/unnamed-chunk-3.png differ diff --git a/06_StatisticalInference/02_01_CommonDistributions/fig/unnamed-chunk-1.png b/06_StatisticalInference/06_CommonDistros/fig/unnamed-chunk-1.png similarity index 100% rename from 06_StatisticalInference/02_01_CommonDistributions/fig/unnamed-chunk-1.png rename to 06_StatisticalInference/06_CommonDistros/fig/unnamed-chunk-1.png diff --git a/06_StatisticalInference/02_01_CommonDistributions/fig/unnamed-chunk-3.png b/06_StatisticalInference/06_CommonDistros/fig/unnamed-chunk-3.png similarity index 100% rename from 06_StatisticalInference/02_01_CommonDistributions/fig/unnamed-chunk-3.png rename to 06_StatisticalInference/06_CommonDistros/fig/unnamed-chunk-3.png diff --git a/06_StatisticalInference/02_01_CommonDistributions/fig/unnamed-chunk-4.png b/06_StatisticalInference/06_CommonDistros/fig/unnamed-chunk-4.png similarity index 100% rename from 06_StatisticalInference/02_01_CommonDistributions/fig/unnamed-chunk-4.png rename to 06_StatisticalInference/06_CommonDistros/fig/unnamed-chunk-4.png diff --git 
a/06_StatisticalInference/02_01_CommonDistributions/figure/unnamed-chunk-1.png b/06_StatisticalInference/06_CommonDistros/figure/unnamed-chunk-1.png similarity index 100% rename from 06_StatisticalInference/02_01_CommonDistributions/figure/unnamed-chunk-1.png rename to 06_StatisticalInference/06_CommonDistros/figure/unnamed-chunk-1.png diff --git a/06_StatisticalInference/06_CommonDistros/index.Rmd b/06_StatisticalInference/06_CommonDistros/index.Rmd new file mode 100644 index 000000000..3f38a1f54 --- /dev/null +++ b/06_StatisticalInference/06_CommonDistros/index.Rmd @@ -0,0 +1,261 @@ +--- +title : Some Common Distributions +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + + +## The Bernoulli distribution + +- The **Bernoulli distribution** arises as the result of a binary outcome +- Bernoulli random variables take (only) the values 1 and 0 with probabilities of (say) $p$ and $1-p$ respectively +- The PMF for a Bernoulli random variable $X$ is $$P(X = x) = p^x (1 - p)^{1 - x}$$ +- The mean of a Bernoulli random variable is $p$ and the variance is $p(1 - p)$ +- If we let $X$ be a Bernoulli random variable, it is typical to call $X=1$ as a "success" and $X=0$ as a "failure" + + +--- + +## Binomial trials + +- The *binomial random variables* are obtained as the sum of iid Bernoulli trials +- In specific, let $X_1,\ldots,X_n$ be iid Bernoulli$(p)$; then $X = \sum_{i=1}^n X_i$ is a binomial random variable +- The binomial mass function is +$$ +P(X = x) = +\left( +\begin{array}{c} + n \\ x +\end{array} +\right) +p^x(1 - p)^{n-x} +$$ +for $x=0,\ldots,n$ + 
+--- + +## Choose + +- Recall that the notation + $$\left( + \begin{array}{c} + n \\ x + \end{array} + \right) = \frac{n!}{x!(n-x)!} + $$ (read "$n$ choose $x$") counts the number of ways of selecting $x$ items out of $n$ + without replacement disregarding the order of the items + +$$\left( + \begin{array}{c} + n \\ 0 + \end{array} + \right) = +\left( + \begin{array}{c} + n \\ n + \end{array} + \right) = 1 + $$ + +--- + +## Example + +- Suppose a friend has $8$ children (oh my!), $7$ of which are girls and none are twins +- If each gender has an independent $50$% probability for each birth, what's the probability of getting $7$ or more girls out of $8$ births? +$$\left( +\begin{array}{c} + 8 \\ 7 +\end{array} +\right) .5^{7}(1-.5)^{1} ++ +\left( +\begin{array}{c} + 8 \\ 8 +\end{array} +\right) .5^{8}(1-.5)^{0} \approx 0.04 +$$ +```{r} +choose(8, 7) * .5 ^ 8 + choose(8, 8) * .5 ^ 8 +pbinom(6, size = 8, prob = .5, lower.tail = FALSE) +``` + + +--- + +## The normal distribution + +- A random variable is said to follow a **normal** or **Gaussian** distribution with mean $\mu$ and variance $\sigma^2$ if the associated density is + $$ + (2\pi \sigma^2)^{-1/2}e^{-(x - \mu)^2/2\sigma^2} + $$ + If $X$ a RV with this density then $E[X] = \mu$ and $Var(X) = \sigma^2$ +- We write $X\sim \mbox{N}(\mu, \sigma^2)$ +- When $\mu = 0$ and $\sigma = 1$ the resulting distribution is called **the standard normal distribution** +- Standard normal RVs are often labeled $Z$ + +--- +## The standard normal distribution with reference lines +```{r, fig.height=6, fig.width=6, fig.align='center', echo = FALSE} +x <- seq(-3, 3, length = 1000) +library(ggplot2) +g <- ggplot(data.frame(x = x, y = dnorm(x)), + aes(x = x, y = y)) + geom_line(size = 2) +g <- g + geom_vline(xintercept = -3 : 3, size = 2) +g +``` + +--- + +## Facts about the normal density + +If $X \sim \mbox{N}(\mu,\sigma^2)$ then +$$Z = \frac{X -\mu}{\sigma} \sim N(0, 1)$$ + + +If $Z$ is standard normal $$X = \mu + \sigma Z \sim 
\mbox{N}(\mu, \sigma^2)$$ + +--- + +## More facts about the normal density + +1. Approximately $68\%$, $95\%$ and $99\%$ of the normal density lies within $1$, $2$ and $3$ standard deviations from the mean, respectively +2. $-1.28$, $-1.645$, $-1.96$ and $-2.33$ are the $10^{th}$, $5^{th}$, $2.5^{th}$ and $1^{st}$ percentiles of the standard normal distribution respectively +3. By symmetry, $1.28$, $1.645$, $1.96$ and $2.33$ are the $90^{th}$, $95^{th}$, $97.5^{th}$ and $99^{th}$ percentiles of the standard normal distribution respectively + +--- + +## Question + +- What is the $95^{th}$ percentile of a $N(\mu, \sigma^2)$ distribution? + - Quick answer in R `qnorm(.95, mean = mu, sd = sd)` +- Or, because you have the standard normal quantiles memorized +and you know that 1.645 is the 95th percentile you know that the answer has to be +$$\mu + \sigma 1.645$$ +- (In general $\mu + \sigma z_0$ where $z_0$ is the appropriate standard normal quantile) + +--- + +## Question + +- What is the probability that a $\mbox{N}(\mu,\sigma^2)$ RV is larger than $x$? + +--- +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. What's the probability of getting +more than 1,160 clicks in a day? + +--- + +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. What's the probability of getting +more than 1,160 clicks in a day? + +It's not very likely, 1,160 is `r (1160 - 1020) / 50` standard +deviations from the mean +```{r} +pnorm(1160, mean = 1020, sd = 50, lower.tail = FALSE) +pnorm(2.8, lower.tail = FALSE) +``` + +--- + +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. 
What number of daily ad clicks would represent +the one where 75% of days have fewer clicks (assuming +days are independent and identically distributed)? + +--- + +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. What number of daily ad clicks would represent +the one where 75% of days have fewer clicks (assuming +days are independent and identically distributed)? + +```{r} +qnorm(0.75, mean = 1020, sd = 50) +``` + +--- +## The Poisson distribution +* Used to model counts +* The Poisson mass function is +$$ +P(X = x; \lambda) = \frac{\lambda^x e^{-\lambda}}{x!} +$$ +for $x=0,1,\ldots$ +* The mean of this distribution is $\lambda$ +* The variance of this distribution is $\lambda$ +* Notice that $x$ ranges from $0$ to $\infty$ + +--- +## Some uses for the Poisson distribution +* Modeling count data +* Modeling event-time or survival data +* Modeling contingency tables +* Approximating binomials when $n$ is large and $p$ is small + +--- +## Rates and Poisson random variables +* Poisson random variables are used to model rates +* $X \sim Poisson(\lambda t)$ where + * $\lambda = E[X / t]$ is the expected count per unit of time + * $t$ is the total monitoring time + +--- +## Example +The number of people that show up at a bus stop is Poisson with +a mean of $2.5$ per hour. + +If watching the bus stop for 4 hours, what is the probability that $3$ +or fewer people show up for the whole time? + +```{r} +ppois(3, lambda = 2.5 * 4) +``` + +--- +## Poisson approximation to the binomial +* When $n$ is large and $p$ is small the Poisson distribution + is an accurate approximation to the binomial distribution +* Notation + * $X \sim \mbox{Binomial}(n, p)$ + * $\lambda = n p$ + * $n$ gets large + * $p$ gets small + + +--- +## Example, Poisson approximation to the binomial + +We flip a coin with success probablity $0.01$ five hundred times. 
+ +What's the probability of 2 or fewer successes? + +```{r} +pbinom(2, size = 500, prob = .01) +ppois(2, lambda=500 * .01) +``` + diff --git a/06_StatisticalInference/06_CommonDistros/index.html b/06_StatisticalInference/06_CommonDistros/index.html new file mode 100644 index 000000000..0835dd5a9 --- /dev/null +++ b/06_StatisticalInference/06_CommonDistros/index.html @@ -0,0 +1,608 @@ + + + + Some Common Distributions + + + + + + + + + + + + + + + + + + + + + + + + + + +

Some Common Distributions


Statistical Inference


Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health


The Bernoulli distribution

  • The Bernoulli distribution arises as the result of a binary outcome
  • Bernoulli random variables take (only) the values 1 and 0 with probabilities of (say) \(p\) and \(1-p\) respectively
  • The PMF for a Bernoulli random variable \(X\) is \[P(X = x) = p^x (1 - p)^{1 - x}\]
  • The mean of a Bernoulli random variable is \(p\) and the variance is \(p(1 - p)\)
  • If we let \(X\) be a Bernoulli random variable, it is typical to call \(X=1\) a "success" and \(X=0\) a "failure"
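Since a Bernoulli variable is a Binomial(1, \(p\)) variable, the PMF above can be checked against `dbinom` (a sketch; `p = 0.3` is an arbitrary choice):

```r
# Bernoulli PMF p^x (1 - p)^(1 - x) versus dbinom with size = 1
p <- 0.3
x <- c(0, 1)
p^x * (1 - p)^(1 - x)          # 0.7 0.3
dbinom(x, size = 1, prob = p)  # same values
```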

Binomial trials

  • The binomial random variables are obtained as the sum of iid Bernoulli trials
  • Specifically, let \(X_1,\ldots,X_n\) be iid Bernoulli\((p)\); then \(X = \sum_{i=1}^n X_i\) is a binomial random variable
  • The binomial mass function is \[ P(X = x) = \left( \begin{array}{c} n \\ x \end{array} \right) p^x(1 - p)^{n-x} \] for \(x=0,\ldots,n\)
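The mass function written out above agrees with R's `dbinom`, and the probabilities sum to 1 (a quick sanity sketch, with \(n = 8\), \(p = 0.5\) as in the example below):

```r
# Binomial mass function, written out versus dbinom
n <- 8; p <- 0.5
x <- 0:n
manual <- choose(n, x) * p^x * (1 - p)^(n - x)
all.equal(manual, dbinom(x, size = n, prob = p))  # TRUE
sum(manual)                                       # probabilities sum to 1
```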

Choose

  • Recall that the notation \[\left( \begin{array}{c} n \\ x \end{array} \right) = \frac{n!}{x!(n-x)!} \] (read "\(n\) choose \(x\)") counts the number of ways of selecting \(x\) items out of \(n\) without replacement, disregarding the order of the items

\[\left( \begin{array}{c} n \\ 0 \end{array} \right) = \left( \begin{array}{c} n \\ n \end{array} \right) = 1 \]


Example

  • Suppose a friend has \(8\) children (oh my!), \(7\) of which are girls and none are twins
  • If each gender has an independent \(50\)% probability for each birth, what's the probability of getting \(7\) or more girls out of \(8\) births? \[\left( \begin{array}{c} 8 \\ 7 \end{array} \right) .5^{7}(1-.5)^{1} + \left( \begin{array}{c} 8 \\ 8 \end{array} \right) .5^{8}(1-.5)^{0} \approx 0.04 \]
choose(8, 7) * 0.5^8 + choose(8, 8) * 0.5^8

## [1] 0.03516

pbinom(6, size = 8, prob = 0.5, lower.tail = FALSE)

## [1] 0.03516

The normal distribution

  • A random variable is said to follow a normal or Gaussian distribution with mean \(\mu\) and variance \(\sigma^2\) if the associated density is \[ (2\pi \sigma^2)^{-1/2}e^{-(x - \mu)^2/2\sigma^2} \] If \(X\) is a RV with this density then \(E[X] = \mu\) and \(Var(X) = \sigma^2\)
  • We write \(X\sim \mbox{N}(\mu, \sigma^2)\)
  • When \(\mu = 0\) and \(\sigma = 1\) the resulting distribution is called the standard normal distribution
  • Standard normal RVs are often labeled \(Z\)

The standard normal distribution with reference lines


plot of chunk unnamed-chunk-2


Facts about the normal density

If \(X \sim \mbox{N}(\mu,\sigma^2)\) then \[Z = \frac{X -\mu}{\sigma} \sim N(0, 1)\]

If \(Z\) is standard normal \[X = \mu + \sigma Z \sim \mbox{N}(\mu, \sigma^2)\]
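Standardizing means any normal probability can be computed from the standard normal; a sketch using the ad-click numbers that appear in the examples later in this deck:

```r
# P(X <= x) for X ~ N(mu, sigma^2) equals P(Z <= (x - mu) / sigma)
mu <- 1020; sigma <- 50; x <- 1160
pnorm(x, mean = mu, sd = sigma)
pnorm((x - mu) / sigma)  # same value: (1160 - 1020) / 50 = 2.8 sds
```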


More facts about the normal density

  1. Approximately \(68\%\), \(95\%\) and \(99\%\) of the normal density lies within \(1\), \(2\) and \(3\) standard deviations from the mean, respectively
  2. \(-1.28\), \(-1.645\), \(-1.96\) and \(-2.33\) are the \(10^{th}\), \(5^{th}\), \(2.5^{th}\) and \(1^{st}\) percentiles of the standard normal distribution, respectively
  3. By symmetry, \(1.28\), \(1.645\), \(1.96\) and \(2.33\) are the \(90^{th}\), \(95^{th}\), \(97.5^{th}\) and \(99^{th}\) percentiles of the standard normal distribution, respectively
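The memorized percentiles listed above can be recovered with `qnorm` (a quick sketch):

```r
# Lower-tail standard normal quantiles
round(qnorm(c(0.10, 0.05, 0.025, 0.01)), 2)  # -1.28 -1.64 -1.96 -2.33
# Upper-tail quantiles, by symmetry
round(qnorm(c(0.90, 0.95, 0.975, 0.99)), 2)  #  1.28  1.64  1.96  2.33
```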

Question

  • What is the \(95^{th}\) percentile of a \(N(\mu, \sigma^2)\) distribution?
    • Quick answer in R: qnorm(.95, mean = mu, sd = sd)
  • Or, because you have the standard normal quantiles memorized and you know that 1.645 is the 95th percentile, you know that the answer has to be \[\mu + 1.645 \sigma\]
  • (In general \(\mu + \sigma z_0\) where \(z_0\) is the appropriate standard normal quantile)

Question

  • What is the probability that a \(\mbox{N}(\mu,\sigma^2)\) RV is larger than \(x\)?

Example


Assume that the number of daily ad clicks for a company is (approximately) normally distributed with a mean of 1020 and a standard deviation of 50. What's the probability of getting more than 1,160 clicks in a day?


Example


Assume that the number of daily ad clicks for a company is (approximately) normally distributed with a mean of 1020 and a standard deviation of 50. What's the probability of getting more than 1,160 clicks in a day?

It's not very likely: 1,160 is 2.8 standard deviations from the mean

pnorm(1160, mean = 1020, sd = 50, lower.tail = FALSE)

## [1] 0.002555

pnorm(2.8, lower.tail = FALSE)

## [1] 0.002555

Example


Assume that the number of daily ad clicks for a company is (approximately) normally distributed with a mean of 1020 and a standard deviation of 50. What number of daily ad clicks would represent the one where 75% of days have fewer clicks (assuming days are independent and identically distributed)?


Example


Assume that the number of daily ad clicks for a company is (approximately) normally distributed with a mean of 1020 and a standard deviation of 50. What number of daily ad clicks would represent the one where 75% of days have fewer clicks (assuming days are independent and identically distributed)?

qnorm(0.75, mean = 1020, sd = 50)

## [1] 1054

The Poisson distribution

  • Used to model counts
  • The Poisson mass function is \[ P(X = x; \lambda) = \frac{\lambda^x e^{-\lambda}}{x!} \] for \(x=0,1,\ldots\)
  • The mean of this distribution is \(\lambda\)
  • The variance of this distribution is \(\lambda\)
  • Notice that \(x\) ranges from \(0\) to \(\infty\)
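The mean-equals-variance property is easy to see by simulation (a sketch; \(\lambda = 4\) matches the Poisson(4) simulation earlier in the deck):

```r
# For Poisson(lambda), both the mean and the variance equal lambda
set.seed(3)
lambda <- 4
draws <- rpois(100000, lambda)
mean(draws)  # close to 4
var(draws)   # also close to 4
```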

Some uses for the Poisson distribution

  • Modeling count data
  • Modeling event-time or survival data
  • Modeling contingency tables
  • Approximating binomials when \(n\) is large and \(p\) is small

Rates and Poisson random variables

  • Poisson random variables are used to model rates
  • \(X \sim Poisson(\lambda t)\) where
    • \(\lambda = E[X / t]\) is the expected count per unit of time
    • \(t\) is the total monitoring time

Example


The number of people that show up at a bus stop is Poisson with a mean of \(2.5\) per hour.

If watching the bus stop for 4 hours, what is the probability that \(3\) or fewer people show up for the whole time?

ppois(3, lambda = 2.5 * 4)

## [1] 0.01034

Poisson approximation to the binomial

  • When \(n\) is large and \(p\) is small the Poisson distribution is an accurate approximation to the binomial distribution
  • Notation
    • \(X \sim \mbox{Binomial}(n, p)\)
    • \(\lambda = n p\)
    • \(n\) gets large
    • \(p\) gets small

Example, Poisson approximation to the binomial


We flip a coin with success probability \(0.01\) five hundred times.


What's the probability of 2 or fewer successes?

pbinom(2, size = 500, prob = 0.01)

## [1] 0.1234

ppois(2, lambda = 500 * 0.01)

## [1] 0.1247
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/06_CommonDistros/index.md b/06_StatisticalInference/06_CommonDistros/index.md new file mode 100644 index 000000000..744cdd43f --- /dev/null +++ b/06_StatisticalInference/06_CommonDistros/index.md @@ -0,0 +1,306 @@ +--- +title : Some Common Distributions +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + + +## The Bernoulli distribution + +- The **Bernoulli distribution** arises as the result of a binary outcome +- Bernoulli random variables take (only) the values 1 and 0 with probabilities of (say) $p$ and $1-p$ respectively +- The PMF for a Bernoulli random variable $X$ is $$P(X = x) = p^x (1 - p)^{1 - x}$$ +- The mean of a Bernoulli random variable is $p$ and the variance is $p(1 - p)$ +- If we let $X$ be a Bernoulli random variable, it is typical to call $X=1$ as a "success" and $X=0$ as a "failure" + + +--- + +## Binomial trials + +- The *binomial random variables* are obtained as the sum of iid Bernoulli trials +- In specific, let $X_1,\ldots,X_n$ be iid Bernoulli$(p)$; then $X = \sum_{i=1}^n X_i$ is a binomial random variable +- The binomial mass function is +$$ +P(X = x) = +\left( +\begin{array}{c} + n \\ x +\end{array} +\right) +p^x(1 - p)^{n-x} +$$ +for $x=0,\ldots,n$ + +--- + +## Choose + +- Recall that the notation + $$\left( + \begin{array}{c} + n \\ x + \end{array} + \right) = \frac{n!}{x!(n-x)!} + $$ (read "$n$ choose $x$") counts the number of ways of selecting $x$ items out of $n$ + without replacement disregarding the order of the items + 
+$$\left( + \begin{array}{c} + n \\ 0 + \end{array} + \right) = +\left( + \begin{array}{c} + n \\ n + \end{array} + \right) = 1 + $$ + +--- + +## Example + +- Suppose a friend has $8$ children (oh my!), $7$ of which are girls and none are twins +- If each gender has an independent $50$% probability for each birth, what's the probability of getting $7$ or more girls out of $8$ births? +$$\left( +\begin{array}{c} + 8 \\ 7 +\end{array} +\right) .5^{7}(1-.5)^{1} ++ +\left( +\begin{array}{c} + 8 \\ 8 +\end{array} +\right) .5^{8}(1-.5)^{0} \approx 0.04 +$$ + +```r +choose(8, 7) * 0.5^8 + choose(8, 8) * 0.5^8 +``` + +``` +## [1] 0.03516 +``` + +```r +pbinom(6, size = 8, prob = 0.5, lower.tail = FALSE) +``` + +``` +## [1] 0.03516 +``` + + + +--- + +## The normal distribution + +- A random variable is said to follow a **normal** or **Gaussian** distribution with mean $\mu$ and variance $\sigma^2$ if the associated density is + $$ + (2\pi \sigma^2)^{-1/2}e^{-(x - \mu)^2/2\sigma^2} + $$ + If $X$ a RV with this density then $E[X] = \mu$ and $Var(X) = \sigma^2$ +- We write $X\sim \mbox{N}(\mu, \sigma^2)$ +- When $\mu = 0$ and $\sigma = 1$ the resulting distribution is called **the standard normal distribution** +- Standard normal RVs are often labeled $Z$ + +--- +## The standard normal distribution with reference lines +plot of chunk unnamed-chunk-2 + + +--- + +## Facts about the normal density + +If $X \sim \mbox{N}(\mu,\sigma^2)$ then +$$Z = \frac{X -\mu}{\sigma} \sim N(0, 1)$$ + + +If $Z$ is standard normal $$X = \mu + \sigma Z \sim \mbox{N}(\mu, \sigma^2)$$ + +--- + +## More facts about the normal density + +1. Approximately $68\%$, $95\%$ and $99\%$ of the normal density lies within $1$, $2$ and $3$ standard deviations from the mean, respectively +2. $-1.28$, $-1.645$, $-1.96$ and $-2.33$ are the $10^{th}$, $5^{th}$, $2.5^{th}$ and $1^{st}$ percentiles of the standard normal distribution respectively +3. 
By symmetry, $1.28$, $1.645$, $1.96$ and $2.33$ are the $90^{th}$, $95^{th}$, $97.5^{th}$ and $99^{th}$ percentiles of the standard normal distribution respectively + +--- + +## Question + +- What is the $95^{th}$ percentile of a $N(\mu, \sigma^2)$ distribution? + - Quick answer in R `qnorm(.95, mean = mu, sd = sd)` +- Or, because you have the standard normal quantiles memorized +and you know that 1.645 is the 95th percentile you know that the answer has to be +$$\mu + \sigma 1.645$$ +- (In general $\mu + \sigma z_0$ where $z_0$ is the appropriate standard normal quantile) + +--- + +## Question + +- What is the probability that a $\mbox{N}(\mu,\sigma^2)$ RV is larger than $x$? + +--- +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. What's the probability of getting +more than 1,160 clicks in a day? + +--- + +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. What's the probability of getting +more than 1,160 clicks in a day? + +It's not very likely, 1,160 is 2.8 standard +deviations from the mean + +```r +pnorm(1160, mean = 1020, sd = 50, lower.tail = FALSE) +``` + +``` +## [1] 0.002555 +``` + +```r +pnorm(2.8, lower.tail = FALSE) +``` + +``` +## [1] 0.002555 +``` + + +--- + +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. What number of daily ad clicks would represent +the one where 75% of days have fewer clicks (assuming +days are independent and identically distributed)? + +--- + +## Example + +Assume that the number of daily ad clicks for a company +is (approximately) normally distributed with a mean of 1020 and a standard +deviation of 50. 
What number of daily ad clicks would represent +the point where 75% of days have fewer clicks (assuming +days are independent and identically distributed)? + + +```r +qnorm(0.75, mean = 1020, sd = 50) +``` + +``` +## [1] 1054 +``` + + +--- +## The Poisson distribution +* Used to model counts +* The Poisson mass function is +$$ +P(X = x; \lambda) = \frac{\lambda^x e^{-\lambda}}{x!} +$$ +for $x=0,1,\ldots$ +* The mean of this distribution is $\lambda$ +* The variance of this distribution is $\lambda$ +* Notice that $x$ ranges from $0$ to $\infty$ + +--- +## Some uses for the Poisson distribution +* Modeling count data +* Modeling event-time or survival data +* Modeling contingency tables +* Approximating binomials when $n$ is large and $p$ is small + +--- +## Rates and Poisson random variables +* Poisson random variables are used to model rates +* $X \sim Poisson(\lambda t)$ where + * $\lambda = E[X / t]$ is the expected count per unit of time + * $t$ is the total monitoring time + +--- +## Example +The number of people that show up at a bus stop is Poisson with +a mean of $2.5$ per hour. + +If we watch the bus stop for 4 hours, what is the probability that $3$ +or fewer people show up for the whole time? + + +```r +ppois(3, lambda = 2.5 * 4) +``` + +``` +## [1] 0.01034 +``` + + +--- +## Poisson approximation to the binomial +* When $n$ is large and $p$ is small the Poisson distribution + is an accurate approximation to the binomial distribution +* Notation + * $X \sim \mbox{Binomial}(n, p)$ + * $\lambda = n p$ + * $n$ gets large + * $p$ gets small + + +--- +## Example, Poisson approximation to the binomial + +We flip a coin with success probability $0.01$ five hundred times. + +What's the probability of 2 or fewer successes? 
+ + +```r +pbinom(2, size = 500, prob = 0.01) +``` + +``` +## [1] 0.1234 +``` + +```r +ppois(2, lambda = 500 * 0.01) +``` + +``` +## [1] 0.1247 +``` + + diff --git a/06_StatisticalInference/06_CommonDistros/index.pdf b/06_StatisticalInference/06_CommonDistros/index.pdf new file mode 100644 index 000000000..1d98f72f9 Binary files /dev/null and b/06_StatisticalInference/06_CommonDistros/index.pdf differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-1.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-1.png new file mode 100644 index 000000000..433cd180b Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-1.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-10.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-10.png new file mode 100644 index 000000000..3c28ac41f Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-10.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-11.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-11.png new file mode 100644 index 000000000..207257860 Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-11.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-12.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-12.png new file mode 100644 index 000000000..b2a86f37c Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-12.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-13.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-13.png new file mode 100644 index 000000000..a104e90bd Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-13.png differ diff --git 
a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-14.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-14.png new file mode 100644 index 000000000..4b2ba14ef Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-14.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-16.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-16.png new file mode 100644 index 000000000..f122776c5 Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-16.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-17.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-17.png new file mode 100644 index 000000000..949dc88db Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-17.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-18.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-18.png new file mode 100644 index 000000000..858b8550b Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-18.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-2.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..79975fffe Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-2.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-3.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-3.png new file mode 100644 index 000000000..417ca9044 Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-3.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-4.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-4.png new file 
mode 100644 index 000000000..79799d5f9 Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-4.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-5.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-5.png new file mode 100644 index 000000000..1a423b2f0 Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-5.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-6.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-6.png new file mode 100644 index 000000000..eb8177f97 Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-6.png differ diff --git a/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-9.png b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-9.png new file mode 100644 index 000000000..aadb6890e Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/assets/fig/unnamed-chunk-9.png differ diff --git a/06_StatisticalInference/07_Asymptopia/fig/Thumbs.db b/06_StatisticalInference/07_Asymptopia/fig/Thumbs.db new file mode 100644 index 000000000..961350486 Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/fig/Thumbs.db differ diff --git a/06_StatisticalInference/07_Asymptopia/fig/quincunx.png b/06_StatisticalInference/07_Asymptopia/fig/quincunx.png new file mode 100644 index 000000000..2d77ba0cb Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/fig/quincunx.png differ diff --git a/06_StatisticalInference/02_02_Asymptopia/fig/unnamed-chunk-1.png b/06_StatisticalInference/07_Asymptopia/fig/unnamed-chunk-1.png similarity index 100% rename from 06_StatisticalInference/02_02_Asymptopia/fig/unnamed-chunk-1.png rename to 06_StatisticalInference/07_Asymptopia/fig/unnamed-chunk-1.png diff --git a/06_StatisticalInference/02_02_Asymptopia/fig/unnamed-chunk-2.png 
b/06_StatisticalInference/07_Asymptopia/fig/unnamed-chunk-2.png similarity index 100% rename from 06_StatisticalInference/02_02_Asymptopia/fig/unnamed-chunk-2.png rename to 06_StatisticalInference/07_Asymptopia/fig/unnamed-chunk-2.png diff --git a/06_StatisticalInference/02_02_Asymptopia/fig/unnamed-chunk-3.png b/06_StatisticalInference/07_Asymptopia/fig/unnamed-chunk-3.png similarity index 100% rename from 06_StatisticalInference/02_02_Asymptopia/fig/unnamed-chunk-3.png rename to 06_StatisticalInference/07_Asymptopia/fig/unnamed-chunk-3.png diff --git a/06_StatisticalInference/07_Asymptopia/index.Rmd b/06_StatisticalInference/07_Asymptopia/index.Rmd new file mode 100644 index 000000000..56527da6c --- /dev/null +++ b/06_StatisticalInference/07_Asymptopia/index.Rmd @@ -0,0 +1,405 @@ +--- +title : A trip to Asymptopia +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## Asymptotics +* Asymptotics is the term for the behavior of statistics as the sample size (or some other relevant quantity) limits to infinity (or some other relevant number) +* (Asymptopia is my name for the land of asymptotics, where everything works out well and there are no messes. The land of infinite data is nice that way.) 
+* Asymptotics are incredibly useful for simple statistical inference and approximations +* (Not covered in this class) Asymptotics often lead to nice understanding of procedures +* Asymptotics generally give no assurances about finite sample performance +* Asymptotics form the basis for frequency interpretation of probabilities + (the long run proportion of times an event occurs) + + +--- + +## Limits of random variables + +- Fortunately, for the sample mean there's a set of powerful results +- These results allow us to talk about the large sample distribution +of sample means of a collection of $iid$ observations +- The first of these results we intuitively know + - It says that the average limits to what it's estimating, the population mean + - It's called the Law of Large Numbers + - Example $\bar X_n$ could be the average of the result of $n$ coin flips (i.e. the sample proportion of heads) + - As we flip a fair coin over and over, it eventually converges to the + true probability of a head + The LLN forms the basis of frequency style thinking + + +--- +## Law of large numbers in action +```{r, fig.height=5, fig.width=5} +n <- 10000; means <- cumsum(rnorm(n)) / (1 : n); library(ggplot2) +g <- ggplot(data.frame(x = 1 : n, y = means), aes(x = x, y = y)) +g <- g + geom_hline(yintercept = 0) + geom_line(size = 2) +g <- g + labs(x = "Number of obs", y = "Cumulative mean") +g +``` + + +--- +## Law of large numbers in action, coin flip +```{r, fig.height=5, fig.width=5} +means <- cumsum(sample(0 : 1, n , replace = TRUE)) / (1 : n) +g <- ggplot(data.frame(x = 1 : n, y = means), aes(x = x, y = y)) +g <- g + geom_hline(yintercept = 0.5) + geom_line(size = 2) +g <- g + labs(x = "Number of obs", y = "Cumulative mean") +g +``` + + + +--- +## Discussion +- An estimator is **consistent** if it converges to what you want to estimate + - The LLN says that the sample mean of iid sample is + consistent for the population mean + - Typically, good estimators are consistent; it's 
not too much to ask that if we go to the trouble of collecting an infinite amount of data that we get the right answer +- The sample variance and the sample standard deviation +of iid random variables are consistent as well + +--- + +## The Central Limit Theorem + +- The **Central Limit Theorem** (CLT) is one of the most important theorems in statistics +- For our purposes, the CLT states that the distribution of averages of iid variables (properly normalized) becomes that of a standard normal as the sample size increases +- The CLT applies in an endless variety of settings +- The result is that +$$\frac{\bar X_n - \mu}{\sigma / \sqrt{n}}= +\frac{\sqrt n (\bar X_n - \mu)}{\sigma} += \frac{\mbox{Estimate} - \mbox{Mean of estimate}}{\mbox{Std. Err. of estimate}}$$ has a distribution like that of a standard normal for large $n$. +- (Replacing the standard error by its estimated value doesn't change the CLT) +- The useful way to think about the CLT is that +$\bar X_n$ is approximately +$N(\mu, \sigma^2 / n)$ + + + +--- + +## Example + +- Simulate a standard normal random variable by rolling $n$ (six sided) dice +- Let $X_i$ be the outcome for die $i$ +- Then note that $\mu = E[X_i] = 3.5$ +- $Var(X_i) = 2.92$ +- SE $\sqrt{2.92 / n} = 1.71 / \sqrt{n}$ +- Let's roll $n$ dice, take their mean, subtract off 3.5, +and divide by $1.71 / \sqrt{n}$ and repeat this over and over + + +--- +## Result of our die rolling experiment + +```{r, echo = FALSE, fig.width=9, fig.height = 6, fig.align='center'} +nosim <- 1000 +cfunc <- function(x, n) sqrt(n) * (mean(x) - 3.5) / 1.71 +dat <- data.frame( + x = c(apply(matrix(sample(1 : 6, nosim * 10, replace = TRUE), + nosim), 1, cfunc, 10), + apply(matrix(sample(1 : 6, nosim * 20, replace = TRUE), + nosim), 1, cfunc, 20), + apply(matrix(sample(1 : 6, nosim * 30, replace = TRUE), + nosim), 1, cfunc, 30) + ), + size = factor(rep(c(10, 20, 30), rep(nosim, 3)))) +g <- ggplot(dat, aes(x = x, fill = size)) + geom_histogram(alpha = .20, binwidth=.3, 
colour = "black", aes(y = ..density..)) +g <- g + stat_function(fun = dnorm, size = 2) +g + facet_grid(. ~ size) +``` + + +--- +## Coin CLT + +- Let $X_i$ be the $0$ or $1$ result of the $i^{th}$ flip of a possibly unfair coin +- The sample proportion, say $\hat p$, is the average of the coin flips +- $E[X_i] = p$ and $Var(X_i) = p(1-p)$ +- Standard error of the mean is $\sqrt{p(1-p)/n}$ +- Then +$$ + \frac{\hat p - p}{\sqrt{p(1-p)/n}} +$$ +will be approximately normally distributed +- Let's flip a coin $n$ times, take the sample proportion +of heads, subtract off .5 and multiply the result by +$2 \sqrt{n}$ (divide by $1/(2 \sqrt{n})$) + +--- +## Simulation results +```{r, echo = FALSE, fig.width=9, fig.height = 6, fig.align='center'} +nosim <- 1000 +cfunc <- function(x, n) 2 * sqrt(n) * (mean(x) - 0.5) +dat <- data.frame( + x = c(apply(matrix(sample(0:1, nosim * 10, replace = TRUE), + nosim), 1, cfunc, 10), + apply(matrix(sample(0:1, nosim * 20, replace = TRUE), + nosim), 1, cfunc, 20), + apply(matrix(sample(0:1, nosim * 30, replace = TRUE), + nosim), 1, cfunc, 30) + ), + size = factor(rep(c(10, 20, 30), rep(nosim, 3)))) +g <- ggplot(dat, aes(x = x, fill = size)) + geom_histogram(binwidth=.3, colour = "black", aes(y = ..density..)) +g <- g + stat_function(fun = dnorm, size = 2) +g + facet_grid(. 
~ size) +``` + +--- +## Simulation results, $p = 0.9$ +```{r, echo = FALSE, fig.width=9, fig.height = 6, fig.align='center'} +nosim <- 1000 +cfunc <- function(x, n) sqrt(n) * (mean(x) - 0.9) / sqrt(.1 * .9) +dat <- data.frame( + x = c(apply(matrix(sample(0:1, prob = c(.1,.9), nosim * 10, replace = TRUE), + nosim), 1, cfunc, 10), + apply(matrix(sample(0:1, prob = c(.1,.9), nosim * 20, replace = TRUE), + nosim), 1, cfunc, 20), + apply(matrix(sample(0:1, prob = c(.1,.9), nosim * 30, replace = TRUE), + nosim), 1, cfunc, 30) + ), + size = factor(rep(c(10, 20, 30), rep(nosim, 3)))) +g <- ggplot(dat, aes(x = x, fill = size)) + geom_histogram(binwidth=.3, colour = "black", aes(y = ..density..)) +g <- g + stat_function(fun = dnorm, size = 2) +g + facet_grid(. ~ size) +``` + +--- +## Galton's quincunx + +http://en.wikipedia.org/wiki/Bean_machine#mediaviewer/File:Quincunx_(Galton_Box)_-_Galton_1889_diagram.png + + + +--- + +## Confidence intervals + +- According to the CLT, the sample mean, $\bar X$, +is approximately normal with mean $\mu$ and sd $\sigma / \sqrt{n}$ +- $\mu + 2 \sigma /\sqrt{n}$ is pretty far out in the tail +(only 2.5% of a normal being larger than 2 sds in the tail) +- Similarly, $\mu - 2 \sigma /\sqrt{n}$ is pretty far in the left tail (only 2.5% chance of a normal being smaller than 2 sds in the tail) +- So the probability $\bar X$ is bigger than $\mu + 2 \sigma / \sqrt{n}$ +or smaller than $\mu - 2 \sigma / \sqrt{n}$ is 5% + - Or equivalently, the probability of being between these limits is 95% +- The quantity $\bar X \pm 2 \sigma /\sqrt{n}$ is called +a 95% interval for $\mu$ +- The 95% refers to the fact that if one were to repeatedly +get samples of size $n$, about 95% of the intervals obtained +would contain $\mu$ +- The 97.5th quantile is 1.96 (so I rounded to 2 above) +- 90% interval you want (100 - 90) / 2 = 5% in each tail + - So you want the 95th percentile (1.645) + + +--- +## Give a confidence interval for the average height of sons +in 
Galton's data +```{r} +library(UsingR);data(father.son); x <- father.son$sheight +(mean(x) + c(-1, 1) * qnorm(.975) * sd(x) / sqrt(length(x))) / 12 +``` + +--- + +## Sample proportions + +- In the event that each $X_i$ is $0$ or $1$ with common success probability $p$ then $\sigma^2 = p(1 - p)$ +- The interval takes the form +$$ + \hat p \pm z_{1 - \alpha/2} \sqrt{\frac{p(1 - p)}{n}} +$$ +- Replacing $p$ by $\hat p$ in the standard error results in what is called a Wald confidence interval for $p$ +- For 95% intervals +$$\hat p \pm \frac{1}{\sqrt{n}}$$ +is a quick CI estimate for $p$ + +--- +## Example +* Your campaign advisor told you that in a random sample of 100 likely voters, + 56 intend to vote for you. + * Can you relax? Do you have this race in the bag? + * Without access to a computer or calculator, how precise is this estimate? +* `1/sqrt(100)=0.1` so a back of the envelope calculation gives an approximate 95% interval of `(0.46, 0.66)` + * Not enough for you to relax, better go do more campaigning! +* Rough guidelines, 100 for 1 decimal place, 10,000 for 2, 1,000,000 for 3. +```{r} +round(1 / sqrt(10 ^ (1 : 6)), 3) +``` + + + +--- +## Binomial interval + +```{r} +.56 + c(-1, 1) * qnorm(.975) * sqrt(.56 * .44 / 100) +binom.test(56, 100)$conf.int +``` + +--- + +## Simulation + +```{r} +n <- 20; pvals <- seq(.1, .9, by = .05); nosim <- 1000 +coverage <- sapply(pvals, function(p){ + phats <- rbinom(nosim, prob = p, size = n) / n + ll <- phats - qnorm(.975) * sqrt(phats * (1 - phats) / n) + ul <- phats + qnorm(.975) * sqrt(phats * (1 - phats) / n) + mean(ll < p & ul > p) +}) + +``` + + +--- +## Plot of the results (not so good) +```{r, echo=FALSE, fig.align='center', fig.height=6, fig.width=6} +ggplot(data.frame(pvals, coverage), aes(x = pvals, y = coverage)) + geom_line(size = 2) + geom_hline(yintercept = 0.95) + ylim(.75, 1.0) +``` + +--- +## What's happening? 
+- $n$ isn't large enough +for many of the values of $p$ +- Quick fix, form the interval with +$$ +\frac{X + 2}{n + 4} +$$ +- (Add two successes and failures, Agresti/Coull interval) + +--- +## Simulation +First let's show that coverage gets better with $n$ + +```{r} +n <- 100; pvals <- seq(.1, .9, by = .05); nosim <- 1000 +coverage2 <- sapply(pvals, function(p){ + phats <- rbinom(nosim, prob = p, size = n) / n + ll <- phats - qnorm(.975) * sqrt(phats * (1 - phats) / n) + ul <- phats + qnorm(.975) * sqrt(phats * (1 - phats) / n) + mean(ll < p & ul > p) +}) + +``` + +--- +## Plot of coverage for $n=100$ +```{r, fig.align='center', fig.height=6, fig.width=6, echo=FALSE} +ggplot(data.frame(pvals, coverage2), aes(x = pvals, y = coverage2)) + geom_line(size = 2) + geom_hline(yintercept = 0.95)+ ylim(.75, 1.0) +``` + +--- +## Simulation +Now let's look at $n=20$ but adding 2 successes and failures +```{r} +n <- 20; pvals <- seq(.1, .9, by = .05); nosim <- 1000 +coverage <- sapply(pvals, function(p){ + phats <- (rbinom(nosim, prob = p, size = n) + 2) / (n + 4) + ll <- phats - qnorm(.975) * sqrt(phats * (1 - phats) / n) + ul <- phats + qnorm(.975) * sqrt(phats * (1 - phats) / n) + mean(ll < p & ul > p) +}) +``` + + +--- +## Adding 2 successes and 2 failures +(It's a little conservative) +```{r, fig.align='center', fig.height=6, fig.width=6, echo=FALSE} +ggplot(data.frame(pvals, coverage), aes(x = pvals, y = coverage)) + geom_line(size = 2) + geom_hline(yintercept = 0.95)+ ylim(.75, 1.0) +``` + +--- + +## Poisson interval +* A nuclear pump failed 5 times out of 94.32 days; give a 95% confidence interval for the failure rate per day. +* $X \sim Poisson(\lambda t)$. 
+* Estimate $\hat \lambda = X/t$ +* $Var(\hat \lambda) = \lambda / t$ +* $\hat \lambda / t$ is our variance estimate + +--- +## R code +```{r} +x <- 5; t <- 94.32; lambda <- x / t +round(lambda + c(-1, 1) * qnorm(.975) * sqrt(lambda / t), 3) +poisson.test(x, T = 94.32)$conf +``` + + +--- +## Simulating the Poisson coverage rate +Let's see how this interval performs for lambda +values near what we're estimating +```{r} +lambdavals <- seq(0.005, 0.10, by = .01); nosim <- 1000 +t <- 100 +coverage <- sapply(lambdavals, function(lambda){ + lhats <- rpois(nosim, lambda = lambda * t) / t + ll <- lhats - qnorm(.975) * sqrt(lhats / t) + ul <- lhats + qnorm(.975) * sqrt(lhats / t) + mean(ll < lambda & ul > lambda) +}) +``` + + + +--- +## Coverage +(Gets really bad for small values of lambda) +```{r, fig.align='center', fig.height=6, fig.width=6, echo=FALSE} +ggplot(data.frame(lambdavals, coverage), aes(x = lambdavals, y = coverage)) + geom_line(size = 2) + geom_hline(yintercept = 0.95)+ylim(0, 1.0) +``` + + + +--- +## What if we increase t to 1000? 
+```{r, fig.align='center', fig.height=6, fig.width=6, echo=FALSE} +lambdavals <- seq(0.005, 0.10, by = .01); nosim <- 1000 +t <- 1000 +coverage <- sapply(lambdavals, function(lambda){ + lhats <- rpois(nosim, lambda = lambda * t) / t + ll <- lhats - qnorm(.975) * sqrt(lhats / t) + ul <- lhats + qnorm(.975) * sqrt(lhats / t) + mean(ll < lambda & ul > lambda) +}) +ggplot(data.frame(lambdavals, coverage), aes(x = lambdavals, y = coverage)) + geom_line(size = 2) + geom_hline(yintercept = 0.95) + ylim(0, 1.0) +``` + + +--- +## Summary +- The LLN states that averages of iid samples +converge to the population means that they are estimating +- The CLT states that averages are approximately normal, with +distributions + - centered at the population mean + - with standard deviation equal to the standard error of the mean + - CLT gives no guarantee that $n$ is large enough +- Taking the mean and adding and subtracting the relevant +normal quantile times the SE yields a confidence interval for the mean + - Adding and subtracting 2 SEs works for 95% intervals +- Confidence intervals get wider as the coverage increases +(why?) +- Confidence intervals get narrower with less variability or +larger sample sizes +- The Poisson and binomial case have exact intervals that +don't require the CLT + - But a quick fix for small sample size binomial calculations is to add 2 successes and failures diff --git a/06_StatisticalInference/07_Asymptopia/index.html b/06_StatisticalInference/07_Asymptopia/index.html new file mode 100644 index 000000000..72b17e765 --- /dev/null +++ b/06_StatisticalInference/07_Asymptopia/index.html @@ -0,0 +1,850 @@ + + + + A trip to Asymptopia + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

A trip to Asymptopia

+

Statistical Inference

+

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

Asymptotics

+
+
+
    +
  • Asymptotics is the term for the behavior of statistics as the sample size (or some other relevant quantity) limits to infinity (or some other relevant number)
  • +
  • (Asymptopia is my name for the land of asymptotics, where everything works out well and there are no messes. The land of infinite data is nice that way.)
  • +
  • Asymptotics are incredibly useful for simple statistical inference and approximations
  • +
  • (Not covered in this class) Asymptotics often lead to nice understanding of procedures
  • +
  • Asymptotics generally give no assurances about finite sample performance
  • +
  • Asymptotics form the basis for frequency interpretation of probabilities +(the long run proportion of times an event occurs)
  • +
+ +
+ +
+ + +
+

Limits of random variables

+
+
+
    +
  • Fortunately, for the sample mean there's a set of powerful results
  • +
  • These results allow us to talk about the large sample distribution +of sample means of a collection of \(iid\) observations
  • +
  • The first of these results we intuitively know + +
      +
    • It says that the average limits to what it's estimating, the population mean
    • +
    • It's called the Law of Large Numbers
    • +
    • Example \(\bar X_n\) could be the average of the result of \(n\) coin flips (i.e. the sample proportion of heads)
    • +
    • As we flip a fair coin over and over, it eventually converges to the +true probability of a head +The LLN forms the basis of frequency style thinking
    • +
  • +
+ +
+ +
+ + +
+

Law of large numbers in action

+
+
+
n <- 10000
+means <- cumsum(rnorm(n))/(1:n)
+library(ggplot2)
+g <- ggplot(data.frame(x = 1:n, y = means), aes(x = x, y = y))
+g <- g + geom_hline(yintercept = 0) + geom_line(size = 2)
+g <- g + labs(x = "Number of obs", y = "Cumulative mean")
+g
+
+ +

plot of chunk unnamed-chunk-1

+ +
+ +
+ + +
+

Law of large numbers in action, coin flip

+
+
+
means <- cumsum(sample(0:1, n, replace = TRUE))/(1:n)
+g <- ggplot(data.frame(x = 1:n, y = means), aes(x = x, y = y))
+g <- g + geom_hline(yintercept = 0.5) + geom_line(size = 2)
+g <- g + labs(x = "Number of obs", y = "Cumulative mean")
+g
+
+ +

plot of chunk unnamed-chunk-2

+ +
+ +
+ + +
+

Discussion

+
+
+
    +
  • An estimator is consistent if it converges to what you want to estimate + +
      +
    • The LLN says that the sample mean of iid sample is +consistent for the population mean
    • +
    • Typically, good estimators are consistent; it's not too much to ask that if we go to the trouble of collecting an infinite amount of data that we get the right answer
    • +
  • +
  • The sample variance and the sample standard deviation +of iid random variables are consistent as well
  • +
+ +
+ +
+ + +
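The consistency claims above can be checked with a quick simulation; this is a minimal sketch (the N(0, 4) population, sample sizes, and seed are illustrative choices, not from the slides):

```r
# LLN / consistency sketch: the sample mean and sample variance of
# iid N(0, sd = 2) draws should settle toward mu = 0 and sigma^2 = 4
set.seed(42)
for (n in c(100, 10000, 1000000)) {
    x <- rnorm(n, mean = 0, sd = 2)
    cat(n, ": mean =", mean(x), " var =", var(x), "\n")
}
```

Both columns should visibly tighten around the population values as n grows.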
+

The Central Limit Theorem

+
+
+
    +
  • The Central Limit Theorem (CLT) is one of the most important theorems in statistics
  • +
  • For our purposes, the CLT states that the distribution of averages of iid variables (properly normalized) becomes that of a standard normal as the sample size increases
  • +
  • The CLT applies in an endless variety of settings
  • +
  • The result is that +\[\frac{\bar X_n - \mu}{\sigma / \sqrt{n}}= +\frac{\sqrt n (\bar X_n - \mu)}{\sigma} += \frac{\mbox{Estimate} - \mbox{Mean of estimate}}{\mbox{Std. Err. of estimate}}\] has a distribution like that of a standard normal for large \(n\).
  • +
  • (Replacing the standard error by its estimated value doesn't change the CLT)
  • +
  • The useful way to think about the CLT is that +\(\bar X_n\) is approximately +\(N(\mu, \sigma^2 / n)\)
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • Simulate a standard normal random variable by rolling \(n\) (six sided) dice
  • +
  • Let \(X_i\) be the outcome for die \(i\)
  • +
  • Then note that \(\mu = E[X_i] = 3.5\)
  • +
  • \(Var(X_i) = 2.92\)
  • +
  • SE \(\sqrt{2.92 / n} = 1.71 / \sqrt{n}\)
  • +
  • Let's roll \(n\) dice, take their mean, subtract off 3.5, +and divide by \(1.71 / \sqrt{n}\) and repeat this over and over
  • +
+ +
+ +
+ + +
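The recipe on the slide above can be written out directly; a minimal sketch for a single choice of n (the seed and n = 10 are illustrative):

```r
# Roll n dice nosim times and standardize each mean as described above:
# sqrt(n) * (sample mean - 3.5) / 1.71
set.seed(1)
nosim <- 1000; n <- 10
rolls <- matrix(sample(1:6, nosim * n, replace = TRUE), nosim)
z <- apply(rolls, 1, function(x) sqrt(n) * (mean(x) - 3.5) / 1.71)
c(mean(z), sd(z))  # near 0 and 1 if the CLT approximation is working
```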
+

Result of our die rolling experiment

+
+
+

plot of chunk unnamed-chunk-3

+ +
+ +
+ + +
+

Coin CLT

+
+
+
    +
  • Let \(X_i\) be the \(0\) or \(1\) result of the \(i^{th}\) flip of a possibly unfair coin + +
      +
    • The sample proportion, say \(\hat p\), is the average of the coin flips
    • +
    • \(E[X_i] = p\) and \(Var(X_i) = p(1-p)\)
    • +
    • Standard error of the mean is \(\sqrt{p(1-p)/n}\)
    • +
    • Then +\[ +\frac{\hat p - p}{\sqrt{p(1-p)/n}} +\] +will be approximately normally distributed
    • +
    • Let's flip a coin \(n\) times, take the sample proportion +of heads, subtract off .5 and multiply the result by +\(2 \sqrt{n}\) (divide by \(1/(2 \sqrt{n})\))
    • +
  • +
+ +
+ +
+ + +
+

Simulation results

+
+
+

plot of chunk unnamed-chunk-4

+ +
+ +
+ + +
+

Simulation results, \(p = 0.9\)

+
+
+

plot of chunk unnamed-chunk-5

+ +
+ +
+ + +
+

Galton's quincunx

+
+ + +
+ + +
+

Confidence intervals

+
+
+
    +
  • According to the CLT, the sample mean, \(\bar X\), +is approximately normal with mean \(\mu\) and sd \(\sigma / \sqrt{n}\)
  • +
  • \(\mu + 2 \sigma /\sqrt{n}\) is pretty far out in the tail +(only 2.5% of a normal being larger than 2 sds in the tail)
  • +
  • Similarly, \(\mu - 2 \sigma /\sqrt{n}\) is pretty far in the left tail (only 2.5% chance of a normal being smaller than 2 sds in the tail)
  • +
  • So the probability \(\bar X\) is bigger than \(\mu + 2 \sigma / \sqrt{n}\) +or smaller than \(\mu - 2 \sigma / \sqrt{n}\) is 5% + +
      +
    • Or equivalently, the probability of being between these limits is 95%
    • +
  • +
  • The quantity \(\bar X \pm 2 \sigma /\sqrt{n}\) is called +a 95% interval for \(\mu\)
  • +
  • The 95% refers to the fact that if one were to repeatedly +get samples of size \(n\), about 95% of the intervals obtained +would contain \(\mu\)
  • +
  • The 97.5th quantile is 1.96 (so I rounded to 2 above)
  • +
  • 90% interval you want (100 - 90) / 2 = 5% in each tail + +
      +
    • So you want the 95th percentile (1.645)
    • +
  • +
+ +
+ +
+ + +
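The standard normal quantiles quoted above are easy to verify in R:

```r
# 97.5th percentile of the standard normal, rounded to 2 in the interval above
qnorm(0.975)  # roughly 1.96
# 95th percentile, the relevant quantile for 90% intervals
qnorm(0.95)   # roughly 1.645
```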
+

Give a confidence interval for the average height of sons

+
+
+

in Galton's data

+ +
library(UsingR)
+data(father.son)
+x <- father.son$sheight
+(mean(x) + c(-1, 1) * qnorm(0.975) * sd(x)/sqrt(length(x)))/12
+
+ +
## [1] 5.710 5.738
+
+ +
+ +
+ + +
+

Sample proportions

+
+
+
    +
  • In the event that each \(X_i\) is \(0\) or \(1\) with common success probability \(p\) then \(\sigma^2 = p(1 - p)\)
  • +
  • The interval takes the form +\[ +\hat p \pm z_{1 - \alpha/2} \sqrt{\frac{p(1 - p)}{n}} +\]
  • +
  • Replacing \(p\) by \(\hat p\) in the standard error results in what is called a Wald confidence interval for \(p\)
  • +
  • For 95% intervals +\[\hat p \pm \frac{1}{\sqrt{n}}\] +is a quick CI estimate for \(p\)
  • +
+ +
+ +
+ + +
+

Example

+
+
+
    +
  • Your campaign advisor told you that in a random sample of 100 likely voters, +56 intend to vote for you. + +
      +
    • Can you relax? Do you have this race in the bag?
    • +
    • Without access to a computer or calculator, how precise is this estimate?
    • +
  • +
  • 1/sqrt(100)=0.1 so a back of the envelope calculation gives an approximate 95% interval of (0.46, 0.66) + +
      +
    • Not enough for you to relax, better go do more campaigning!
    • +
  • +
  • Rough guidelines, 100 for 1 decimal place, 10,000 for 2, 1,000,000 for 3.
  • +
+ +
round(1/sqrt(10^(1:6)), 3)
+
+ +
## [1] 0.316 0.100 0.032 0.010 0.003 0.001
+
+ +
+ +
+ + +
+

Binomial interval

+
+
+
0.56 + c(-1, 1) * qnorm(0.975) * sqrt(0.56 * 0.44/100)
+
+ +
## [1] 0.4627 0.6573
+
+ +
binom.test(56, 100)$conf.int
+
+ +
## [1] 0.4572 0.6592
+## attr(,"conf.level")
+## [1] 0.95
+
+ +
+ +
+ + +
+

Simulation

+
+
+
n <- 20
+pvals <- seq(0.1, 0.9, by = 0.05)
+nosim <- 1000
+coverage <- sapply(pvals, function(p) {
+    phats <- rbinom(nosim, prob = p, size = n)/n
+    ll <- phats - qnorm(0.975) * sqrt(phats * (1 - phats)/n)
+    ul <- phats + qnorm(0.975) * sqrt(phats * (1 - phats)/n)
+    mean(ll < p & ul > p)
+})
+
+ +
+ +
+ + +
+

Plot of the results (not so good)

+
+
+

plot of chunk unnamed-chunk-10

+ +
+ +
+ + +
+

What's happening?

+
+
+
    +
  • \(n\) isn't large enough for the CLT to be applicable +for many of the values of \(p\)
  • +
  • Quick fix, form the interval with +\[ +\frac{X + 2}{n + 4} +\]
  • +
  • (Add two successes and failures, Agresti/Coull interval)
  • +
+ +
+ +
+ + +
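Applied to the earlier 56-out-of-100 polling example, the quick fix above gives the following; this is a sketch, and using n + 4 in the standard error is one common form of the Agresti/Coull interval (the slides' simulation divides by n instead):

```r
# Add 2 successes and 2 failures, then form a Wald-style interval
x <- 56; n <- 100
ptilde <- (x + 2) / (n + 4)
ptilde + c(-1, 1) * qnorm(0.975) * sqrt(ptilde * (1 - ptilde) / (n + 4))
```

The adjusted estimate shrinks slightly toward 1/2 relative to 0.56.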
+

Simulation

+
+
+

First let's show that coverage gets better with \(n\)

+ +
n <- 100
+pvals <- seq(0.1, 0.9, by = 0.05)
+nosim <- 1000
+coverage2 <- sapply(pvals, function(p) {
+    phats <- rbinom(nosim, prob = p, size = n)/n
+    ll <- phats - qnorm(0.975) * sqrt(phats * (1 - phats)/n)
+    ul <- phats + qnorm(0.975) * sqrt(phats * (1 - phats)/n)
+    mean(ll < p & ul > p)
+})
+
+ +
+ +
+ + +
+

Plot of coverage for \(n=100\)

+
+
+

plot of chunk unnamed-chunk-12

+ +
+ +
+ + +
+

Simulation

+
+
+

Now let's look at \(n=20\) but adding 2 successes and failures

+ +
n <- 20
+pvals <- seq(0.1, 0.9, by = 0.05)
+nosim <- 1000
+coverage <- sapply(pvals, function(p) {
+    phats <- (rbinom(nosim, prob = p, size = n) + 2)/(n + 4)
+    ll <- phats - qnorm(0.975) * sqrt(phats * (1 - phats)/n)
+    ul <- phats + qnorm(0.975) * sqrt(phats * (1 - phats)/n)
+    mean(ll < p & ul > p)
+})
+
+ +
+ +
+ + +
+

Adding 2 successes and 2 failures

+
+
+

(It's a little conservative) +plot of chunk unnamed-chunk-14

+ +
+ +
+ + +
+

Poisson interval

+
+
+
    +
  • A nuclear pump failed 5 times in 94.32 days of operation; give a 95% confidence interval for the failure rate per day.
  • +
  • \(X \sim Poisson(\lambda t)\).
  • +
  • Estimate \(\hat \lambda = X/t\)
  • +
  • \(Var(\hat \lambda) = \lambda / t\)
  • +
  • \(\hat \lambda / t\) is our variance estimate
  • +
+ +
+ +
+ + +
+

R code

+
+
+
x <- 5
+t <- 94.32
+lambda <- x/t
+round(lambda + c(-1, 1) * qnorm(0.975) * sqrt(lambda/t), 3)
+
+ +
## [1] 0.007 0.099
+
+ +
poisson.test(x, T = 94.32)$conf
+
+ +
## [1] 0.01721 0.12371
+## attr(,"conf.level")
+## [1] 0.95
+
+ +
+ +
+ + +
+

Simulating the Poisson coverage rate

+
+
+

Let's see how this interval performs for lambda +values near what we're estimating

+ +
lambdavals <- seq(0.005, 0.1, by = 0.01)
+nosim <- 1000
+t <- 100
+coverage <- sapply(lambdavals, function(lambda) {
+    lhats <- rpois(nosim, lambda = lambda * t)/t
+    ll <- lhats - qnorm(0.975) * sqrt(lhats/t)
+    ul <- lhats + qnorm(0.975) * sqrt(lhats/t)
+    mean(ll < lambda & ul > lambda)
+})
+
+ +
+ +
+ + +
+

Coverage

+
+
+

(Gets really bad for small values of lambda) +plot of chunk unnamed-chunk-17

+ +
+ +
+ + +
+

What if we increase t to 1000?

+
+
+

plot of chunk unnamed-chunk-18

+ +
+ +
+ + +
+
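The plot for larger t can be regenerated by rerunning the earlier coverage simulation with only the monitoring time changed (a sketch of what presumably produced the figure; same code as before, t increased to 1000):

```r
# Same simulation as for t = 100, but with monitoring time t = 1000, so
# lambda * t is larger and the normal approximation improves
lambdavals <- seq(0.005, 0.1, by = 0.01)
nosim <- 1000
t <- 1000
coverage <- sapply(lambdavals, function(lambda) {
    lhats <- rpois(nosim, lambda = lambda * t)/t
    ll <- lhats - qnorm(0.975) * sqrt(lhats/t)
    ul <- lhats + qnorm(0.975) * sqrt(lhats/t)
    mean(ll < lambda & ul > lambda)
})
```

With t = 1000 even the smallest lambda on the grid gives an expected count of 5, and coverage improves across the grid, matching the plot.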

Summary

+
+
+
    +
  • The LLN states that averages of iid samples +converge to the population means that they are estimating
  • +
  • The CLT states that averages are approximately normal, with +distributions + +
      +
    • centered at the population mean
    • +
    • with standard deviation equal to the standard error of the mean
    • +
    • The CLT gives no guarantee that \(n\) is large enough
    • +
  • +
  • Taking the mean and adding and subtracting the relevant +normal quantile times the SE yields a confidence interval for the mean + +
      +
    • Adding and subtracting 2 SEs works for 95% intervals
    • +
  • +
  • Confidence intervals get wider as the coverage increases +(why?)
  • +
  • Confidence intervals get narrower with less variability or +larger sample sizes
  • +
  • The Poisson and binomial case have exact intervals that +don't require the CLT + +
      +
    • But a quick fix for small-sample binomial calculations is to add 2 successes and 2 failures
    • +
  • +
+ +
+ +
+ + +
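Why intervals widen as coverage increases can be seen directly from the normal quantile in the margin of error; a quick sketch (not from the slides):

```r
# Margin of error is (normal quantile) x SE; higher coverage means a
# larger quantile and hence a wider interval
coverages <- c(0.90, 0.95, 0.99)
quantiles <- qnorm(1 - (1 - coverages)/2)
round(quantiles, 3)
## [1] 1.645 1.960 2.576
```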
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/07_Asymptopia/index.md b/06_StatisticalInference/07_Asymptopia/index.md new file mode 100644 index 000000000..eccfcd58c --- /dev/null +++ b/06_StatisticalInference/07_Asymptopia/index.md @@ -0,0 +1,419 @@ +--- +title : A trip to Asymptopia +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## Asymptotics +* Asymptotics is the term for the behavior of statistics as the sample size (or some other relevant quantity) limits to infinity (or some other relevant number) +* (Asymptopia is my name for the land of asymptotics, where everything works out well and there's no messes. The land of infinite data is nice that way.) 
+* Asymptotics are incredibly useful for simple statistical inference and approximations +* (Not covered in this class) Asymptotics often lead to nice understanding of procedures +* Asymptotics generally give no assurances about finite sample performance +* Asymptotics form the basis for frequency interpretation of probabilities + (the long run proportion of times an event occurs) + + +--- + +## Limits of random variables + +- Fortunately, for the sample mean there's a set of powerful results +- These results allow us to talk about the large sample distribution +of sample means of a collection of $iid$ observations +- The first of these results we inuitively know + - It says that the average limits to what its estimating, the population mean + - It's called the Law of Large Numbers + - Example $\bar X_n$ could be the average of the result of $n$ coin flips (i.e. the sample proportion of heads) + - As we flip a fair coin over and over, it evetually converges to the + true probability of a head + The LLN forms the basis of frequency style thinking + + +--- +## Law of large numbers in action + +```r +n <- 10000 +means <- cumsum(rnorm(n))/(1:n) +library(ggplot2) +g <- ggplot(data.frame(x = 1:n, y = means), aes(x = x, y = y)) +g <- g + geom_hline(yintercept = 0) + geom_line(size = 2) +g <- g + labs(x = "Number of obs", y = "Cumulative mean") +g +``` + +![plot of chunk unnamed-chunk-1](assets/fig/unnamed-chunk-1.png) + + + +--- +## Law of large numbers in action, coin flip + +```r +means <- cumsum(sample(0:1, n, replace = TRUE))/(1:n) +g <- ggplot(data.frame(x = 1:n, y = means), aes(x = x, y = y)) +g <- g + geom_hline(yintercept = 0.5) + geom_line(size = 2) +g <- g + labs(x = "Number of obs", y = "Cumulative mean") +g +``` + +![plot of chunk unnamed-chunk-2](assets/fig/unnamed-chunk-2.png) + + + + +--- +## Discussion +- An estimator is **consistent** if it converges to what you want to estimate + - The LLN says that the sample mean of iid sample is + consistent for the 
population mean + - Typically, good estimators are consistent; it's not too much to ask that if we go to the trouble of collecting an infinite amount of data that we get the right answer +- The sample variance and the sample standard deviation +of iid random variables are consistent as well + +--- + +## The Central Limit Theorem + +- The **Central Limit Theorem** (CLT) is one of the most important theorems in statistics +- For our purposes, the CLT states that the distribution of averages of iid variables (properly normalized) becomes that of a standard normal as the sample size increases +- The CLT applies in an endless variety of settings +- The result is that +$$\frac{\bar X_n - \mu}{\sigma / \sqrt{n}}= +\frac{\sqrt n (\bar X_n - \mu)}{\sigma} += \frac{\mbox{Estimate} - \mbox{Mean of estimate}}{\mbox{Std. Err. of estimate}}$$ has a distribution like that of a standard normal for large $n$. +- (Replacing the standard error by its estimated value doesn't change the CLT) +- The useful way to think about the CLT is that +$\bar X_n$ is approximately +$N(\mu, \sigma^2 / n)$ + + + +--- + +## Example + +- Simulate a standard normal random variable by rolling $n$ (six sided) +- Let $X_i$ be the outcome for die $i$ +- Then note that $\mu = E[X_i] = 3.5$ +- $Var(X_i) = 2.92$ +- SE $\sqrt{2.92 / n} = 1.71 / \sqrt{n}$ +- Lets roll $n$ dice, take their mean, subtract off 3.5, +and divide by $1.71 / \sqrt{n}$ and repeat this over and over + + +--- +## Result of our die rolling experiment + +plot of chunk unnamed-chunk-3 + + + +--- +## Coin CLT + + - Let $X_i$ be the $0$ or $1$ result of the $i^{th}$ flip of a possibly unfair coin +- The sample proportion, say $\hat p$, is the average of the coin flips +- $E[X_i] = p$ and $Var(X_i) = p(1-p)$ +- Standard error of the mean is $\sqrt{p(1-p)/n}$ +- Then +$$ + \frac{\hat p - p}{\sqrt{p(1-p)/n}} +$$ +will be approximately normally distributed +- Let's flip a coin $n$ times, take the sample proportion +of heads, subtract off .5 and 
multiply the result by +$2 \sqrt{n}$ (divide by $1/(2 \sqrt{n})$) + +--- +## Simulation results +plot of chunk unnamed-chunk-4 + + +--- +## Simulation results, $p = 0.9$ +plot of chunk unnamed-chunk-5 + + +--- +## Galton's quincunx + +http://en.wikipedia.org/wiki/Bean_machine#mediaviewer/File:Quincunx_(Galton_Box)_-_Galton_1889_diagram.png + + + +--- + +## Confidence intervals + +- According to the CLT, the sample mean, $\bar X$, +is approximately normal with mean $\mu$ and sd $\sigma / \sqrt{n}$ +- $\mu + 2 \sigma /\sqrt{n}$ is pretty far out in the tail +(only 2.5% of a normal being larger than 2 sds in the tail) +- Similarly, $\mu - 2 \sigma /\sqrt{n}$ is pretty far in the left tail (only 2.5% chance of a normal being smaller than 2 sds in the tail) +- So the probability $\bar X$ is bigger than $\mu + 2 \sigma / \sqrt{n}$ +or smaller than $\mu - 2 \sigma / \sqrt{n}$ is 5% + - Or equivalently, the probability of being between these limits is 95% +- The quantity $\bar X \pm 2 \sigma /\sqrt{n}$ is called +a 95% interval for $\mu$ +- The 95% refers to the fact that if one were to repeatly +get samples of size $n$, about 95% of the intervals obtained +would contain $\mu$ +- The 97.5th quantile is 1.96 (so I rounded to 2 above) +- 90% interval you want (100 - 90) / 2 = 5% in each tail + - So you want the 95th percentile (1.645) + + +--- +## Give a confidence interval for the average height of sons +in Galton's data + +```r +library(UsingR) +data(father.son) +x <- father.son$sheight +(mean(x) + c(-1, 1) * qnorm(0.975) * sd(x)/sqrt(length(x)))/12 +``` + +``` +## [1] 5.710 5.738 +``` + + +--- + +## Sample proportions + +- In the event that each $X_i$ is $0$ or $1$ with common success probability $p$ then $\sigma^2 = p(1 - p)$ +- The interval takes the form +$$ + \hat p \pm z_{1 - \alpha/2} \sqrt{\frac{p(1 - p)}{n}} +$$ +- Replacing $p$ by $\hat p$ in the standard error results in what is called a Wald confidence interval for $p$ +- For 95% intervals +$$\hat p \pm 
\frac{1}{\sqrt{n}}$$ +is a quick CI estimate for $p$ + +--- +## Example +* Your campaign advisor told you that in a random sample of 100 likely voters, + 56 intent to vote for you. + * Can you relax? Do you have this race in the bag? + * Without access to a computer or calculator, how precise is this estimate? +* `1/sqrt(100)=0.1` so a back of the envelope calculation gives an approximate 95% interval of `(0.46, 0.66)` + * Not enough for you to relax, better go do more campaigning! +* Rough guidelines, 100 for 1 decimal place, 10,000 for 2, 1,000,000 for 3. + +```r +round(1/sqrt(10^(1:6)), 3) +``` + +``` +## [1] 0.316 0.100 0.032 0.010 0.003 0.001 +``` + + + + +--- +## Binomial interval + + +```r +0.56 + c(-1, 1) * qnorm(0.975) * sqrt(0.56 * 0.44/100) +``` + +``` +## [1] 0.4627 0.6573 +``` + +```r +binom.test(56, 100)$conf.int +``` + +``` +## [1] 0.4572 0.6592 +## attr(,"conf.level") +## [1] 0.95 +``` + + +--- + +## Simulation + + +```r +n <- 20 +pvals <- seq(0.1, 0.9, by = 0.05) +nosim <- 1000 +coverage <- sapply(pvals, function(p) { + phats <- rbinom(nosim, prob = p, size = n)/n + ll <- phats - qnorm(0.975) * sqrt(phats * (1 - phats)/n) + ul <- phats + qnorm(0.975) * sqrt(phats * (1 - phats)/n) + mean(ll < p & ul > p) +}) +``` + + + +--- +## Plot of the results (not so good) +plot of chunk unnamed-chunk-10 + + +--- +## What's happening? 
+- $n$ isn't large enough for the CLT to be applicable +for many of the values of $p$ +- Quick fix, form the interval with +$$ +\frac{X + 2}{n + 4} +$$ +- (Add two successes and failures, Agresti/Coull interval) + +--- +## Simulation +First let's show that coverage gets better with $n$ + + +```r +n <- 100 +pvals <- seq(0.1, 0.9, by = 0.05) +nosim <- 1000 +coverage2 <- sapply(pvals, function(p) { + phats <- rbinom(nosim, prob = p, size = n)/n + ll <- phats - qnorm(0.975) * sqrt(phats * (1 - phats)/n) + ul <- phats + qnorm(0.975) * sqrt(phats * (1 - phats)/n) + mean(ll < p & ul > p) +}) +``` + + +--- +## Plot of coverage for $n=100$ +plot of chunk unnamed-chunk-12 + + +--- +## Simulation +Now let's look at $n=20$ but adding 2 successes and failures + +```r +n <- 20 +pvals <- seq(0.1, 0.9, by = 0.05) +nosim <- 1000 +coverage <- sapply(pvals, function(p) { + phats <- (rbinom(nosim, prob = p, size = n) + 2)/(n + 4) + ll <- phats - qnorm(0.975) * sqrt(phats * (1 - phats)/n) + ul <- phats + qnorm(0.975) * sqrt(phats * (1 - phats)/n) + mean(ll < p & ul > p) +}) +``` + + + +--- +## Adding 2 successes and 2 failures +(It's a little conservative) +plot of chunk unnamed-chunk-14 + + +--- + +## Poisson interval +* A nuclear pump failed 5 times out of 94.32 days, give a 95% confidence interval for the failure rate per day? +* $X \sim Poisson(\lambda t)$. 
+* Estimate $\hat \lambda = X/t$ +* $Var(\hat \lambda) = \lambda / t$ +* $\hat \lambda / t$ is our variance estimate + +--- +## R code + +```r +x <- 5 +t <- 94.32 +lambda <- x/t +round(lambda + c(-1, 1) * qnorm(0.975) * sqrt(lambda/t), 3) +``` + +``` +## [1] 0.007 0.099 +``` + +```r +poisson.test(x, T = 94.32)$conf +``` + +``` +## [1] 0.01721 0.12371 +## attr(,"conf.level") +## [1] 0.95 +``` + + + +--- +## Simulating the Poisson coverage rate +Let's see how this interval performs for lambda +values near what we're estimating + +```r +lambdavals <- seq(0.005, 0.1, by = 0.01) +nosim <- 1000 +t <- 100 +coverage <- sapply(lambdavals, function(lambda) { + lhats <- rpois(nosim, lambda = lambda * t)/t + ll <- lhats - qnorm(0.975) * sqrt(lhats/t) + ul <- lhats + qnorm(0.975) * sqrt(lhats/t) + mean(ll < lambda & ul > lambda) +}) +``` + + + + +--- +## Covarage +(Gets really bad for small values of lambda) +plot of chunk unnamed-chunk-17 + + + + +--- +## What if we increase t to 1000? +plot of chunk unnamed-chunk-18 + + + +--- +## Summary +- The LLN states that averages of iid samples +converge to the population means that they are estimating +- The CLT states that averages are approximately normal, with +distributions + - centered at the population mean + - with standard deviation equal to the standard error of the mean + - CLT gives no guarantee that $n$ is large enough +- Taking the mean and adding and subtracting the relevant +normal quantile times the SE yields a confidence interval for the mean + - Adding and subtracting 2 SEs works for 95% intervals +- Confidence intervals get wider as the coverage increases +(why?) 
+- Confidence intervals get narrower with less variability or +larger sample sizes +- The Poisson and binomial case have exact intervals that +don't require the CLT + - But a quick fix for small sample size binomial calculations is to add 2 successes and failures diff --git a/06_StatisticalInference/07_Asymptopia/index.pdf b/06_StatisticalInference/07_Asymptopia/index.pdf new file mode 100644 index 000000000..79cf80d5c Binary files /dev/null and b/06_StatisticalInference/07_Asymptopia/index.pdf differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-10.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-10.png new file mode 100644 index 000000000..83ff203af Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-10.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-11.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-11.png new file mode 100644 index 000000000..42169d11d Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-11.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-12.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-12.png new file mode 100644 index 000000000..a6b1571c6 Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-12.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-13.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-13.png new file mode 100644 index 000000000..42169d11d Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-13.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-2.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..83ff203af Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-2.png differ diff --git 
a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-3.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-3.png new file mode 100644 index 000000000..c9cf93fe1 Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-3.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-4.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-4.png new file mode 100644 index 000000000..83ff203af Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-4.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-5.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-5.png new file mode 100644 index 000000000..1793327e3 Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-5.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-6.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-6.png new file mode 100644 index 000000000..1793327e3 Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-6.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-7.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-7.png new file mode 100644 index 000000000..1793327e3 Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-7.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-8.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-8.png new file mode 100644 index 000000000..83ff203af Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-8.png differ diff --git a/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-9.png b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-9.png new file mode 100644 index 000000000..a6b1571c6 Binary files /dev/null and b/06_StatisticalInference/08_tCIs/assets/fig/unnamed-chunk-9.png 
differ diff --git a/06_StatisticalInference/08_tCIs/index.Rmd b/06_StatisticalInference/08_tCIs/index.Rmd new file mode 100644 index 000000000..44d768a00 --- /dev/null +++ b/06_StatisticalInference/08_tCIs/index.Rmd @@ -0,0 +1,292 @@ +--- +title : T Confidence Intervals +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## T Confidence intervals + +- In the previous, we discussed creating a confidence interval using the CLT + - They took the form $Est \pm ZQ \times SE_{Est}$ +- In this lecture, we discuss some methods for small samples, notably Gosset's $t$ distribution and $t$ confidence intervals + - They are of the form $Est \pm TQ \times SE_{Est}$ +- These are some of the handiest of intervals +- If you want a rule between whether to use a $t$ interval +or normal interval, just always use the $t$ interval +- We'll cover the one and two group versions + +--- + +## Gosset's $t$ distribution + +- Invented by William Gosset (under the pseudonym "Student") in 1908 +- Has thicker tails than the normal +- Is indexed by a degrees of freedom; gets more like a standard normal as df gets larger +- It assumes that the underlying data are iid +Gaussian with the result that +$$ +\frac{\bar X - \mu}{S/\sqrt{n}} +$$ +follows Gosset's $t$ distribution with $n-1$ degrees of freedom +- (If we replaced $s$ by $\sigma$ the statistic would be exactly standard normal) +- Interval is $\bar X \pm t_{n-1} S/\sqrt{n}$ where $t_{n-1}$ +is the relevant quantile + +--- +## Code for manipulate +```{r, echo=TRUE,eval=FALSE} +library(ggplot2); library(manipulate) +k <- 1000 
+xvals <- seq(-5, 5, length = k) +myplot <- function(df){ + d <- data.frame(y = c(dnorm(xvals), dt(xvals, df)), + x = xvals, + dist = factor(rep(c("Normal", "T"), c(k,k)))) + g <- ggplot(d, aes(x = x, y = y)) + g <- g + geom_line(size = 2, aes(colour = dist)) + g +} +manipulate(myplot(mu), mu = slider(1, 20, step = 1)) +``` + +--- +## Easier to see +```{r, eval = FALSE, echo = TRUE} +pvals <- seq(.5, .99, by = .01) +myplot2 <- function(df){ + d <- data.frame(n= qnorm(pvals),t=qt(pvals, df), + p = pvals) + g <- ggplot(d, aes(x= n, y = t)) + g <- g + geom_abline(size = 2, col = "lightblue") + g <- g + geom_line(size = 2, col = "black") + g <- g + geom_vline(xintercept = qnorm(0.975)) + g <- g + geom_hline(yintercept = qt(0.975, df)) + g +} +manipulate(myplot2(df), df = slider(1, 20, step = 1)) +``` + +--- + +## Note's about the $t$ interval + +- The $t$ interval technically assumes that the data are iid normal, though it is robust to this assumption +- It works well whenever the distribution of the data is roughly symmetric and mound shaped +- Paired observations are often analyzed using the $t$ interval by taking differences +- For large degrees of freedom, $t$ quantiles become the same as standard normal quantiles; therefore this interval converges to the same interval as the CLT yielded +- For skewed distributions, the spirit of the $t$ interval assumptions are violated + - Also, for skewed distributions, it doesn't make a lot of sense to center the interval at the mean + - In this case, consider taking logs or using a different summary like the median +- For highly discrete data, like binary, other intervals are available + +--- + +## Sleep data + +In R typing `data(sleep)` brings up the sleep data originally +analyzed in Gosset's Biometrika paper, which shows the increase in +hours for 10 patients on two soporific drugs. R treats the data as two +groups rather than paired. 
+ +--- +## The data +```{r} +data(sleep) +head(sleep) +``` + +--- +## Plotting the data +```{r, echo = FALSE, fig.width=6, fig.height=6, fig.align='center'} +library(ggplot2) +g <- ggplot(sleep, aes(x = group, y = extra, group = factor(ID))) +g <- g + geom_line(size = 1, aes(colour = ID)) + geom_point(size =10, pch = 21, fill = "salmon", alpha = .5) +g +``` + +--- +## Results +```{r, echo=TRUE} +g1 <- sleep$extra[1 : 10]; g2 <- sleep$extra[11 : 20] +difference <- g2 - g1 +mn <- mean(difference); s <- sd(difference); n <- 10 +``` +```{r, echo=TRUE,eval=FALSE} +mn + c(-1, 1) * qt(.975, n-1) * s / sqrt(n) +t.test(difference) +t.test(g2, g1, paired = TRUE) +t.test(extra ~ I(relevel(group, 2)), paired = TRUE, data = sleep) +``` + +--- +## The results +(After a little formatting) +```{r, echo = FALSE} +rbind( +mn + c(-1, 1) * qt(.975, n-1) * s / sqrt(n), +as.vector(t.test(difference)$conf.int), +as.vector(t.test(g2, g1, paired = TRUE)$conf.int), +as.vector(t.test(extra ~ I(relevel(group, 2)), paired = TRUE, data = sleep)$conf.int) +) +``` + +--- + +## Independent group $t$ confidence intervals + +- Suppose that we want to compare the mean blood pressure between two groups in a randomized trial; those who received the treatment to those who received a placebo +- We cannot use the paired t test because the groups are independent and may have different sample sizes +- We now present methods for comparing independent groups + +--- +## Confidence interval + +- Therefore a $(1 - \alpha)\times 100\%$ confidence interval for $\mu_y - \mu_x$ is +$$ + \bar Y - \bar X \pm t_{n_x + n_y - 2, 1 - \alpha/2}S_p\left(\frac{1}{n_x} + \frac{1}{n_y}\right)^{1/2} +$$ +- The pooled variance estimator is $$S_p^2 = \{(n_x - 1) S_x^2 + (n_y - 1) S_y^2\}/(n_x + n_y - 2)$$ +- Remember this interval is assuming a constant variance across the two groups +- If there is some doubt, assume a different variance per group, which we will discuss later + +--- + +## Example +### Based on Rosner, 
Fundamentals of Biostatistics +(Really a very good reference book) + +- Comparing SBP for 8 oral contraceptive users versus 21 controls +- $\bar X_{OC} = 132.86$ mmHg with $s_{OC} = 15.34$ mmHg +- $\bar X_{C} = 127.44$ mmHg with $s_{C} = 18.23$ mmHg +- Pooled variance estimate +```{r} +sp <- sqrt((7 * 15.34^2 + 20 * 18.23^2) / (8 + 21 - 2)) +132.86 - 127.44 + c(-1, 1) * qt(.975, 27) * sp * (1 / 8 + 1 / 21)^.5 +``` + + +--- +## Mistakenly treating the sleep data as grouped +```{r} +n1 <- length(g1); n2 <- length(g2) +sp <- sqrt( ((n1 - 1) * sd(x1)^2 + (n2-1) * sd(x2)^2) / (n1 + n2-2)) +md <- mean(g2) - mean(g1) +semd <- sp * sqrt(1 / n1 + 1/n2) +rbind( +md + c(-1, 1) * qt(.975, n1 + n2 - 2) * semd, +t.test(g2, g1, paired = FALSE, var.equal = TRUE)$conf, +t.test(g2, g1, paired = TRUE)$conf +) +``` + +--- +## Grouped versus independent +```{r, echo = FALSE, fig.width=6, fig.height=6, fig.align='center'} +library(ggplot2) +g <- ggplot(sleep, aes(x = group, y = extra, group = factor(ID))) +g <- g + geom_line(size = 1, aes(colour = ID)) + geom_point(size =10, pch = 21, fill = "salmon", alpha = .5) +g +``` + +--- + +## `ChickWeight` data in R +```{r} +library(datasets); data(ChickWeight); library(reshape2) +##define weight gain or loss +wideCW <- dcast(ChickWeight, Diet + Chick ~ Time, value.var = "weight") +names(wideCW)[-(1 : 2)] <- paste("time", names(wideCW)[-(1 : 2)], sep = "") +library(dplyr) +wideCW <- mutate(wideCW, + gain = time21 - time0 +) + +``` + +--- +## Plotting the raw data + +```{r, echo =FALSE, fig.align='center', fig.width=12, fig.height=6} +g <- ggplot(ChickWeight, aes(x = Time, y = weight, + colour = Diet, group = Chick)) +g <- g + geom_line() +g <- g + stat_summary(aes(group = 1), geom = "line", fun.y = mean, size = 1, col = "black") +g <- g + facet_grid(. 
~ Diet) +g +``` + + + +--- +## Weight gain by diet +```{r, echo=FALSE, fig.align='center', fig.width=6, fig.height=6, warning=FALSE} +g <- ggplot(wideCW, aes(x = factor(Diet), y = gain, fill = factor(Diet))) +g <- g + geom_violin(col = "black", size = 2) +g + +``` + +--- +## Let's do a t interval +```{r} +wideCW14 <- subset(wideCW, Diet %in% c(1, 4)) +rbind( +t.test(gain ~ Diet, paired = FALSE, var.equal = TRUE, data = wideCW14)$conf, +t.test(gain ~ Diet, paired = FALSE, var.equal = FALSE, data = wideCW14)$conf +) +``` + + +--- + +## Unequal variances + +- Under unequal variances +$$ +\bar Y - \bar X \pm t_{df} \times \left(\frac{s_x^2}{n_x} + \frac{s_y^2}{n_y}\right)^{1/2} +$$ +where $t_{df}$ is calculated with degrees of freedom +$$ +df= \frac{\left(S_x^2 / n_x + S_y^2/n_y\right)^2} + {\left(\frac{S_x^2}{n_x}\right)^2 / (n_x - 1) + + \left(\frac{S_y^2}{n_y}\right)^2 / (n_y - 1)} +$$ +will be approximately a 95% interval +- This works really well + - So when in doubt, just assume unequal variances + +--- + +## Example + +- Comparing SBP for 8 oral contraceptive users versus 21 controls +- $\bar X_{OC} = 132.86$ mmHg with $s_{OC} = 15.34$ mmHg +- $\bar X_{C} = 127.44$ mmHg with $s_{C} = 18.23$ mmHg +- $df=15.04$, $t_{15.04, .975} = 2.13$ +- Interval +$$ +132.86 - 127.44 \pm 2.13 \left(\frac{15.34^2}{8} + \frac{18.23^2}{21} \right)^{1/2} += [-8.91, 19.75] +$$ +- In R, `t.test(..., var.equal = FALSE)` + +--- +## Comparing other kinds of data +* For binomial data, there's lots of ways to compare two groups + * Relative risk, risk difference, odds ratio. + * Chi-squared tests, normal approximations, exact tests. +* For count data, there's also Chi-squared tests and exact tests. +* We'll leave the discussions for comparing groups of data for binary + and count data until covering glms in the regression class. +* In addition, Mathematical Biostatistics Boot Camp 2 covers many special + cases relevant to biostatistics. 
+ diff --git a/06_StatisticalInference/08_tCIs/index.html b/06_StatisticalInference/08_tCIs/index.html new file mode 100644 index 000000000..910843b4e --- /dev/null +++ b/06_StatisticalInference/08_tCIs/index.html @@ -0,0 +1,668 @@ + + + + T Confidence Intervals + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

T Confidence Intervals

+

Statistical Inference

+

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

T Confidence intervals

+
+
+
    +
  • In the previous lecture, we discussed creating a confidence interval using the CLT + +
      +
    • They took the form \(Est \pm ZQ \times SE_{Est}\)
    • +
  • +
  • In this lecture, we discuss some methods for small samples, notably Gosset's \(t\) distribution and \(t\) confidence intervals + +
      +
    • They are of the form \(Est \pm TQ \times SE_{Est}\)
    • +
  • +
  • These are some of the handiest of intervals
  • +
  • If you want a rule for whether to use a \(t\) interval +or a normal interval, just always use the \(t\) interval
  • +
  • We'll cover the one and two group versions
  • +
+ +
+ +
+ + +
+

Gosset's \(t\) distribution

+
+
+
    +
  • Invented by William Gosset (under the pseudonym "Student") in 1908
  • +
  • Has thicker tails than the normal
  • +
  • Is indexed by degrees of freedom; gets more like a standard normal as the df gets larger
  • +
  • It assumes that the underlying data are iid +Gaussian with the result that +\[ +\frac{\bar X - \mu}{S/\sqrt{n}} +\] +follows Gosset's \(t\) distribution with \(n-1\) degrees of freedom
  • +
  • (If we replaced \(S\) by \(\sigma\) the statistic would be exactly standard normal)
  • +
  • Interval is \(\bar X \pm t_{n-1} S/\sqrt{n}\) where \(t_{n-1}\) +is the relevant quantile
  • +
+ +
+ +
+ + +
+

Code for manipulate

+
+
+
library(ggplot2)
+library(manipulate)
+k <- 1000
+xvals <- seq(-5, 5, length = k)
+myplot <- function(df) {
+    d <- data.frame(y = c(dnorm(xvals), dt(xvals, df)), x = xvals, dist = factor(rep(c("Normal", 
+        "T"), c(k, k))))
+    g <- ggplot(d, aes(x = x, y = y))
+    g <- g + geom_line(size = 2, aes(colour = dist))
+    g
+}
+manipulate(myplot(mu), mu = slider(1, 20, step = 1))
+
+ +
+ +
+ + +
+

Easier to see

+
+
+
pvals <- seq(0.5, 0.99, by = 0.01)
+myplot2 <- function(df) {
+    d <- data.frame(n = qnorm(pvals), t = qt(pvals, df), p = pvals)
+    g <- ggplot(d, aes(x = n, y = t))
+    g <- g + geom_abline(size = 2, col = "lightblue")
+    g <- g + geom_line(size = 2, col = "black")
+    g <- g + geom_vline(xintercept = qnorm(0.975))
+    g <- g + geom_hline(yintercept = qt(0.975, df))
+    g
+}
+manipulate(myplot2(df), df = slider(1, 20, step = 1))
+
+ +
+ +
+ + +
+

Notes about the \(t\) interval

+
+
+
    +
  • The \(t\) interval technically assumes that the data are iid normal, though it is robust to this assumption
  • +
  • It works well whenever the distribution of the data is roughly symmetric and mound shaped
  • +
  • Paired observations are often analyzed using the \(t\) interval by taking differences
  • +
  • For large degrees of freedom, \(t\) quantiles become the same as standard normal quantiles; therefore this interval converges to the same interval as the CLT yielded
  • +
  • For skewed distributions, the spirit of the \(t\) interval assumptions are violated + +
      +
    • Also, for skewed distributions, it doesn't make a lot of sense to center the interval at the mean
    • +
    • In this case, consider taking logs or using a different summary like the median
    • +
  • +
  • For highly discrete data, like binary, other intervals are available
  • +
+ +
+ +
+ + +
+

Sleep data

+
+
+

In R, typing data(sleep) brings up the sleep data originally +analyzed in Gosset's Biometrika paper, which shows the increase in +hours of sleep for 10 patients on two soporific drugs. R treats the data as two +groups rather than paired.


The data

data(sleep)
head(sleep)

##   extra group ID
## 1   0.7     1  1
## 2  -1.6     1  2
## 3  -0.2     1  3
## 4  -1.2     1  4
## 5  -0.1     1  5
## 6   3.4     1  6

Plotting the data

## Warning: package 'ggplot2' was built under R version 3.1.1

plot of chunk unnamed-chunk-4


Results

g1 <- sleep$extra[1:10]
g2 <- sleep$extra[11:20]
difference <- g2 - g1
mn <- mean(difference)
s <- sd(difference)
n <- 10

mn + c(-1, 1) * qt(0.975, n - 1) * s/sqrt(n)
t.test(difference)
t.test(g2, g1, paired = TRUE)
t.test(extra ~ I(relevel(group, 2)), paired = TRUE, data = sleep)

The results


(After a little formatting)

##        [,1] [,2]
## [1,] 0.7001 2.46
## [2,] 0.7001 2.46
## [3,] 0.7001 2.46
## [4,] 0.7001 2.46

Independent group \(t\) confidence intervals

  • Suppose that we want to compare the mean blood pressure between two groups in a randomized trial; those who received the treatment to those who received a placebo
  • We cannot use the paired t test because the groups are independent and may have different sample sizes
  • We now present methods for comparing independent groups

Confidence interval

  • Therefore a \((1 - \alpha)\times 100\%\) confidence interval for \(\mu_y - \mu_x\) is
\[
\bar Y - \bar X \pm t_{n_x + n_y - 2, 1 - \alpha/2} S_p\left(\frac{1}{n_x} + \frac{1}{n_y}\right)^{1/2}
\]
  • The pooled variance estimator is \[S_p^2 = \{(n_x - 1) S_x^2 + (n_y - 1) S_y^2\}/(n_x + n_y - 2)\]
  • Remember this interval is assuming a constant variance across the two groups
  • If there is some doubt, assume a different variance per group, which we will discuss later
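The two formulas above can be wrapped in a small helper that builds the pooled interval straight from summary statistics (a sketch; `pooledInterval` is our own name, not lecture code):

```r
# (1 - alpha) pooled-variance t interval for mu_y - mu_x from summary stats
pooledInterval <- function(mx, my, sx, sy, nx, ny, alpha = 0.05) {
    sp <- sqrt(((nx - 1) * sx^2 + (ny - 1) * sy^2) / (nx + ny - 2))
    my - mx + c(-1, 1) * qt(1 - alpha/2, nx + ny - 2) * sp * sqrt(1/nx + 1/ny)
}
# Oral contraceptive example below: y = OC users, x = controls
pooledInterval(mx = 127.44, my = 132.86, sx = 18.23, sy = 15.34, nx = 21, ny = 8)
```

This reproduces the interval of roughly (-9.52, 20.36) computed on the following slide.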

Example


Based on Rosner, Fundamentals of Biostatistics


(Really a very good reference book)

  • Comparing SBP for 8 oral contraceptive users versus 21 controls
  • \(\bar X_{OC} = 132.86\) mmHg with \(s_{OC} = 15.34\) mmHg
  • \(\bar X_{C} = 127.44\) mmHg with \(s_{C} = 18.23\) mmHg
  • Pooled variance estimate
sp <- sqrt((7 * 15.34^2 + 20 * 18.23^2)/(8 + 21 - 2))
132.86 - 127.44 + c(-1, 1) * qt(0.975, 27) * sp * (1/8 + 1/21)^0.5

## [1] -9.521 20.361

Mistakenly treating the sleep data as grouped

n1 <- length(g1)
n2 <- length(g2)
sp <- sqrt(((n1 - 1) * sd(g1)^2 + (n2 - 1) * sd(g2)^2)/(n1 + n2 - 2))
md <- mean(g2) - mean(g1)
semd <- sp * sqrt(1/n1 + 1/n2)
rbind(md + c(-1, 1) * qt(0.975, n1 + n2 - 2) * semd, t.test(g2, g1, paired = FALSE, 
    var.equal = TRUE)$conf, t.test(g2, g1, paired = TRUE)$conf)

##         [,1]  [,2]
## [1,] -0.2039 3.364
## [2,] -0.2039 3.364
## [3,]  0.7001 2.460

Grouped versus independent


plot of chunk unnamed-chunk-10


ChickWeight data in R

library(datasets)
data(ChickWeight)
library(reshape2)
## define weight gain or loss
wideCW <- dcast(ChickWeight, Diet + Chick ~ Time, value.var = "weight")
names(wideCW)[-(1:2)] <- paste("time", names(wideCW)[-(1:2)], sep = "")
library(dplyr)

## Attaching package: 'dplyr'
## 
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## 
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union

wideCW <- mutate(wideCW, gain = time21 - time0)

Plotting the raw data


plot of chunk unnamed-chunk-12


Weight gain by diet


plot of chunk unnamed-chunk-13


Let's do a t interval

wideCW14 <- subset(wideCW, Diet %in% c(1, 4))
rbind(t.test(gain ~ Diet, paired = FALSE, var.equal = TRUE, data = wideCW14)$conf, 
    t.test(gain ~ Diet, paired = FALSE, var.equal = FALSE, data = wideCW14)$conf)

##        [,1]   [,2]
## [1,] -108.1 -14.81
## [2,] -104.7 -18.30

Unequal variances

  • Under unequal variances
\[
\bar Y - \bar X \pm t_{df} \times \left(\frac{S_x^2}{n_x} + \frac{S_y^2}{n_y}\right)^{1/2}
\]
where \(t_{df}\) is calculated with degrees of freedom
\[
df = \frac{\left(S_x^2 / n_x + S_y^2/n_y\right)^2}
{\left(\frac{S_x^2}{n_x}\right)^2 / (n_x - 1) +
 \left(\frac{S_y^2}{n_y}\right)^2 / (n_y - 1)}
\]
will be approximately a 95% interval
  • This works really well
      • So when in doubt, just assume unequal variances

Example

  • Comparing SBP for 8 oral contraceptive users versus 21 controls
  • \(\bar X_{OC} = 132.86\) mmHg with \(s_{OC} = 15.34\) mmHg
  • \(\bar X_{C} = 127.44\) mmHg with \(s_{C} = 18.23\) mmHg
  • \(df = 15.04\), \(t_{15.04, .975} = 2.13\)
  • Interval
\[
132.86 - 127.44 \pm 2.13 \left(\frac{15.34^2}{8} + \frac{18.23^2}{21} \right)^{1/2}
= [-8.91, 19.75]
\]
  • In R, t.test(..., var.equal = FALSE)
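The \(df\) and interval quoted above can be verified numerically from the summary statistics (a quick sketch, not lecture code):

```r
# Welch degrees of freedom and interval for the SBP example
sx <- 15.34; nx <- 8    # oral contraceptive users
sy <- 18.23; ny <- 21   # controls
df <- (sx^2/nx + sy^2/ny)^2 /
    ((sx^2/nx)^2/(nx - 1) + (sy^2/ny)^2/(ny - 1))
df                                                      # about 15.04
132.86 - 127.44 + c(-1, 1) * qt(0.975, df) * sqrt(sx^2/nx + sy^2/ny)
```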

Comparing other kinds of data

  • For binomial data, there's lots of ways to compare two groups
      • Relative risk, risk difference, odds ratio.
      • Chi-squared tests, normal approximations, exact tests.
  • For count data, there's also Chi-squared tests and exact tests.
  • We'll leave the discussions for comparing groups of data for binary and count data until covering glms in the regression class.
  • In addition, Mathematical Biostatistics Boot Camp 2 covers many special cases relevant to biostatistics.
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/08_tCIs/index.md b/06_StatisticalInference/08_tCIs/index.md new file mode 100644 index 000000000..e140c1b27 --- /dev/null +++ b/06_StatisticalInference/08_tCIs/index.md @@ -0,0 +1,345 @@ +--- +title : T Confidence Intervals +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## T Confidence intervals + +- In the previous, we discussed creating a confidence interval using the CLT + - They took the form $Est \pm ZQ \times SE_{Est}$ +- In this lecture, we discuss some methods for small samples, notably Gosset's $t$ distribution and $t$ confidence intervals + - They are of the form $Est \pm TQ \times SE_{Est}$ +- These are some of the handiest of intervals +- If you want a rule between whether to use a $t$ interval +or normal interval, just always use the $t$ interval +- We'll cover the one and two group versions + +--- + +## Gosset's $t$ distribution + +- Invented by William Gosset (under the pseudonym "Student") in 1908 +- Has thicker tails than the normal +- Is indexed by a degrees of freedom; gets more like a standard normal as df gets larger +- It assumes that the underlying data are iid +Gaussian with the result that +$$ +\frac{\bar X - \mu}{S/\sqrt{n}} +$$ +follows Gosset's $t$ distribution with $n-1$ degrees of freedom +- (If we replaced $s$ by $\sigma$ the statistic would be exactly standard normal) +- Interval is $\bar X \pm t_{n-1} S/\sqrt{n}$ where $t_{n-1}$ +is the relevant quantile + +--- +## Code for manipulate + +```r +library(ggplot2) 
+library(manipulate) +k <- 1000 +xvals <- seq(-5, 5, length = k) +myplot <- function(df) { + d <- data.frame(y = c(dnorm(xvals), dt(xvals, df)), x = xvals, dist = factor(rep(c("Normal", + "T"), c(k, k)))) + g <- ggplot(d, aes(x = x, y = y)) + g <- g + geom_line(size = 2, aes(colour = dist)) + g +} +manipulate(myplot(mu), mu = slider(1, 20, step = 1)) +``` + + +--- +## Easier to see + +```r +pvals <- seq(0.5, 0.99, by = 0.01) +myplot2 <- function(df) { + d <- data.frame(n = qnorm(pvals), t = qt(pvals, df), p = pvals) + g <- ggplot(d, aes(x = n, y = t)) + g <- g + geom_abline(size = 2, col = "lightblue") + g <- g + geom_line(size = 2, col = "black") + g <- g + geom_vline(xintercept = qnorm(0.975)) + g <- g + geom_hline(yintercept = qt(0.975, df)) + g +} +manipulate(myplot2(df), df = slider(1, 20, step = 1)) +``` + + +--- + +## Note's about the $t$ interval + +- The $t$ interval technically assumes that the data are iid normal, though it is robust to this assumption +- It works well whenever the distribution of the data is roughly symmetric and mound shaped +- Paired observations are often analyzed using the $t$ interval by taking differences +- For large degrees of freedom, $t$ quantiles become the same as standard normal quantiles; therefore this interval converges to the same interval as the CLT yielded +- For skewed distributions, the spirit of the $t$ interval assumptions are violated + - Also, for skewed distributions, it doesn't make a lot of sense to center the interval at the mean + - In this case, consider taking logs or using a different summary like the median +- For highly discrete data, like binary, other intervals are available + +--- + +## Sleep data + +In R typing `data(sleep)` brings up the sleep data originally +analyzed in Gosset's Biometrika paper, which shows the increase in +hours for 10 patients on two soporific drugs. R treats the data as two +groups rather than paired. 
+ +--- +## The data + +```r +data(sleep) +head(sleep) +``` + +``` +## extra group ID +## 1 0.7 1 1 +## 2 -1.6 1 2 +## 3 -0.2 1 3 +## 4 -1.2 1 4 +## 5 -0.1 1 5 +## 6 3.4 1 6 +``` + + +--- +## Plotting the data + +``` +## Warning: package 'ggplot2' was built under R version 3.1.1 +``` + +plot of chunk unnamed-chunk-4 + + +--- +## Results + +```r +g1 <- sleep$extra[1:10] +g2 <- sleep$extra[11:20] +difference <- g2 - g1 +mn <- mean(difference) +s <- sd(difference) +n <- 10 +``` + + +```r +mn + c(-1, 1) * qt(0.975, n - 1) * s/sqrt(n) +t.test(difference) +t.test(g2, g1, paired = TRUE) +t.test(extra ~ I(relevel(group, 2)), paired = TRUE, data = sleep) +``` + + +--- +## The results +(After a little formatting) + +``` +## [,1] [,2] +## [1,] 0.7001 2.46 +## [2,] 0.7001 2.46 +## [3,] 0.7001 2.46 +## [4,] 0.7001 2.46 +``` + + +--- + +## Independent group $t$ confidence intervals + +- Suppose that we want to compare the mean blood pressure between two groups in a randomized trial; those who received the treatment to those who received a placebo +- We cannot use the paired t test because the groups are independent and may have different sample sizes +- We now present methods for comparing independent groups + +--- +## Confidence interval + +- Therefore a $(1 - \alpha)\times 100\%$ confidence interval for $\mu_y - \mu_x$ is +$$ + \bar Y - \bar X \pm t_{n_x + n_y - 2, 1 - \alpha/2}S_p\left(\frac{1}{n_x} + \frac{1}{n_y}\right)^{1/2} +$$ +- The pooled variance estimator is $$S_p^2 = \{(n_x - 1) S_x^2 + (n_y - 1) S_y^2\}/(n_x + n_y - 2)$$ +- Remember this interval is assuming a constant variance across the two groups +- If there is some doubt, assume a different variance per group, which we will discuss later + +--- + +## Example +### Based on Rosner, Fundamentals of Biostatistics +(Really a very good reference book) + +- Comparing SBP for 8 oral contraceptive users versus 21 controls +- $\bar X_{OC} = 132.86$ mmHg with $s_{OC} = 15.34$ mmHg +- $\bar X_{C} = 127.44$ mmHg with $s_{C} 
= 18.23$ mmHg +- Pooled variance estimate + +```r +sp <- sqrt((7 * 15.34^2 + 20 * 18.23^2)/(8 + 21 - 2)) +132.86 - 127.44 + c(-1, 1) * qt(0.975, 27) * sp * (1/8 + 1/21)^0.5 +``` + +``` +## [1] -9.521 20.361 +``` + + + +--- +## Mistakenly treating the sleep data as grouped + +```r +n1 <- length(g1) +n2 <- length(g2) +sp <- sqrt(((n1 - 1) * sd(x1)^2 + (n2 - 1) * sd(x2)^2)/(n1 + n2 - 2)) +``` + +``` +## Error: object 'x1' not found +``` + +```r +md <- mean(g2) - mean(g1) +semd <- sp * sqrt(1/n1 + 1/n2) +rbind(md + c(-1, 1) * qt(0.975, n1 + n2 - 2) * semd, t.test(g2, g1, paired = FALSE, + var.equal = TRUE)$conf, t.test(g2, g1, paired = TRUE)$conf) +``` + +``` +## [,1] [,2] +## [1,] -14.8873 18.047 +## [2,] -0.2039 3.364 +## [3,] 0.7001 2.460 +``` + + +--- +## Grouped versus independent +plot of chunk unnamed-chunk-10 + + +--- + +## `ChickWeight` data in R + +```r +library(datasets) +data(ChickWeight) +library(reshape2) +## define weight gain or loss +wideCW <- dcast(ChickWeight, Diet + Chick ~ Time, value.var = "weight") +names(wideCW)[-(1:2)] <- paste("time", names(wideCW)[-(1:2)], sep = "") +library(dplyr) +``` + +``` +## +## Attaching package: 'dplyr' +## +## The following objects are masked from 'package:stats': +## +## filter, lag +## +## The following objects are masked from 'package:base': +## +## intersect, setdiff, setequal, union +``` + +```r +wideCW <- mutate(wideCW, gain = time21 - time0) +``` + + +--- +## Plotting the raw data + +plot of chunk unnamed-chunk-12 + + + + +--- +## Weight gain by diet +plot of chunk unnamed-chunk-13 + + +--- +## Let's do a t interval + +```r +wideCW14 <- subset(wideCW, Diet %in% c(1, 4)) +rbind(t.test(gain ~ Diet, paired = FALSE, var.equal = TRUE, data = wideCW14)$conf, + t.test(gain ~ Diet, paired = FALSE, var.equal = FALSE, data = wideCW14)$conf) +``` + +``` +## [,1] [,2] +## [1,] -108.1 -14.81 +## [2,] -104.7 -18.30 +``` + + + +--- + +## Unequal variances + +- Under unequal variances +$$ +\bar Y - \bar X \pm t_{df} \times 
\left(\frac{s_x^2}{n_x} + \frac{s_y^2}{n_y}\right)^{1/2} +$$ +where $t_{df}$ is calculated with degrees of freedom +$$ +df= \frac{\left(S_x^2 / n_x + S_y^2/n_y\right)^2} + {\left(\frac{S_x^2}{n_x}\right)^2 / (n_x - 1) + + \left(\frac{S_y^2}{n_y}\right)^2 / (n_y - 1)} +$$ +will be approximately a 95% interval +- This works really well + - So when in doubt, just assume unequal variances + +--- + +## Example + +- Comparing SBP for 8 oral contraceptive users versus 21 controls +- $\bar X_{OC} = 132.86$ mmHg with $s_{OC} = 15.34$ mmHg +- $\bar X_{C} = 127.44$ mmHg with $s_{C} = 18.23$ mmHg +- $df=15.04$, $t_{15.04, .975} = 2.13$ +- Interval +$$ +132.86 - 127.44 \pm 2.13 \left(\frac{15.34^2}{8} + \frac{18.23^2}{21} \right)^{1/2} += [-8.91, 19.75] +$$ +- In R, `t.test(..., var.equal = FALSE)` + +--- +## Comparing other kinds of data +* For binomial data, there's lots of ways to compare two groups + * Relative risk, risk difference, odds ratio. + * Chi-squared tests, normal approximations, exact tests. +* For count data, there's also Chi-squared tests and exact tests. +* We'll leave the discussions for comparing groups of data for binary + and count data until covering glms in the regression class. +* In addition, Mathematical Biostatistics Boot Camp 2 covers many special + cases relevant to biostatistics. 
+ diff --git a/06_StatisticalInference/08_tCIs/index.pdf b/06_StatisticalInference/08_tCIs/index.pdf new file mode 100644 index 000000000..9c12c073a Binary files /dev/null and b/06_StatisticalInference/08_tCIs/index.pdf differ diff --git a/06_StatisticalInference/09_HT/index.Rmd b/06_StatisticalInference/09_HT/index.Rmd new file mode 100644 index 000000000..40140aa48 --- /dev/null +++ b/06_StatisticalInference/09_HT/index.Rmd @@ -0,0 +1,241 @@ +--- +title : Hypothesis testing +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Hypothesis testing +* Hypothesis testing is concerned with making decisions using data +* A null hypothesis is specified that represents the status quo, + usually labeled $H_0$ +* The null hypothesis is assumed true and statistical evidence is required + to reject it in favor of a research or alternative hypothesis + +--- +## Example +* A respiratory disturbance index of more than $30$ events / hour, say, is + considered evidence of severe sleep disordered breathing (SDB). +* Suppose that in a sample of $100$ overweight subjects with other + risk factors for sleep disordered breathing at a sleep clinic, the + mean RDI was $32$ events / hour with a standard deviation of $10$ events / hour. +* We might want to test the hypothesis that + * $H_0 : \mu = 30$ + * $H_a : \mu > 30$ + * where $\mu$ is the population mean RDI. 
+ +--- +## Hypothesis testing +* The alternative hypotheses are typically of the form $<$, $>$ or $\neq$ +* Note that there are four possible outcomes of our statistical decision process + +Truth | Decide | Result | +---|---|---| +$H_0$ | $H_0$ | Correctly accept null | +$H_0$ | $H_a$ | Type I error | +$H_a$ | $H_a$ | Correctly reject null | +$H_a$ | $H_0$ | Type II error | + +--- +## Discussion +* Consider a court of law; the null hypothesis is that the + defendant is innocent +* We require a standard on the available evidence to reject the null hypothesis (convict) +* If we set a low standard, then we would increase the + percentage of innocent people convicted (type I errors); however we + would also increase the percentage of guilty people convicted + (correctly rejecting the null) +* If we set a high standard, then we increase the the + percentage of innocent people let free (correctly accepting the + null) while we would also increase the percentage of guilty people + let free (type II errors) + +--- +## Example +* Consider our sleep example again +* A reasonable strategy would reject the null hypothesis if + $\bar X$ was larger than some constant, say $C$ +* Typically, $C$ is chosen so that the probability of a Type I + error, $\alpha$, is $.05$ (or some other relevant constant) +* $\alpha$ = Type I error rate = Probability of rejecting the null hypothesis when, in fact, the null hypothesis is correct + +--- +## Example continued +- Standard error of the mean $10 / \sqrt{100} = 1$ +- Under $H_0$ $\bar X \sim N(30, 1)$ +- We want to chose $C$ so that the $P(\bar X > C; H_0)$ is +5% +- The 95th percentile of a normal distribution is 1.645 +standard deviations from the mean +- If $C = 30 + 1 \times 1.645 = 31.645$ + - Then the probability that a $N(30, 1)$ is larger + than it is 5% + - So the rule "Reject $H_0$ when $\bar X \geq 31.645$" + has the property that the probability of rejection + is 5% when $H_0$ is true (for the $\mu_0$, $\sigma$ + and $n$ given) + 
+ +--- +## Discussion +* In general we don't convert $C$ back to the original scale +* We would just reject because the Z-score; which is how many + standard errors the sample mean is above the hypothesized mean + $$ + \frac{32 - 30}{10 / \sqrt{100}} = 2 + $$ + is greater than $1.645$ +* Or, whenever $\sqrt{n} (\bar X - \mu_0) / s > Z_{1-\alpha}$ + +--- +## General rules +* The $Z$ test for $H_0:\mu = \mu_0$ versus + * $H_1: \mu < \mu_0$ + * $H_2: \mu \neq \mu_0$ + * $H_3: \mu > \mu_0$ +* Test statistic $ TS = \frac{\bar{X} - \mu_0}{S / \sqrt{n}} $ +* Reject the null hypothesis when + * $TS \leq Z_{\alpha} = -Z_{1 - \alpha}$ + * $|TS| \geq Z_{1 - \alpha / 2}$ + * $TS \geq Z_{1 - \alpha}$ + +--- +## Notes +* We have fixed $\alpha$ to be low, so if we reject $H_0$ (either + our model is wrong) or there is a low probability that we have made + an error +* We have not fixed the probability of a type II error, $\beta$; + therefore we tend to say ``Fail to reject $H_0$'' rather than + accepting $H_0$ +* Statistical significance is no the same as scientific + significance +* The region of TS values for which you reject $H_0$ is called the + rejection region + +--- +## More notes +* The $Z$ test requires the assumptions of the CLT and for $n$ to be large enough + for it to apply +* If $n$ is small, then a Gossett's $T$ test is performed exactly in the same way, + with the normal quantiles replaced by the appropriate Student's $T$ quantiles and + $n-1$ df +* The probability of rejecting the null hypothesis when it is false is called *power* +* Power is a used a lot to calculate sample sizes for experiments + +--- +## Example reconsidered +- Consider our example again. 
Suppose that $n= 16$ (rather than +$100$) +- The statistic +$$ +\frac{\bar X - 30}{s / \sqrt{16}} +$$ +follows a $T$ distribution with 15 df under $H_0$ +- Under $H_0$, the probability that it is larger +that the 95th percentile of the $T$ distribution is 5% +- The 95th percentile of the T distribution with 15 +df is `r qt(.95, 15)` (obtained via `qt(.95, 15)`) +- So that our test statistic is now $\sqrt{16}(32 - 30) / 10 = 0.8 $ +- We now fail to reject. + +--- +## Two sided tests +* Suppose that we would reject the null hypothesis if in fact the mean was too large or too small +* That is, we want to test the alternative $H_a : \mu \neq 30$ +* We will reject if the test statistic, $0.8$, is either too large or too small +* Then we want the probability of rejecting under the +null to be 5%, split equally as 2.5% in the upper +tail and 2.5% in the lower tail +* Thus we reject if our test statistic is larger +than `qt(.975, 15)` or smaller than `qt(.025, 15)` + * This is the same as saying: reject if the + absolute value of our statistic is larger than + `qt(0.975, 15)` = `r qt(0.975, 15)` + * So we fail to reject the two sided test as well + * (If you fail to reject the one sided test, you + know that you will fail to reject the two sided) + +--- +## T test in R +```{r, echo=TRUE, comment=">", results='markup'} +library(UsingR); data(father.son) +t.test(father.son$sheight - father.son$fheight) +``` + +--- +## Connections with confidence intervals +* Consider testing $H_0: \mu = \mu_0$ versus $H_a: \mu \neq \mu_0$ +* Take the set of all possible values for which you fail to reject $H_0$, this set is a $(1-\alpha)100\%$ confidence interval for $\mu$ +* The same works in reverse; if a $(1-\alpha)100\%$ interval + contains $\mu_0$, then we *fail to* reject $H_0$ + +--- +## Two group intervals +- First, now you know how to do two group T tests +since we already covered indepedent group T intervals +- Rejection rules are the same +- Test $H_0 : \mu_1 = \mu_2$ +- Let's 
just go through an example + +--- +## `chickWeight` data +Recall that we reformatted this data +```{r, echo=TRUE,results='hide'} +library(datasets); data(ChickWeight); library(reshape2) +##define weight gain or loss +wideCW <- dcast(ChickWeight, Diet + Chick ~ Time, value.var = "weight") +names(wideCW)[-(1 : 2)] <- paste("time", names(wideCW)[-(1 : 2)], sep = "") +library(dplyr) +wideCW <- mutate(wideCW, + gain = time21 - time0 +) +``` + +--- +### Unequal variance T test comparing diets 1 and 4 +```{r,echo=TRUE, comment="> ", results='markup'} +wideCW14 <- subset(wideCW, Diet %in% c(1, 4)) +t.test(gain ~ Diet, paired = FALSE, + var.equal = TRUE, data = wideCW14) +``` + + + +--- +## Exact binomial test +- Recall this problem, *Suppose a friend has $8$ children, $7$ of which are girls and none are twins* +- Perform the relevant hypothesis test. $H_0 : p = 0.5$ $H_a : p > 0.5$ + - What is the relevant rejection region so that the probability of rejecting is (less than) 5%? + +Rejection region | Type I error rate | +---|---| +[0 : 8] | `r pbinom(-1, size = 8, p = .5, lower.tail = FALSE)` +[1 : 8] | `r pbinom( 0, size = 8, p = .5, lower.tail = FALSE)` +[2 : 8] | `r pbinom( 1, size = 8, p = .5, lower.tail = FALSE)` +[3 : 8] | `r pbinom( 2, size = 8, p = .5, lower.tail = FALSE)` +[4 : 8] | `r pbinom( 3, size = 8, p = .5, lower.tail = FALSE)` +[5 : 8] | `r pbinom( 4, size = 8, p = .5, lower.tail = FALSE)` +[6 : 8] | `r pbinom( 5, size = 8, p = .5, lower.tail = FALSE)` +[7 : 8] | `r pbinom( 6, size = 8, p = .5, lower.tail = FALSE)` +[8 : 8] | `r pbinom( 7, size = 8, p = .5, lower.tail = FALSE)` + +--- +## Notes +* It's impossible to get an exact 5% level test for this case due to the discreteness of the binomial. + * The closest is the rejection region [7 : 8] + * Any alpha level lower than `r 1 / 2 ^8` is not attainable. +* For larger sample sizes, we could do a normal approximation, but you already knew this. +* Two sided test isn't obvious. 
+ * Given a way to do two sided tests, we could take the set of values of $p_0$ for which we fail to reject to get an exact binomial confidence interval (called the Clopper/Pearson interval, BTW) +* For these problems, people always create a P-value (next lecture) rather than computing the rejection region. + + diff --git a/06_StatisticalInference/09_HT/index.html b/06_StatisticalInference/09_HT/index.html new file mode 100644 index 000000000..1422d9b3c --- /dev/null +++ b/06_StatisticalInference/09_HT/index.html @@ -0,0 +1,682 @@ + + + + Hypothesis testing + + + + + + + + + + + + + + + + + + + + + + + + + + +
Hypothesis testing

Statistical Inference

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

Hypothesis testing

  • Hypothesis testing is concerned with making decisions using data
  • A null hypothesis is specified that represents the status quo, usually labeled \(H_0\)
  • The null hypothesis is assumed true and statistical evidence is required to reject it in favor of a research or alternative hypothesis

Example

  • A respiratory disturbance index of more than \(30\) events / hour, say, is considered evidence of severe sleep disordered breathing (SDB).
  • Suppose that in a sample of \(100\) overweight subjects with other risk factors for sleep disordered breathing at a sleep clinic, the mean RDI was \(32\) events / hour with a standard deviation of \(10\) events / hour.
  • We might want to test the hypothesis that
      • \(H_0 : \mu = 30\)
      • \(H_a : \mu > 30\)
      • where \(\mu\) is the population mean RDI.

Hypothesis testing

  • The alternative hypotheses are typically of the form \(<\), \(>\) or \(\neq\)
  • Note that there are four possible outcomes of our statistical decision process

Truth   | Decide  | Result
\(H_0\) | \(H_0\) | Correctly accept null
\(H_0\) | \(H_a\) | Type I error
\(H_a\) | \(H_a\) | Correctly reject null
\(H_a\) | \(H_0\) | Type II error

Discussion

  • Consider a court of law; the null hypothesis is that the defendant is innocent
  • We require a standard on the available evidence to reject the null hypothesis (convict)
  • If we set a low standard, then we would increase the percentage of innocent people convicted (type I errors); however we would also increase the percentage of guilty people convicted (correctly rejecting the null)
  • If we set a high standard, then we increase the percentage of innocent people let free (correctly accepting the null) while we would also increase the percentage of guilty people let free (type II errors)

Example

  • Consider our sleep example again
  • A reasonable strategy would reject the null hypothesis if \(\bar X\) was larger than some constant, say \(C\)
  • Typically, \(C\) is chosen so that the probability of a Type I error, \(\alpha\), is \(.05\) (or some other relevant constant)
  • \(\alpha\) = Type I error rate = Probability of rejecting the null hypothesis when, in fact, the null hypothesis is correct

Example continued

  • Standard error of the mean \(10 / \sqrt{100} = 1\)
  • Under \(H_0\), \(\bar X \sim N(30, 1)\)
  • We want to choose \(C\) so that \(P(\bar X > C; H_0)\) is 5%
  • The 95th percentile of a normal distribution is 1.645 standard deviations from the mean
  • If \(C = 30 + 1 \times 1.645 = 31.645\)
      • Then the probability that a \(N(30, 1)\) is larger than it is 5%
      • So the rule "Reject \(H_0\) when \(\bar X \geq 31.645\)" has the property that the probability of rejection is 5% when \(H_0\) is true (for the \(\mu_0\), \(\sigma\) and \(n\) given)
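As a sanity check (not in the original slides), the cutoff is just a normal quantile:

```r
# Under H0, Xbar ~ N(30, 1); the one-sided 5% cutoff C is the 95th percentile
C <- qnorm(0.95, mean = 30, sd = 1)
C
```

This returns 31.645, matching the hand calculation above.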

Discussion

  • In general we don't convert \(C\) back to the original scale
  • We would just reject because the Z-score, which is how many standard errors the sample mean is above the hypothesized mean,
\[
\frac{32 - 30}{10 / \sqrt{100}} = 2
\]
is greater than \(1.645\)
  • Or, whenever \(\sqrt{n} (\bar X - \mu_0) / s > Z_{1-\alpha}\)

General rules

  • The \(Z\) test for \(H_0:\mu = \mu_0\) versus
      • \(H_1: \mu < \mu_0\)
      • \(H_2: \mu \neq \mu_0\)
      • \(H_3: \mu > \mu_0\)
  • Test statistic \(TS = \frac{\bar{X} - \mu_0}{S / \sqrt{n}}\)
  • Reject the null hypothesis when
      • \(H_1\): \(TS \leq Z_{\alpha} = -Z_{1 - \alpha}\)
      • \(H_2\): \(|TS| \geq Z_{1 - \alpha / 2}\)
      • \(H_3\): \(TS \geq Z_{1 - \alpha}\)
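Applied to the RDI example, the one-sided rule looks like this (our own sketch, not lecture code):

```r
# Z test of H0: mu = 30 versus H3: mu > 30 at alpha = 0.05
mu0 <- 30; xbar <- 32; s <- 10; n <- 100
TS <- (xbar - mu0) / (s / sqrt(n))      # test statistic: 2
TS >= qnorm(1 - 0.05)                   # TRUE, so reject H0
```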

Notes

  • We have fixed \(\alpha\) to be low, so if we reject \(H_0\), either our model is wrong or there is a low probability that we have made an error
  • We have not fixed the probability of a type II error, \(\beta\); therefore we tend to say "Fail to reject \(H_0\)" rather than accepting \(H_0\)
  • Statistical significance is not the same as scientific significance
  • The region of TS values for which you reject \(H_0\) is called the rejection region

More notes

  • The \(Z\) test requires the assumptions of the CLT and for \(n\) to be large enough for it to apply
  • If \(n\) is small, then Gosset's \(T\) test is performed in exactly the same way, with the normal quantiles replaced by the appropriate Student's \(T\) quantiles and \(n-1\) df
  • The probability of rejecting the null hypothesis when it is false is called power
  • Power is used a lot to calculate sample sizes for experiments

Example reconsidered

  • Consider our example again. Suppose that \(n = 16\) (rather than \(100\))
  • The statistic
\[
\frac{\bar X - 30}{s / \sqrt{16}}
\]
follows a \(T\) distribution with 15 df under \(H_0\)
  • Under \(H_0\), the probability that it is larger than the 95th percentile of the \(T\) distribution is 5%
  • The 95th percentile of the T distribution with 15 df is 1.7531 (obtained via qt(.95, 15))
  • So our test statistic is now \(\sqrt{16}(32 - 30) / 10 = 0.8\)
  • We now fail to reject.

Two sided tests

  • Suppose that we would reject the null hypothesis if in fact the mean was too large or too small
  • That is, we want to test the alternative \(H_a : \mu \neq 30\)
  • We will reject if the test statistic, \(0.8\), is either too large or too small
  • Then we want the probability of rejecting under the null to be 5%, split equally as 2.5% in the upper tail and 2.5% in the lower tail
  • Thus we reject if our test statistic is larger than qt(.975, 15) or smaller than qt(.025, 15)
      • This is the same as saying: reject if the absolute value of our statistic is larger than qt(0.975, 15) = 2.1314
      • So we fail to reject the two sided test as well
      • (If you fail to reject the one sided test, you know that you will fail to reject the two sided)

T test in R

library(UsingR); data(father.son)
t.test(father.son$sheight - father.son$fheight)

> 
> 	One Sample t-test
> 
> data:  father.son$sheight - father.son$fheight
> t = 11.79, df = 1077, p-value < 2.2e-16
> alternative hypothesis: true mean is not equal to 0
> 95 percent confidence interval:
>  0.831 1.163
> sample estimates:
> mean of x 
>     0.997

Connections with confidence intervals

+
+
+
    +
  • Consider testing \(H_0: \mu = \mu_0\) versus \(H_a: \mu \neq \mu_0\)
  • +
  • Take the set of all values \(\mu_0\) for which you fail to reject \(H_0\); this set is a \((1-\alpha)100\%\) confidence interval for \(\mu\)
  • +
  • The same works in reverse; if a \((1-\alpha)100\%\) interval +contains \(\mu_0\), then we fail to reject \(H_0\)
  • +
+ +
+ +
+ + +
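A small R sketch of this inversion, using the earlier example's summaries (\(\bar X = 32\), \(s = 10\), \(n = 16\)):

```r
# Invert the two sided T test: the 95% interval is the set of mu0 we fail to reject
n <- 16; xbar <- 32; s <- 10; alpha <- 0.05
ci <- xbar + c(-1, 1) * qt(1 - alpha / 2, n - 1) * s / sqrt(n)
round(ci, 2)  # 26.67 37.33: contains 30, so we fail to reject H0: mu = 30
```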
+

Two group intervals

+
+
+
    +
  • You now know how to do two group T tests, +since we already covered independent group T intervals
  • +
  • Rejection rules are the same
  • +
  • Test \(H_0 : \mu_1 = \mu_2\)
  • +
  • Let's just go through an example
  • +
+ +
+ +
+ + +
+

ChickWeight data

+
+
+

Recall that we reformatted this data

+ +
library(datasets); data(ChickWeight); library(reshape2)
+##define weight gain or loss
+wideCW <- dcast(ChickWeight, Diet + Chick ~ Time, value.var = "weight")
+names(wideCW)[-(1 : 2)] <- paste("time", names(wideCW)[-(1 : 2)], sep = "")
+library(dplyr)
+wideCW <- mutate(wideCW,
+  gain = time21 - time0
+)
+
+ +
+ +
+ + +
+

Equal variance T test comparing diets 1 and 4

+
+
+
wideCW14 <- subset(wideCW, Diet %in% c(1, 4))
+t.test(gain ~ Diet, paired = FALSE, 
+       var.equal = TRUE, data = wideCW14)
+
+ +
>  
+>   Two Sample t-test
+>  
+>  data:  gain by Diet
+>  t = -2.725, df = 23, p-value = 0.01207
+>  alternative hypothesis: true difference in means is not equal to 0
+>  95 percent confidence interval:
+>   -108.15  -14.81
+>  sample estimates:
+>  mean in group 1 mean in group 4 
+>            136.2           197.7
+
+ +
+ +
+ + +
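Note that the code above sets var.equal = TRUE, a pooled equal variance test (hence the integer df = 23). Welch's unequal variance test sets var.equal = FALSE instead. A toy sketch on synthetic data (seed and sample sizes are illustrative, not the chick weights) contrasting the two:

```r
# Pooled vs Welch T tests on synthetic samples with unequal variances
set.seed(1)
x <- rnorm(16)                  # group 1: 16 observations
y <- rnorm(9, mean = 1, sd = 3) # group 2: 9 observations, larger spread
t.test(x, y, var.equal = TRUE)$parameter   # pooled df = 16 + 9 - 2 = 23
t.test(x, y, var.equal = FALSE)$parameter  # fractional Welch df, at most 23
```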
+

Exact binomial test

+
+
+
    +
  • Recall this problem: Suppose a friend has \(8\) children, \(7\) of whom are girls and none are twins
  • +
  • Perform the relevant hypothesis test: \(H_0 : p = 0.5\) versus \(H_a : p > 0.5\) + +
      +
    • What is the relevant rejection region so that the probability of rejecting is (less than) 5%?
    • +
  • +
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Rejection regionType I error rate
[0 : 8]1
[1 : 8]0.9961
[2 : 8]0.9648
[3 : 8]0.8555
[4 : 8]0.6367
[5 : 8]0.3633
[6 : 8]0.1445
[7 : 8]0.0352
[8 : 8]0.0039
+ +
+ +
+ + +
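The table can be reproduced in R with pbinom (a sketch, consistent with the values shown):

```r
# Type I error rate of rejection region [j : 8] under H0: p = 0.5, for j = 0, ..., 8
rates <- pbinom(0:8 - 1, size = 8, prob = 0.5, lower.tail = FALSE)  # P(X >= j)
round(rates, 4)  # 1.0000 0.9961 0.9648 0.8555 0.6367 0.3633 0.1445 0.0352 0.0039
```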
+

Notes

+
+
+
    +
  • It's impossible to get an exact 5% level test for this case due to the discreteness of the binomial. + +
      +
    • The closest is the rejection region [7 : 8]
    • +
    • Any alpha level lower than 0.0039 is not attainable.
    • +
  • +
  • For larger sample sizes, we could do a normal approximation, but you already knew this.
  • +
  • The two sided test isn't obvious. + +
      +
    • Given a way to do two sided tests, we could take the set of values of \(p_0\) for which we fail to reject to get an exact binomial confidence interval (called the Clopper/Pearson interval, BTW)
    • +
  • +
  • For these problems, people typically report a P-value (next lecture) rather than computing the rejection region.
  • +
+ +
+ +
+ + +
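R's built-in binom.test performs this exact test directly; its one sided P-value equals the [7 : 8] region's error rate from the table, and its confidence interval is the Clopper/Pearson interval mentioned above:

```r
# Exact one sided binomial test: 7 girls out of 8 births under H0: p = 0.5
binom.test(7, 8, p = 0.5, alternative = "greater")$p.value  # 0.03516
```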
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/09_HT/index.md b/06_StatisticalInference/09_HT/index.md new file mode 100644 index 000000000..f357d7a14 --- /dev/null +++ b/06_StatisticalInference/09_HT/index.md @@ -0,0 +1,272 @@ +--- +title : Hypothesis testing +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Hypothesis testing +* Hypothesis testing is concerned with making decisions using data +* A null hypothesis is specified that represents the status quo, + usually labeled $H_0$ +* The null hypothesis is assumed true and statistical evidence is required + to reject it in favor of a research or alternative hypothesis + +--- +## Example +* A respiratory disturbance index of more than $30$ events / hour, say, is + considered evidence of severe sleep disordered breathing (SDB). +* Suppose that in a sample of $100$ overweight subjects with other + risk factors for sleep disordered breathing at a sleep clinic, the + mean RDI was $32$ events / hour with a standard deviation of $10$ events / hour. +* We might want to test the hypothesis that + * $H_0 : \mu = 30$ + * $H_a : \mu > 30$ + * where $\mu$ is the population mean RDI. 
+ +--- +## Hypothesis testing +* The alternative hypotheses are typically of the form $<$, $>$ or $\neq$ +* Note that there are four possible outcomes of our statistical decision process + +Truth | Decide | Result | +---|---|---| +$H_0$ | $H_0$ | Correctly accept null | +$H_0$ | $H_a$ | Type I error | +$H_a$ | $H_a$ | Correctly reject null | +$H_a$ | $H_0$ | Type II error | + +--- +## Discussion +* Consider a court of law; the null hypothesis is that the + defendant is innocent +* We require a standard on the available evidence to reject the null hypothesis (convict) +* If we set a low standard, then we would increase the + percentage of innocent people convicted (type I errors); however we + would also increase the percentage of guilty people convicted + (correctly rejecting the null) +* If we set a high standard, then we increase the the + percentage of innocent people let free (correctly accepting the + null) while we would also increase the percentage of guilty people + let free (type II errors) + +--- +## Example +* Consider our sleep example again +* A reasonable strategy would reject the null hypothesis if + $\bar X$ was larger than some constant, say $C$ +* Typically, $C$ is chosen so that the probability of a Type I + error, $\alpha$, is $.05$ (or some other relevant constant) +* $\alpha$ = Type I error rate = Probability of rejecting the null hypothesis when, in fact, the null hypothesis is correct + +--- +## Example continued +- Standard error of the mean $10 / \sqrt{100} = 1$ +- Under $H_0$ $\bar X \sim N(30, 1)$ +- We want to chose $C$ so that the $P(\bar X > C; H_0)$ is +5% +- The 95th percentile of a normal distribution is 1.645 +standard deviations from the mean +- If $C = 30 + 1 \times 1.645 = 31.645$ + - Then the probability that a $N(30, 1)$ is larger + than it is 5% + - So the rule "Reject $H_0$ when $\bar X \geq 31.645$" + has the property that the probability of rejection + is 5% when $H_0$ is true (for the $\mu_0$, $\sigma$ + and $n$ given) + 
+ +--- +## Discussion +* In general we don't convert $C$ back to the original scale +* We would just reject because the Z-score; which is how many + standard errors the sample mean is above the hypothesized mean + $$ + \frac{32 - 30}{10 / \sqrt{100}} = 2 + $$ + is greater than $1.645$ +* Or, whenever $\sqrt{n} (\bar X - \mu_0) / s > Z_{1-\alpha}$ + +--- +## General rules +* The $Z$ test for $H_0:\mu = \mu_0$ versus + * $H_1: \mu < \mu_0$ + * $H_2: \mu \neq \mu_0$ + * $H_3: \mu > \mu_0$ +* Test statistic $ TS = \frac{\bar{X} - \mu_0}{S / \sqrt{n}} $ +* Reject the null hypothesis when + * $TS \leq Z_{\alpha} = -Z_{1 - \alpha}$ + * $|TS| \geq Z_{1 - \alpha / 2}$ + * $TS \geq Z_{1 - \alpha}$ + +--- +## Notes +* We have fixed $\alpha$ to be low, so if we reject $H_0$ (either + our model is wrong) or there is a low probability that we have made + an error +* We have not fixed the probability of a type II error, $\beta$; + therefore we tend to say ``Fail to reject $H_0$'' rather than + accepting $H_0$ +* Statistical significance is no the same as scientific + significance +* The region of TS values for which you reject $H_0$ is called the + rejection region + +--- +## More notes +* The $Z$ test requires the assumptions of the CLT and for $n$ to be large enough + for it to apply +* If $n$ is small, then a Gossett's $T$ test is performed exactly in the same way, + with the normal quantiles replaced by the appropriate Student's $T$ quantiles and + $n-1$ df +* The probability of rejecting the null hypothesis when it is false is called *power* +* Power is a used a lot to calculate sample sizes for experiments + +--- +## Example reconsidered +- Consider our example again. 
Suppose that $n= 16$ (rather than +$100$) +- The statistic +$$ +\frac{\bar X - 30}{s / \sqrt{16}} +$$ +follows a $T$ distribution with 15 df under $H_0$ +- Under $H_0$, the probability that it is larger +that the 95th percentile of the $T$ distribution is 5% +- The 95th percentile of the T distribution with 15 +df is 1.7531 (obtained via `qt(.95, 15)`) +- So that our test statistic is now $\sqrt{16}(32 - 30) / 10 = 0.8 $ +- We now fail to reject. + +--- +## Two sided tests +* Suppose that we would reject the null hypothesis if in fact the mean was too large or too small +* That is, we want to test the alternative $H_a : \mu \neq 30$ +* We will reject if the test statistic, $0.8$, is either too large or too small +* Then we want the probability of rejecting under the +null to be 5%, split equally as 2.5% in the upper +tail and 2.5% in the lower tail +* Thus we reject if our test statistic is larger +than `qt(.975, 15)` or smaller than `qt(.025, 15)` + * This is the same as saying: reject if the + absolute value of our statistic is larger than + `qt(0.975, 15)` = 2.1314 + * So we fail to reject the two sided test as well + * (If you fail to reject the one sided test, you + know that you will fail to reject the two sided) + +--- +## T test in R + +```r +library(UsingR); data(father.son) +t.test(father.son$sheight - father.son$fheight) +``` + +``` +> +> One Sample t-test +> +> data: father.son$sheight - father.son$fheight +> t = 11.79, df = 1077, p-value < 2.2e-16 +> alternative hypothesis: true mean is not equal to 0 +> 95 percent confidence interval: +> 0.831 1.163 +> sample estimates: +> mean of x +> 0.997 +``` + +--- +## Connections with confidence intervals +* Consider testing $H_0: \mu = \mu_0$ versus $H_a: \mu \neq \mu_0$ +* Take the set of all possible values for which you fail to reject $H_0$, this set is a $(1-\alpha)100\%$ confidence interval for $\mu$ +* The same works in reverse; if a $(1-\alpha)100\%$ interval + contains $\mu_0$, then we *fail to* reject 
$H_0$ + +--- +## Two group intervals +- First, now you know how to do two group T tests +since we already covered indepedent group T intervals +- Rejection rules are the same +- Test $H_0 : \mu_1 = \mu_2$ +- Let's just go through an example + +--- +## `chickWeight` data +Recall that we reformatted this data + +```r +library(datasets); data(ChickWeight); library(reshape2) +##define weight gain or loss +wideCW <- dcast(ChickWeight, Diet + Chick ~ Time, value.var = "weight") +names(wideCW)[-(1 : 2)] <- paste("time", names(wideCW)[-(1 : 2)], sep = "") +library(dplyr) +wideCW <- mutate(wideCW, + gain = time21 - time0 +) +``` + +--- +### Unequal variance T test comparing diets 1 and 4 + +```r +wideCW14 <- subset(wideCW, Diet %in% c(1, 4)) +t.test(gain ~ Diet, paired = FALSE, + var.equal = TRUE, data = wideCW14) +``` + +``` +> +> Two Sample t-test +> +> data: gain by Diet +> t = -2.725, df = 23, p-value = 0.01207 +> alternative hypothesis: true difference in means is not equal to 0 +> 95 percent confidence interval: +> -108.15 -14.81 +> sample estimates: +> mean in group 1 mean in group 4 +> 136.2 197.7 +``` + + + +--- +## Exact binomial test +- Recall this problem, *Suppose a friend has $8$ children, $7$ of which are girls and none are twins* +- Perform the relevant hypothesis test. $H_0 : p = 0.5$ $H_a : p > 0.5$ + - What is the relevant rejection region so that the probability of rejecting is (less than) 5%? + +Rejection region | Type I error rate | +---|---| +[0 : 8] | 1 +[1 : 8] | 0.9961 +[2 : 8] | 0.9648 +[3 : 8] | 0.8555 +[4 : 8] | 0.6367 +[5 : 8] | 0.3633 +[6 : 8] | 0.1445 +[7 : 8] | 0.0352 +[8 : 8] | 0.0039 + +--- +## Notes +* It's impossible to get an exact 5% level test for this case due to the discreteness of the binomial. + * The closest is the rejection region [7 : 8] + * Any alpha level lower than 0.0039 is not attainable. +* For larger sample sizes, we could do a normal approximation, but you already knew this. +* Two sided test isn't obvious. 
+ * Given a way to do two sided tests, we could take the set of values of $p_0$ for which we fail to reject to get an exact binomial confidence interval (called the Clopper/Pearson interval, BTW) +* For these problems, people always create a P-value (next lecture) rather than computing the rejection region. + + diff --git a/06_StatisticalInference/09_HT/index.pdf b/06_StatisticalInference/09_HT/index.pdf new file mode 100644 index 000000000..9ed5b7d41 Binary files /dev/null and b/06_StatisticalInference/09_HT/index.pdf differ diff --git a/06_StatisticalInference/03_02_HypothesisTesting/lecture1.tex b/06_StatisticalInference/09_HT/lecture1.tex similarity index 100% rename from 06_StatisticalInference/03_02_HypothesisTesting/lecture1.tex rename to 06_StatisticalInference/09_HT/lecture1.tex diff --git a/06_StatisticalInference/03_03_pValues/P-values.pdf b/06_StatisticalInference/10_pValues/P-values.pdf similarity index 100% rename from 06_StatisticalInference/03_03_pValues/P-values.pdf rename to 06_StatisticalInference/10_pValues/P-values.pdf diff --git a/06_StatisticalInference/03_03_pValues/data/quakesRaw.rda b/06_StatisticalInference/10_pValues/data/quakesRaw.rda similarity index 100% rename from 06_StatisticalInference/03_03_pValues/data/quakesRaw.rda rename to 06_StatisticalInference/10_pValues/data/quakesRaw.rda diff --git a/06_StatisticalInference/03_03_pValues/fig/galton.png b/06_StatisticalInference/10_pValues/fig/galton.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/galton.png rename to 06_StatisticalInference/10_pValues/fig/galton.png diff --git a/06_StatisticalInference/03_03_pValues/fig/loadGalton.png b/06_StatisticalInference/10_pValues/fig/loadGalton.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/loadGalton.png rename to 06_StatisticalInference/10_pValues/fig/loadGalton.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-1.png 
b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-1.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-1.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-1.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-10.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-10.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-10.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-10.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-101.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-101.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-101.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-101.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-102.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-102.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-102.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-102.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-11.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-11.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-11.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-11.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-12.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-12.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-12.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-12.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-13.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-13.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-13.png 
rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-13.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-14.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-14.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-14.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-14.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-15.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-15.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-15.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-15.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-16.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-16.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-16.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-16.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-17.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-17.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-17.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-17.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-18.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-18.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-18.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-18.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-19.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-19.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-19.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-19.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-2.png 
b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-2.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-2.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-2.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-20.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-20.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-20.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-20.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-21.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-21.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-21.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-21.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-22.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-22.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-22.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-22.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-23.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-23.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-23.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-23.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-24.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-24.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-24.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-24.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-3.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-3.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-3.png rename to 
06_StatisticalInference/10_pValues/fig/unnamed-chunk-3.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-4.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-4.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-4.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-4.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-5.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-5.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-5.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-5.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-6.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-6.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-6.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-6.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-7.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-7.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-7.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-7.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-8.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-8.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-8.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-8.png diff --git a/06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-9.png b/06_StatisticalInference/10_pValues/fig/unnamed-chunk-9.png similarity index 100% rename from 06_StatisticalInference/03_03_pValues/fig/unnamed-chunk-9.png rename to 06_StatisticalInference/10_pValues/fig/unnamed-chunk-9.png diff --git a/06_StatisticalInference/10_pValues/index.Rmd b/06_StatisticalInference/10_pValues/index.Rmd new file mode 100644 index 
000000000..36ce9f492 --- /dev/null +++ b/06_StatisticalInference/10_pValues/index.Rmd @@ -0,0 +1,93 @@ +--- +title : P-values +subtitle : Statistical inference +author : Brian Caffo, Jeffrey Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## P-values + +* Most common measure of statistical significance +* Their ubiquity, along with concern over their interpretation and use + makes them controversial among statisticians + * [http://warnercnr.colostate.edu/~anderson/thompson1.html](http://warnercnr.colostate.edu/~anderson/thompson1.html) + * Also see *Statistical Evidence: A Likelihood Paradigm* by Richard Royall + * *Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy* by Steve Goodman + * The hilariously titled: *The Earth is Round (p < .05)* by Cohen. +* Some positive comments + * [simply statistics](http://simplystatistics.org/2012/01/06/p-values-and-hypothesis-testing-get-a-bad-rap-but-we/) + * [normal deviate](http://normaldeviate.wordpress.com/2013/03/14/double-misunderstandings-about-p-values/) + * [Error statistics](http://errorstatistics.com/2013/06/14/p-values-cant-be-trusted-except-when-used-to-argue-that-p-values-cant-be-trusted/) + +--- + + +## What is a P-value? + +__Idea__: Suppose nothing is going on - how unusual is it to see the estimate we got? + +__Approach__: + +1. Define the hypothetical distribution of a data summary (statistic) when "nothing is going on" (_null hypothesis_) +2. Calculate the summary/statistic with the data we have (_test statistic_) +3. 
Compare what we calculated to our hypothetical distribution and see if the value is "extreme" (_p-value_) + +--- +## P-values +* The P-value is the probability under the null hypothesis of obtaining evidence as extreme or more extreme than that obtained +* If the P-value is small, then either $H_0$ is true and we have observed a rare event or $H_0$ is false +* Suppos that you get a $T$ statistic of $2.5$ for 15 df testing $H_0:\mu = \mu_0$ +versus $H_a : \mu > \mu_0$. + * What's the probability of getting a $T$ statistic as large as $2.5$? +```{r} +pt(2.5, 15, lower.tail = FALSE) +``` +* Therefore, the probability of seeing evidence as extreme or more extreme than that actually obtained under $H_0$ is `r pt(2.5, 15, lower.tail = FALSE)` + +--- +## The attained significance level +* Our test statistic was $2$ for $H_0 : \mu_0 = 30$ versus $H_a:\mu > 30$. +* Notice that we rejected the one sided test when $\alpha = 0.05$, would we reject if $\alpha = 0.01$, how about $0.001$? +* The smallest value for alpha that you still reject the null hypothesis is called the *attained significance level* +* This is equivalent, but philosophically a little different from, the *P-value* + +--- +## Notes +* By reporting a P-value the reader can perform the hypothesis + test at whatever $\alpha$ level he or she choses +* If the P-value is less than $\alpha$ you reject the null hypothesis +* For two sided hypothesis test, double the smaller of the two one + sided hypothesis test Pvalues + +--- +## Revisiting an earlier example +- Suppose a friend has $8$ children, $7$ of which are girls and none are twins +- If each gender has an independent $50$% probability for each birth, what's the probability of getting $7$ or more girls out of $8$ births? 
+```{r} +choose(8, 7) * .5 ^ 8 + choose(8, 8) * .5 ^ 8 +pbinom(6, size = 8, prob = .5, lower.tail = FALSE) +``` + +--- +## Poisson example +- Suppose that a hospital has an infection rate of 10 infections per 100 person/days at risk (rate of 0.1) during the last monitoring period. +- Assume that an infection rate of 0.05 is an important benchmark. +- Given the model, could the observed rate being larger than 0.05 be attributed to chance? +- Under $H_0: \lambda = 0.05$ so that $\lambda_0 100 = 5$ +- Consider $H_a: \lambda > 0.05$. + +```{r} +ppois(9, 5, lower.tail = FALSE) +``` + + + diff --git a/06_StatisticalInference/10_pValues/index.html b/06_StatisticalInference/10_pValues/index.html new file mode 100644 index 000000000..2ea2aa210 --- /dev/null +++ b/06_StatisticalInference/10_pValues/index.html @@ -0,0 +1,281 @@ + + + + P-values + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

P-values

+

Statistical inference

+

Brian Caffo, Jeffrey Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

P-values

+
+
+
    +
  • Most common measure of statistical significance
  • +
  • Their ubiquity, along with concern over their interpretation and use +makes them controversial among statisticians + +
      +
    • http://warnercnr.colostate.edu/~anderson/thompson1.html
    • +
    • Also see Statistical Evidence: A Likelihood Paradigm by Richard Royall
    • +
    • Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy by Steve Goodman
    • +
    • The hilariously titled: The Earth is Round (p < .05) by Cohen.
    • +
  • +
  • Some positive comments + +
  • +
+ +
+ +
+ + +
+

What is a P-value?

+
+
+

Idea: Suppose nothing is going on - how unusual is it to see the estimate we got?

+ +

Approach:

+ +
    +
  1. Define the hypothetical distribution of a data summary (statistic) when "nothing is going on" (null hypothesis)
  2. +
  3. Calculate the summary/statistic with the data we have (test statistic)
  4. +
  5. Compare what we calculated to our hypothetical distribution and see if the value is "extreme" (p-value)
  6. +
+ +
+ +
+ + +
+

P-values

+
+
+
    +
  • The P-value is the probability under the null hypothesis of obtaining evidence as extreme or more extreme than that obtained
  • +
  • If the P-value is small, then either \(H_0\) is true and we have observed a rare event or \(H_0\) is false
  • +
  • Suppose that you get a \(T\) statistic of \(2.5\) for 15 df testing \(H_0:\mu = \mu_0\) +versus \(H_a : \mu > \mu_0\). + +
      +
    • What's the probability of getting a \(T\) statistic as large as \(2.5\)?
    • +
  • +
+ +
pt(2.5, 15, lower.tail = FALSE)
+
+ +
## [1] 0.01225
+
+ +
    +
  • Therefore, the probability of seeing evidence as extreme or more extreme than that actually obtained under \(H_0\) is 0.0123
  • +
+ +
+ +
+ + +
+

The attained significance level

+
+
+
    +
  • Our test statistic was \(2\) for \(H_0 : \mu = 30\) versus \(H_a:\mu > 30\).
  • +
  • Notice that we rejected the one sided test when \(\alpha = 0.05\); would we reject if \(\alpha = 0.01\)? How about \(0.001\)?
  • +
  • The smallest value of \(\alpha\) for which you still reject the null hypothesis is called the attained significance level
  • +
  • This is equivalent to, but philosophically a little different from, the P-value
  • +
+ +
+ +
+ + +
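A sketch of the computation, using the Z statistic of 2 from the earlier example:

```r
# Attained significance level: smallest alpha at which the one sided Z test rejects
pnorm(2, lower.tail = FALSE)  # about 0.0228: reject at alpha = 0.05, not at 0.01
```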
+

Notes

+
+
+
    +
  • By reporting a P-value the reader can perform the hypothesis +test at whatever \(\alpha\) level he or she chooses
  • +
  • If the P-value is less than \(\alpha\) you reject the null hypothesis
  • +
  • For a two sided hypothesis test, double the smaller of the two one +sided hypothesis test P-values
  • +
+ +
+ +
+ + +
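A sketch of the doubling rule, applied to the earlier \(T\) statistic of 2.5 with 15 df:

```r
# Two sided P-value: double the smaller of the two one sided P-values
p_one <- pt(2.5, 15, lower.tail = FALSE)  # 0.01225
2 * min(p_one, 1 - p_one)                 # about 0.0245
```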
+

Revisiting an earlier example

+
+
+
    +
  • Suppose a friend has \(8\) children, \(7\) of which are girls and none are twins
  • +
  • If each gender has an independent \(50\)% probability for each birth, what's the probability of getting \(7\) or more girls out of \(8\) births?
  • +
+ +
choose(8, 7) * 0.5^8 + choose(8, 8) * 0.5^8
+
+ +
## [1] 0.03516
+
+ +
pbinom(6, size = 8, prob = 0.5, lower.tail = FALSE)
+
+ +
## [1] 0.03516
+
+ +
+ +
+ + +
+

Poisson example

+
+
+
    +
  • Suppose that a hospital has an infection rate of 10 infections per 100 person-days at risk (a rate of 0.1) during the last monitoring period.
  • +
  • Assume that an infection rate of 0.05 is an important benchmark.
  • +
  • Given the model, could an observed rate larger than 0.05 be attributed to chance?
  • +
  • Under \(H_0: \lambda = 0.05\), so that \(100 \lambda_0 = 5\)
  • +
  • Consider \(H_a: \lambda > 0.05\).
  • +
+ +
ppois(9, 5, lower.tail = FALSE)
+
+ +
## [1] 0.03183
+
+ +
+ +
+ + +
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/10_pValues/index.md b/06_StatisticalInference/10_pValues/index.md new file mode 100644 index 000000000..10d09758f --- /dev/null +++ b/06_StatisticalInference/10_pValues/index.md @@ -0,0 +1,118 @@ +--- +title : P-values +subtitle : Statistical inference +author : Brian Caffo, Jeffrey Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## P-values + +* Most common measure of statistical significance +* Their ubiquity, along with concern over their interpretation and use + makes them controversial among statisticians + * [http://warnercnr.colostate.edu/~anderson/thompson1.html](http://warnercnr.colostate.edu/~anderson/thompson1.html) + * Also see *Statistical Evidence: A Likelihood Paradigm* by Richard Royall + * *Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy* by Steve Goodman + * The hilariously titled: *The Earth is Round (p < .05)* by Cohen. +* Some positive comments + * [simply statistics](http://simplystatistics.org/2012/01/06/p-values-and-hypothesis-testing-get-a-bad-rap-but-we/) + * [normal deviate](http://normaldeviate.wordpress.com/2013/03/14/double-misunderstandings-about-p-values/) + * [Error statistics](http://errorstatistics.com/2013/06/14/p-values-cant-be-trusted-except-when-used-to-argue-that-p-values-cant-be-trusted/) + +--- + + +## What is a P-value? + +__Idea__: Suppose nothing is going on - how unusual is it to see the estimate we got? + +__Approach__: + +1. Define the hypothetical distribution of a data summary (statistic) when "nothing is going on" (_null hypothesis_) +2. 
Calculate the summary/statistic with the data we have (_test statistic_) +3. Compare what we calculated to our hypothetical distribution and see if the value is "extreme" (_p-value_) + +--- +## P-values +* The P-value is the probability under the null hypothesis of obtaining evidence as extreme or more extreme than that obtained +* If the P-value is small, then either $H_0$ is true and we have observed a rare event or $H_0$ is false +* Suppose that you get a $T$ statistic of $2.5$ for 15 df testing $H_0:\mu = \mu_0$ +versus $H_a : \mu > \mu_0$. + * What's the probability of getting a $T$ statistic as large as $2.5$? + +```r +pt(2.5, 15, lower.tail = FALSE) +``` + +``` +## [1] 0.01225 +``` + +* Therefore, the probability of seeing evidence as extreme or more extreme than that actually obtained under $H_0$ is 0.0123 + +--- +## The attained significance level +* Our test statistic was $2$ for $H_0 : \mu_0 = 30$ versus $H_a:\mu > 30$. +* Notice that we rejected the one sided test when $\alpha = 0.05$. Would we reject if $\alpha = 0.01$? How about $0.001$? +* The smallest value of $\alpha$ for which you still reject the null hypothesis is called the *attained significance level* +* This is equivalent to, but philosophically a little different from, the *P-value* + +--- +## Notes +* By reporting a P-value the reader can perform the hypothesis + test at whatever $\alpha$ level he or she chooses +* If the P-value is less than $\alpha$ you reject the null hypothesis +* For a two sided hypothesis test, double the smaller of the two one + sided hypothesis test P-values + +--- +## Revisiting an earlier example +- Suppose a friend has $8$ children, $7$ of which are girls and none are twins +- If each gender has an independent $50$% probability for each birth, what's the probability of getting $7$ or more girls out of $8$ births? 
+ +```r +choose(8, 7) * 0.5^8 + choose(8, 8) * 0.5^8 +``` + +``` +## [1] 0.03516 +``` + +```r +pbinom(6, size = 8, prob = 0.5, lower.tail = FALSE) +``` + +``` +## [1] 0.03516 +``` + + +--- +## Poisson example +- Suppose that a hospital has an infection rate of 10 infections per 100 person-days at risk (rate of 0.1) during the last monitoring period. +- Assume that an infection rate of 0.05 is an important benchmark. +- Given the model, could the observed rate, which is larger than 0.05, be attributed to chance? +- Under $H_0: \lambda = 0.05$, so that $100 \lambda_0 = 5$ +- Consider $H_a: \lambda > 0.05$. + + +```r +ppois(9, 5, lower.tail = FALSE) +``` + +``` +## [1] 0.03183 +``` + + + + diff --git a/06_StatisticalInference/10_pValues/index.pdf b/06_StatisticalInference/10_pValues/index.pdf new file mode 100644 index 000000000..ba31db25c Binary files /dev/null and b/06_StatisticalInference/10_pValues/index.pdf differ diff --git a/06_StatisticalInference/11_Power/assets/fig/unnamed-chunk-2.png b/06_StatisticalInference/11_Power/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..1e08993c6 Binary files /dev/null and b/06_StatisticalInference/11_Power/assets/fig/unnamed-chunk-2.png differ diff --git a/06_StatisticalInference/03_04_Power/fig/unnamed-chunk-2.png b/06_StatisticalInference/11_Power/fig/unnamed-chunk-2.png similarity index 100% rename from 06_StatisticalInference/03_04_Power/fig/unnamed-chunk-2.png rename to 06_StatisticalInference/11_Power/fig/unnamed-chunk-2.png diff --git a/06_StatisticalInference/11_Power/index.Rmd b/06_StatisticalInference/11_Power/index.Rmd new file mode 100644 index 000000000..3b597112e --- /dev/null +++ b/06_StatisticalInference/11_Power/index.Rmd @@ -0,0 +1,160 @@ +--- +title : Power +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} 
+highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Power +- Power is the probability of rejecting the null hypothesis when it is false +- Ergo, power (as its name would suggest) is a good thing; you want more power +- A type II error (a bad thing, as its name would suggest) is failing to reject the null hypothesis when it's false; the probability of a type II error is usually called $\beta$ +- Note Power $= 1 - \beta$ + +--- +## Notes +- Consider our previous example involving RDI +- $H_0: \mu = 30$ versus $H_a: \mu > 30$ +- Then power is +$$P\left(\frac{\bar X - 30}{s /\sqrt{n}} > t_{1-\alpha,n-1} ~;~ \mu = \mu_a \right)$$ +- Note that this is a function that depends on the specific value of $\mu_a$! +- Notice as $\mu_a$ approaches $30$ the power approaches $\alpha$ + + +--- +## Calculating power for Gaussian data +- We reject if $\frac{\bar X - 30}{\sigma /\sqrt{n}} > z_{1-\alpha}$ + - Equivalently if $\bar X > 30 + Z_{1-\alpha} \frac{\sigma}{\sqrt{n}}$ +- Under $H_0 : \bar X \sim N(\mu_0, \sigma^2 / n)$ +- Under $H_a : \bar X \sim N(\mu_a, \sigma^2 / n)$ +- So we want +```{r, echo=TRUE,eval=FALSE} +alpha = 0.05 +z = qnorm(1 - alpha) +pnorm(mu0 + z * sigma / sqrt(n), mean = mua, sd = sigma / sqrt(n), + lower.tail = FALSE) +``` + +--- +## Example continued +- $\mu_a = 32$, $\mu_0 = 30$, $n = 16$, $\sigma = 4$ +```{r, echo=TRUE,eval=TRUE} +alpha = 0.05; mu0 = 30; mua = 32; sigma = 4; n = 16 +z = qnorm(1 - alpha) +pnorm(mu0 + z * sigma / sqrt(n), mean = mu0, sd = sigma / sqrt(n), + lower.tail = FALSE) +pnorm(mu0 + z * sigma / sqrt(n), mean = mua, sd = sigma / sqrt(n), + lower.tail = FALSE) +``` + +--- +## Plotting the power curve + +```{r, fig.align='center', fig.height=6, fig.width=12, echo=FALSE} +library(ggplot2) +nseq = c(8, 16, 32, 64, 128) +mua = seq(30, 35, by = 0.1) +z = 
qnorm(.95) +power = sapply(nseq, function(n) +pnorm(mu0 + z * sigma / sqrt(n), mean = mua, sd = sigma / sqrt(n), + lower.tail = FALSE) + ) +colnames(power) <- paste("n", nseq, sep = "") +d <- data.frame(mua, power) +library(reshape2) +d2 <- melt(d, id.vars = "mua") +names(d2) <- c("mua", "n", "power") +g <- ggplot(d2, + aes(x = mua, y = power, col = n)) + geom_line(size = 2) +g +``` + + +--- +## Graphical Depiction of Power +```{r, echo = TRUE, eval=FALSE} +library(manipulate) +mu0 = 30 +myplot <- function(sigma, mua, n, alpha){ + g = ggplot(data.frame(mu = c(27, 36)), aes(x = mu)) + g = g + stat_function(fun=dnorm, geom = "line", + args = list(mean = mu0, sd = sigma / sqrt(n)), + size = 2, col = "red") + g = g + stat_function(fun=dnorm, geom = "line", + args = list(mean = mua, sd = sigma / sqrt(n)), + size = 2, col = "blue") + xitc = mu0 + qnorm(1 - alpha) * sigma / sqrt(n) + g = g + geom_vline(xintercept=xitc, size = 3) + g +} +manipulate( + myplot(sigma, mua, n, alpha), + sigma = slider(1, 10, step = 1, initial = 4), + mua = slider(30, 35, step = 1, initial = 32), + n = slider(1, 50, step = 1, initial = 16), + alpha = slider(0.01, 0.1, step = 0.01, initial = 0.05) + ) + +``` + + +--- +## Question +- When testing $H_a : \mu > \mu_0$, notice if power is $1 - \beta$, then +$$1 - \beta = P\left(\bar X > \mu_0 + z_{1-\alpha} \frac{\sigma}{\sqrt{n}} ; \mu = \mu_a \right)$$ +- where $\bar X \sim N(\mu_a, \sigma^2 / n)$ +- Unknowns: $\mu_a$, $\sigma$, $n$, $\beta$ +- Knowns: $\mu_0$, $\alpha$ +- Specify any 3 of the unknowns and you can solve for the remainder + +--- +## Notes +- The calculation for $H_a:\mu < \mu_0$ is similar +- For $H_a: \mu \neq \mu_0$ calculate the one sided power using + $\alpha / 2$ (this is only approximately right, it excludes the probability of + getting a large TS in the opposite direction of the truth) +- Power goes up as $\alpha$ gets larger +- Power of a one sided test is greater than the power of the + associated two sided test +- Power 
goes up as $\mu_1$ gets further away from $\mu_0$ +- Power goes up as $n$ goes up +- Power doesn't need $\mu_a$, $\sigma$ and $n$, instead only $\frac{\sqrt{n}(\mu_a - \mu_0)}{\sigma}$ + - The quantity $\frac{\mu_a - \mu_0}{\sigma}$ is called the effect size, the difference in the means in standard deviation units. + - Being unit free, it has some hope of interpretability across settings + +--- +## T-test power +- Consider calculating power for Gosset's $T$ test for our example +- The power is + $$ + P\left(\frac{\bar X - \mu_0}{S /\sqrt{n}} > t_{1-\alpha, n-1} ~;~ \mu = \mu_a \right) + $$ +- Calculating this requires the non-central t distribution. +- `power.t.test` does this very well + - Omit one of the arguments and it solves for it + +--- +## Example +```{r} +power.t.test(n = 16, delta = 2 / 4, sd=1, type = "one.sample", alt = "one.sided")$power +power.t.test(n = 16, delta = 2, sd=4, type = "one.sample", alt = "one.sided")$power +power.t.test(n = 16, delta = 100, sd=200, type = "one.sample", alt = "one.sided")$power +``` + +--- +## Example +```{r} +power.t.test(power = .8, delta = 2 / 4, sd=1, type = "one.sample", alt = "one.sided")$n +power.t.test(power = .8, delta = 2, sd=4, type = "one.sample", alt = "one.sided")$n +power.t.test(power = .8, delta = 100, sd=200, type = "one.sample", alt = "one.sided")$n +``` + diff --git a/06_StatisticalInference/11_Power/index.html b/06_StatisticalInference/11_Power/index.html new file mode 100644 index 000000000..725b505bf --- /dev/null +++ b/06_StatisticalInference/11_Power/index.html @@ -0,0 +1,404 @@ + + + + Power + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

Power

+

Statistical Inference

+

Brian Caffo, Jeff Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

Power

+
+
+
    +
  • Power is the probability of rejecting the null hypothesis when it is false
  • +
  • Ergo, power (as its name would suggest) is a good thing; you want more power
  • +
  • A type II error (a bad thing, as its name would suggest) is failing to reject the null hypothesis when it's false; the probability of a type II error is usually called \(\beta\)
  • +
  • Note Power \(= 1 - \beta\)
  • +
+ +
+ +
+ + +
+

Notes

+
+
+
    +
  • Consider our previous example involving RDI
  • +
  • \(H_0: \mu = 30\) versus \(H_a: \mu > 30\)
  • +
  • Then power is +\[P\left(\frac{\bar X - 30}{s /\sqrt{n}} > t_{1-\alpha,n-1} ~;~ \mu = \mu_a \right)\]
  • +
  • Note that this is a function that depends on the specific value of \(\mu_a\)!
  • +
  • Notice as \(\mu_a\) approaches \(30\) the power approaches \(\alpha\)
  • +
+ +
+ +
+ + +
+

Calculating power for Gaussian data

+
+
+
    +
  • We reject if \(\frac{\bar X - 30}{\sigma /\sqrt{n}} > z_{1-\alpha}\)
    + +
      +
    • Equivalently if \(\bar X > 30 + Z_{1-\alpha} \frac{\sigma}{\sqrt{n}}\)
    • +
  • +
  • Under \(H_0 : \bar X \sim N(\mu_0, \sigma^2 / n)\)
  • +
  • Under \(H_a : \bar X \sim N(\mu_a, \sigma^2 / n)\)
  • +
  • So we want
  • +
+ +
alpha = 0.05
+z = qnorm(1 - alpha)
+pnorm(mu0 + z * sigma / sqrt(n), mean = mua, sd = sigma / sqrt(n), 
+      lower.tail = FALSE)
+
+ +
+ +
+ + +
+

Example continued

+
+
+
    +
  • \(\mu_a = 32\), \(\mu_0 = 30\), \(n =16\), \(\sigma = 4\)
  • +
+ +
mu0 = 30; mua = 32; sigma = 4; n = 16
+z = qnorm(1 - alpha)
+
+ +
## Error: object 'alpha' not found
+
+ +
pnorm(mu0 + z * sigma / sqrt(n), mean = mu0, sd = sigma / sqrt(n), 
+      lower.tail = FALSE)
+
+ +
## Error: object 'z' not found
+
+ +
pnorm(mu0 + z * sigma / sqrt(n), mean = mua, sd = sigma / sqrt(n), 
+      lower.tail = FALSE)
+
+ +
## Error: object 'z' not found
+
+ +
+ +
+ + +
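The rendered chunk above stops with "object 'alpha' not found" because `alpha` is never defined inside that evaluated chunk (it only appears in the earlier `eval=FALSE` chunk). The intended numbers are easy to recover; below is a hedged stdlib-Python sketch of the same two `pnorm` calls, with every value (mu0 = 30, mua = 32, sigma = 4, n = 16, alpha = 0.05) taken from the slides themselves.

```python
from statistics import NormalDist  # exact normal cdf/quantile, no extra packages

mu0, mua, sigma, n, alpha = 30.0, 32.0, 4.0, 16, 0.05
se = sigma / n ** 0.5                               # standard error of the mean
crit = mu0 + NormalDist().inv_cdf(1 - alpha) * se   # rejection threshold under H0

# P(Xbar > crit | mu = mu0): the type I error rate, alpha by construction
size = 1 - NormalDist(mu0, se).cdf(crit)
# P(Xbar > crit | mu = mua): the power against the alternative mua
power = 1 - NormalDist(mua, se).cdf(crit)

print(round(size, 2), round(power, 2))  # 0.05 0.64
```

The first probability just reproduces alpha, confirming the threshold; the second is the power the slide is after.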
+

Plotting the power curve

+
+
+

plot of chunk unnamed-chunk-3

+ +
+ +
+ + +
+

Graphical Depiction of Power

+
+
+
library(manipulate)
+mu0 = 30
+myplot <- function(sigma, mua, n, alpha){
+    g = ggplot(data.frame(mu = c(27, 36)), aes(x = mu))
+    g = g + stat_function(fun=dnorm, geom = "line", 
+                          args = list(mean = mu0, sd = sigma / sqrt(n)), 
+                          size = 2, col = "red")
+    g = g + stat_function(fun=dnorm, geom = "line", 
+                          args = list(mean = mua, sd = sigma / sqrt(n)), 
+                          size = 2, col = "blue")
+    xitc = mu0 + qnorm(1 - alpha) * sigma / sqrt(n)
+    g = g + geom_vline(xintercept=xitc, size = 3)
+    g
+}
+manipulate(
+    myplot(sigma, mua, n, alpha),
+    sigma = slider(1, 10, step = 1, initial = 4),
+    mua = slider(30, 35, step = 1, initial = 32),
+    n = slider(1, 50, step = 1, initial = 16),
+    alpha = slider(0.01, 0.1, step = 0.01, initial = 0.05)
+    )
+
+ +
+ +
+ + +
+

Question

+
+
+
    +
  • When testing \(H_a : \mu > \mu_0\), notice if power is \(1 - \beta\), then +\[1 - \beta = P\left(\bar X > \mu_0 + z_{1-\alpha} \frac{\sigma}{\sqrt{n}} ; \mu = \mu_a \right)\]
  • +
  • where \(\bar X \sim N(\mu_a, \sigma^2 / n)\)
  • +
  • Unknowns: \(\mu_a\), \(\sigma\), \(n\), \(\beta\)
  • +
  • Knowns: \(\mu_0\), \(\alpha\)
  • +
  • Specify any 3 of the unknowns and you can solve for the remainder
  • +
+ +
+ +
+ + +
+

Notes

+
+
+
    +
  • The calculation for \(H_a:\mu < \mu_0\) is similar
  • +
  • For \(H_a: \mu \neq \mu_0\) calculate the one sided power using +\(\alpha / 2\) (this is only approximately right, it excludes the probability of +getting a large TS in the opposite direction of the truth)
  • +
  • Power goes up as \(\alpha\) gets larger
  • +
  • Power of a one sided test is greater than the power of the +associated two sided test
  • +
  • Power goes up as \(\mu_1\) gets further away from \(\mu_0\)
  • +
  • Power goes up as \(n\) goes up
  • +
  • Power doesn't need \(\mu_a\), \(\sigma\) and \(n\), instead only \(\frac{\sqrt{n}(\mu_a - \mu_0)}{\sigma}\) + +
      +
    • The quantity \(\frac{\mu_a - \mu_0}{\sigma}\) is called the effect size, the difference in the means in standard deviation units.
    • +
    • Being unit free, it has some hope of interpretability across settings
    • +
  • +
+ +
+ +
+ + +
+

T-test power

+
+
+
    +
  • Consider calculating power for Gosset's \(T\) test for our example
  • +
  • The power is +\[ +P\left(\frac{\bar X - \mu_0}{S /\sqrt{n}} > t_{1-\alpha, n-1} ~;~ \mu = \mu_a \right) +\]
  • +
  • Calculating this requires the non-central t distribution.
  • +
  • power.t.test does this very well + +
      +
    • Omit one of the arguments and it solves for it
    • +
  • +
+ +
+ +
+ + +
+

Example

+
+
+
power.t.test(n = 16, delta = 2 / 4, sd=1, type = "one.sample",  alt = "one.sided")$power
+
+ +
## [1] 0.604
+
+ +
power.t.test(n = 16, delta = 2, sd=4, type = "one.sample",  alt = "one.sided")$power
+
+ +
## [1] 0.604
+
+ +
power.t.test(n = 16, delta = 100, sd=200, type = "one.sample", alt = "one.sided")$power
+
+ +
## [1] 0.604
+
+ +
+ +
+ + +
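One way to check the effect-size remark from the Notes slide: all three calls above share delta/sd = 0.5, which is exactly why they return the same power. A hedged stdlib-Python sketch using the normal approximation (power.t.test itself uses the noncentral t distribution, so its 0.604 sits slightly below this approximation):

```python
from statistics import NormalDist

def approx_power(n, delta, sd, alpha=0.05):
    """One-sided, one-sample power under a normal approximation.
    power.t.test uses the noncentral t instead, so this runs slightly high."""
    z = NormalDist().inv_cdf(1 - alpha)
    shift = n ** 0.5 * delta / sd   # sqrt(n) * effect size is all that matters
    return 1 - NormalDist().cdf(z - shift)

# delta/sd = 0.5 in every call, so the three powers are identical
powers = [approx_power(16, d, s) for d, s in [(0.5, 1.0), (2.0, 4.0), (100.0, 200.0)]]
print([round(p, 3) for p in powers])  # [0.639, 0.639, 0.639]
```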
+

Example

+
+
+
power.t.test(power = .8, delta = 2 / 4, sd=1, type = "one.sample",  alt = "one.sided")$n
+
+ +
## [1] 26.14
+
+ +
power.t.test(power = .8, delta = 2, sd=4, type = "one.sample",  alt = "one.sided")$n
+
+ +
## [1] 26.14
+
+ +
power.t.test(power = .8, delta = 100, sd=200, type = "one.sample", alt = "one.sided")$n
+
+ +
## [1] 26.14
+
+ +
+ +
+ + +
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/11_Power/index.md b/06_StatisticalInference/11_Power/index.md new file mode 100644 index 000000000..71ed1ff8b --- /dev/null +++ b/06_StatisticalInference/11_Power/index.md @@ -0,0 +1,201 @@ +--- +title : Power +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## Power +- Power is the probability of rejecting the null hypothesis when it is false +- Ergo, power (as its name would suggest) is a good thing; you want more power +- A type II error (a bad thing, as its name would suggest) is failing to reject the null hypothesis when it's false; the probability of a type II error is usually called $\beta$ +- Note Power $= 1 - \beta$ + +--- +## Notes +- Consider our previous example involving RDI +- $H_0: \mu = 30$ versus $H_a: \mu > 30$ +- Then power is +$$P\left(\frac{\bar X - 30}{s /\sqrt{n}} > t_{1-\alpha,n-1} ~;~ \mu = \mu_a \right)$$ +- Note that this is a function that depends on the specific value of $\mu_a$! 
+- Notice as $\mu_a$ approaches $30$ the power approaches $\alpha$ + + +--- +## Calculating power for Gaussian data +- We reject if $\frac{\bar X - 30}{\sigma /\sqrt{n}} > z_{1-\alpha}$ + - Equivalently if $\bar X > 30 + Z_{1-\alpha} \frac{\sigma}{\sqrt{n}}$ +- Under $H_0 : \bar X \sim N(\mu_0, \sigma^2 / n)$ +- Under $H_a : \bar X \sim N(\mu_a, \sigma^2 / n)$ +- So we want + +```r +alpha = 0.05 +z = qnorm(1 - alpha) +pnorm(mu0 + z * sigma / sqrt(n), mean = mua, sd = sigma / sqrt(n), + lower.tail = FALSE) +``` + +--- +## Example continued +- $\mu_a = 32$, $\mu_0 = 30$, $n =16$, $\sigma = 4$ + +```r +mu0 = 30; mua = 32; sigma = 4; n = 16 +z = qnorm(1 - alpha) +``` + +``` +## Error: object 'alpha' not found +``` + +```r +pnorm(mu0 + z * sigma / sqrt(n), mean = mu0, sd = sigma / sqrt(n), + lower.tail = FALSE) +``` + +``` +## Error: object 'z' not found +``` + +```r +pnorm(mu0 + z * sigma / sqrt(n), mean = mua, sd = sigma / sqrt(n), + lower.tail = FALSE) +``` + +``` +## Error: object 'z' not found +``` + +--- +## Plotting the power curve + +plot of chunk unnamed-chunk-3 + + +--- +## Graphical Depiction of Power + +```r +library(manipulate) +mu0 = 30 +myplot <- function(sigma, mua, n, alpha){ + g = ggplot(data.frame(mu = c(27, 36)), aes(x = mu)) + g = g + stat_function(fun=dnorm, geom = "line", + args = list(mean = mu0, sd = sigma / sqrt(n)), + size = 2, col = "red") + g = g + stat_function(fun=dnorm, geom = "line", + args = list(mean = mua, sd = sigma / sqrt(n)), + size = 2, col = "blue") + xitc = mu0 + qnorm(1 - alpha) * sigma / sqrt(n) + g = g + geom_vline(xintercept=xitc, size = 3) + g +} +manipulate( + myplot(sigma, mua, n, alpha), + sigma = slider(1, 10, step = 1, initial = 4), + mua = slider(30, 35, step = 1, initial = 32), + n = slider(1, 50, step = 1, initial = 16), + alpha = slider(0.01, 0.1, step = 0.01, initial = 0.05) + ) +``` + + +--- +## Question +- When testing $H_a : \mu > \mu_0$, notice if power is $1 - \beta$, then +$$1 - \beta = P\left(\bar X > \mu_0 
+ z_{1-\alpha} \frac{\sigma}{\sqrt{n}} ; \mu = \mu_a \right)$$ +- where $\bar X \sim N(\mu_a, \sigma^2 / n)$ +- Unknowns: $\mu_a$, $\sigma$, $n$, $\beta$ +- Knowns: $\mu_0$, $\alpha$ +- Specify any 3 of the unknowns and you can solve for the remainder + +--- +## Notes +- The calculation for $H_a:\mu < \mu_0$ is similar +- For $H_a: \mu \neq \mu_0$ calculate the one sided power using + $\alpha / 2$ (this is only approximately right, it excludes the probability of + getting a large TS in the opposite direction of the truth) +- Power goes up as $\alpha$ gets larger +- Power of a one sided test is greater than the power of the + associated two sided test +- Power goes up as $\mu_1$ gets further away from $\mu_0$ +- Power goes up as $n$ goes up +- Power doesn't need $\mu_a$, $\sigma$ and $n$, instead only $\frac{\sqrt{n}(\mu_a - \mu_0)}{\sigma}$ + - The quantity $\frac{\mu_a - \mu_0}{\sigma}$ is called the effect size, the difference in the means in standard deviation units. + - Being unit free, it has some hope of interpretability across settings + +--- +## T-test power +- Consider calculating power for Gosset's $T$ test for our example +- The power is + $$ + P\left(\frac{\bar X - \mu_0}{S /\sqrt{n}} > t_{1-\alpha, n-1} ~;~ \mu = \mu_a \right) + $$ +- Calculating this requires the non-central t distribution. 
+- `power.t.test` does this very well + - Omit one of the arguments and it solves for it + +--- +## Example + +```r +power.t.test(n = 16, delta = 2 / 4, sd=1, type = "one.sample", alt = "one.sided")$power +``` + +``` +## [1] 0.604 +``` + +```r +power.t.test(n = 16, delta = 2, sd=4, type = "one.sample", alt = "one.sided")$power +``` + +``` +## [1] 0.604 +``` + +```r +power.t.test(n = 16, delta = 100, sd=200, type = "one.sample", alt = "one.sided")$power +``` + +``` +## [1] 0.604 +``` + +--- +## Example + +```r +power.t.test(power = .8, delta = 2 / 4, sd=1, type = "one.sample", alt = "one.sided")$n +``` + +``` +## [1] 26.14 +``` + +```r +power.t.test(power = .8, delta = 2, sd=4, type = "one.sample", alt = "one.sided")$n +``` + +``` +## [1] 26.14 +``` + +```r +power.t.test(power = .8, delta = 100, sd=200, type = "one.sample", alt = "one.sided")$n +``` + +``` +## [1] 26.14 +``` + diff --git a/06_StatisticalInference/11_Power/index.pdf b/06_StatisticalInference/11_Power/index.pdf new file mode 100644 index 000000000..d4ef53661 Binary files /dev/null and b/06_StatisticalInference/11_Power/index.pdf differ diff --git a/06_StatisticalInference/03_05_MultipleTesting/Multiple testing.pdf b/06_StatisticalInference/12_MultipleTesting/Multiple testing.pdf similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/Multiple testing.pdf rename to 06_StatisticalInference/12_MultipleTesting/Multiple testing.pdf diff --git a/06_StatisticalInference/12_MultipleTesting/assets/fig/unnamed-chunk-3.png b/06_StatisticalInference/12_MultipleTesting/assets/fig/unnamed-chunk-3.png new file mode 100644 index 000000000..556c3a44b Binary files /dev/null and b/06_StatisticalInference/12_MultipleTesting/assets/fig/unnamed-chunk-3.png differ diff --git a/06_StatisticalInference/03_05_MultipleTesting/data/cd4.data b/06_StatisticalInference/12_MultipleTesting/data/cd4.data similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/data/cd4.data rename to 
06_StatisticalInference/12_MultipleTesting/data/cd4.data diff --git a/06_StatisticalInference/03_05_MultipleTesting/data/movies.txt b/06_StatisticalInference/12_MultipleTesting/data/movies.txt similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/data/movies.txt rename to 06_StatisticalInference/12_MultipleTesting/data/movies.txt diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/datasources.png b/06_StatisticalInference/12_MultipleTesting/fig/datasources.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/datasources.png rename to 06_StatisticalInference/12_MultipleTesting/fig/datasources.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/example10pvals.png b/06_StatisticalInference/12_MultipleTesting/fig/example10pvals.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/example10pvals.png rename to 06_StatisticalInference/12_MultipleTesting/fig/example10pvals.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/galton.png b/06_StatisticalInference/12_MultipleTesting/fig/galton.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/galton.png rename to 06_StatisticalInference/12_MultipleTesting/fig/galton.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/jellybeans1.png b/06_StatisticalInference/12_MultipleTesting/fig/jellybeans1.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/jellybeans1.png rename to 06_StatisticalInference/12_MultipleTesting/fig/jellybeans1.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/jellybeans2.png b/06_StatisticalInference/12_MultipleTesting/fig/jellybeans2.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/jellybeans2.png rename to 06_StatisticalInference/12_MultipleTesting/fig/jellybeans2.png diff --git 
a/06_StatisticalInference/03_05_MultipleTesting/fig/lowess.png b/06_StatisticalInference/12_MultipleTesting/fig/lowess.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/lowess.png rename to 06_StatisticalInference/12_MultipleTesting/fig/lowess.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/significant.png b/06_StatisticalInference/12_MultipleTesting/fig/significant.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/significant.png rename to 06_StatisticalInference/12_MultipleTesting/fig/significant.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/splines.png b/06_StatisticalInference/12_MultipleTesting/fig/splines.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/splines.png rename to 06_StatisticalInference/12_MultipleTesting/fig/splines.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-1.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-1.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-1.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-1.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-10.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-10.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-10.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-10.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-101.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-101.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-101.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-101.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-102.png 
b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-102.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-102.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-102.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-11.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-11.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-11.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-11.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-12.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-12.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-12.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-12.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-13.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-13.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-13.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-13.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-14.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-14.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-14.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-14.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-15.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-15.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-15.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-15.png diff --git 
a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-16.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-16.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-16.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-16.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-17.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-17.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-17.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-17.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-18.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-18.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-18.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-18.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-19.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-19.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-19.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-19.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-2.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-2.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-2.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-2.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-20.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-20.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-20.png rename to 
06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-20.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-21.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-21.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-21.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-21.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-22.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-22.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-22.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-22.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-23.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-23.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-23.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-23.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-24.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-24.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-24.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-24.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-3.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-3.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-3.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-3.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-4.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-4.png similarity index 100% rename from 
06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-4.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-4.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-5.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-5.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-5.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-5.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-6.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-6.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-6.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-6.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-7.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-7.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-7.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-7.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-8.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-8.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-8.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-8.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-9.png b/06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-9.png similarity index 100% rename from 06_StatisticalInference/03_05_MultipleTesting/fig/unnamed-chunk-9.png rename to 06_StatisticalInference/12_MultipleTesting/fig/unnamed-chunk-9.png diff --git a/06_StatisticalInference/03_05_MultipleTesting/index.Rmd b/06_StatisticalInference/12_MultipleTesting/index.Rmd similarity index 90% rename from 
06_StatisticalInference/03_05_MultipleTesting/index.Rmd rename to 06_StatisticalInference/12_MultipleTesting/index.Rmd index 6c19901a0..4d5cc68a4 100644 --- a/06_StatisticalInference/03_05_MultipleTesting/index.Rmd +++ b/06_StatisticalInference/12_MultipleTesting/index.Rmd @@ -1,271 +1,253 @@ ---- -title : Multiple testing -subtitle : Statistical Inference -author : Brian Caffo, Jeffrey Leek, Roger Peng -job : Johns Hopkins Bloomberg School of Public Health -logo : bloomberg_shield.png -framework : io2012 # {io2012, html5slides, shower, dzslides, ...} -highlighter : highlight.js # {highlight.js, prettify, highlight} -hitheme : tomorrow # -url: - lib: ../../librariesNew - assets: ../../assets -widgets : [mathjax] # {mathjax, quiz, bootstrap} -mode : selfcontained # {standalone, draft} ---- - - -```{r setup, cache = F, echo = F, message = F, warning = F, tidy = F} -# make this an external chunk that can be included in any file -options(width = 100) -opts_chunk$set(message = F, error = F, warning = F, comment = NA, fig.align = 'center', dpi = 100, tidy = F, cache.path = '.cache/', fig.path = 'fig/') - -options(xtable.type = 'html') -knit_hooks$set(inline = function(x) { - if(is.numeric(x)) { - round(x, getOption('digits')) - } else { - paste(as.character(x), collapse = ', ') - } -}) -knit_hooks$set(plot = knitr:::hook_plot_html) -``` - -## Key ideas - -* Hypothesis testing/significance analysis is commonly overused -* Correcting for multiple testing avoids false positives or discoveries -* Two key components - * Error measure - * Correction - - ---- - -## Three eras of statistics - -__The age of Quetelet and his successors, in which huge census-level data sets were brought to bear on simple but important questions__: Are there more male than female births? Is the rate of insanity rising? 
- -The classical period of Pearson, Fisher, Neyman, Hotelling, and their successors, intellectual giants who __developed a theory of optimal inference capable of wringing every drop of information out of a scientific experiment__. The questions dealt with still tended to be simple Is treatment A better than treatment B? - -__The era of scientific mass production__, in which new technologies typified by the microarray allow a single team of scientists to produce data sets of a size Quetelet would envy. But now the flood of data is accompanied by a deluge of questions, perhaps thousands of estimates or hypothesis tests that the statistician is charged with answering together; not at all what the classical masters had in mind. Which variables matter among the thousands measured? How do you relate unrelated information? - -[http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf](http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf) - ---- - -## Reasons for multiple testing - - - - ---- - -## Why correct for multiple tests? - - - - -[http://xkcd.com/882/](http://xkcd.com/882/) - ---- - -## Why correct for multiple tests? - - - -[http://xkcd.com/882/](http://xkcd.com/882/) - - ---- - -## Types of errors - -Suppose you are testing a hypothesis that a parameter $\beta$ equals zero versus the alternative that it does not equal zero. These are the possible outcomes. -

- - | $\beta=0$ | $\beta\neq0$ | Hypotheses ---------------------|-------------|----------------|--------- -Claim $\beta=0$ | $U$ | $T$ | $m-R$ -Claim $\beta\neq 0$ | $V$ | $S$ | $R$ - Claims | $m_0$ | $m-m_0$ | $m$ - -

- -__Type I error or false positive ($V$)__ Say that the parameter does not equal zero when it does - -__Type II error or false negative ($T$)__ Say that the parameter equals zero when it doesn't - - ---- - -## Error rates - -__False positive rate__ - The rate at which false results ($\beta = 0$) are called significant: $E\left[\frac{V}{m_0}\right]$* - -__Family wise error rate (FWER)__ - The probability of at least one false positive ${\rm Pr}(V \geq 1)$ - -__False discovery rate (FDR)__ - The rate at which claims of significance are false $E\left[\frac{V}{R}\right]$ - -* The false positive rate is closely related to the type I error rate [http://en.wikipedia.org/wiki/False_positive_rate](http://en.wikipedia.org/wiki/False_positive_rate) - ---- - -## Controlling the false positive rate - -If P-values are correctly calculated calling all $P < \alpha$ significant will control the false positive rate at level $\alpha$ on average. - -Problem: Suppose that you perform 10,000 tests and $\beta = 0$ for all of them. - -Suppose that you call all $P < 0.05$ significant. - -The expected number of false positives is: $10,000 \times 0.05 = 500$ false positives. - -__How do we avoid so many false positives?__ - - ---- - -## Controlling family-wise error rate (FWER) - - -The [Bonferroni correction](http://en.wikipedia.org/wiki/Bonferroni_correction) is the oldest multiple testing correction. - -__Basic idea__: -* Suppose you do $m$ tests -* You want to control FWER at level $\alpha$ so $Pr(V \geq 1) < \alpha$ -* Calculate P-values normally -* Set $\alpha_{fwer} = \alpha/m$ -* Call all $P$-values less than $\alpha_{fwer}$ significant - -__Pros__: Easy to calculate, conservative -__Cons__: May be very conservative - - ---- - -## Controlling false discovery rate (FDR) - -This is the most popular correction when performing _lots_ of tests say in genomics, imaging, astronomy, or other signal-processing disciplines. 
- -__Basic idea__: -* Suppose you do $m$ tests -* You want to control FDR at level $\alpha$ so $E\left[\frac{V}{R}\right]$ -* Calculate P-values normally -* Order the P-values from smallest to largest $P_{(1)},...,P_{(m)}$ -* Call any $P_{(i)} \leq \alpha \times \frac{i}{m}$ significant - -__Pros__: Still pretty easy to calculate, less conservative (maybe much less) - -__Cons__: Allows for more false positives, may behave strangely under dependence - ---- - -## Example with 10 P-values - - - -Controlling all error rates at $\alpha = 0.20$ - ---- - -## Adjusted P-values - -* One approach is to adjust the threshold $\alpha$ -* A different approach is to calculate "adjusted p-values" -* They _are not p-values_ anymore -* But they can be used directly without adjusting $\alpha$ - -__Example__: -* Suppose P-values are $P_1,\ldots,P_m$ -* You could adjust them by taking $P_i^{fwer} = \max{m \times P_i,1}$ for each P-value. -* Then if you call all $P_i^{fwer} < \alpha$ significant you will control the FWER. 
- ---- - -## Case study I: no true positives - -```{r createPvals,cache=TRUE} -set.seed(1010093) -pValues <- rep(NA,1000) -for(i in 1:1000){ - y <- rnorm(20) - x <- rnorm(20) - pValues[i] <- summary(lm(y ~ x))$coeff[2,4] -} - -# Controls false positive rate -sum(pValues < 0.05) -``` - ---- - -## Case study I: no true positives - -```{r, dependson="createPvals"} -# Controls FWER -sum(p.adjust(pValues,method="bonferroni") < 0.05) -# Controls FDR -sum(p.adjust(pValues,method="BH") < 0.05) -``` - - ---- - -## Case study II: 50% true positives - -```{r createPvals2,cache=TRUE} -set.seed(1010093) -pValues <- rep(NA,1000) -for(i in 1:1000){ - x <- rnorm(20) - # First 500 beta=0, last 500 beta=2 - if(i <= 500){y <- rnorm(20)}else{ y <- rnorm(20,mean=2*x)} - pValues[i] <- summary(lm(y ~ x))$coeff[2,4] -} -trueStatus <- rep(c("zero","not zero"),each=500) -table(pValues < 0.05, trueStatus) -``` - ---- - - -## Case study II: 50% true positives - -```{r, dependson="createPvals2"} -# Controls FWER -table(p.adjust(pValues,method="bonferroni") < 0.05,trueStatus) -# Controls FDR -table(p.adjust(pValues,method="BH") < 0.05,trueStatus) -``` - - ---- - - -## Case study II: 50% true positives - -__P-values versus adjusted P-values__ -```{r, dependson="createPvals2",fig.height=4,fig.width=8} -par(mfrow=c(1,2)) -plot(pValues,p.adjust(pValues,method="bonferroni"),pch=19) -plot(pValues,p.adjust(pValues,method="BH"),pch=19) -``` - - ---- - - -## Notes and resources - -__Notes__: -* Multiple testing is an entire subfield -* A basic Bonferroni/BH correction is usually enough -* If there is strong dependence between tests there may be problems - * Consider method="BY" - -__Further resources__: -* [Multiple testing procedures with applications to genomics](http://www.amazon.com/Multiple-Procedures-Applications-Genomics-Statistics/dp/0387493166/ref=sr_1_2/102-3292576-129059?ie=UTF8&s=books&qid=1187394873&sr=1-2) -* [Statistical significance for genome-wide 
studies](http://www.pnas.org/content/100/16/9440.full) -* [Introduction to multiple testing](http://ies.ed.gov/ncee/pubs/20084018/app_b.asp) - +--- +title : Multiple testing +subtitle : Statistical Inference +author : Brian Caffo, Jeffrey Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## Key ideas + +* Hypothesis testing/significance analysis is commonly overused +* Correcting for multiple testing avoids false positives or discoveries +* Two key components + * Error measure + * Correction + + +--- + +## Three eras of statistics + +__The age of Quetelet and his successors, in which huge census-level data sets were brought to bear on simple but important questions__: Are there more male than female births? Is the rate of insanity rising? + +The classical period of Pearson, Fisher, Neyman, Hotelling, and their successors, intellectual giants who __developed a theory of optimal inference capable of wringing every drop of information out of a scientific experiment__. The questions dealt with still tended to be simple Is treatment A better than treatment B? + +__The era of scientific mass production__, in which new technologies typified by the microarray allow a single team of scientists to produce data sets of a size Quetelet would envy. But now the flood of data is accompanied by a deluge of questions, perhaps thousands of estimates or hypothesis tests that the statistician is charged with answering together; not at all what the classical masters had in mind. Which variables matter among the thousands measured? How do you relate unrelated information? 
+ +[http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf](http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf) + +--- + +## Reasons for multiple testing + + + + +--- + +## Why correct for multiple tests? + + + + +[http://xkcd.com/882/](http://xkcd.com/882/) + +--- + +## Why correct for multiple tests? + + + +[http://xkcd.com/882/](http://xkcd.com/882/) + + +--- + +## Types of errors + +Suppose you are testing a hypothesis that a parameter $\beta$ equals zero versus the alternative that it does not equal zero. These are the possible outcomes. +

+ + | $\beta=0$ | $\beta\neq0$ | Hypotheses +--------------------|-------------|----------------|--------- +Claim $\beta=0$ | $U$ | $T$ | $m-R$ +Claim $\beta\neq 0$ | $V$ | $S$ | $R$ + Claims | $m_0$ | $m-m_0$ | $m$ + +

+ +__Type I error or false positive ($V$)__ Say that the parameter does not equal zero when it does + +__Type II error or false negative ($T$)__ Say that the parameter equals zero when it doesn't + + +--- + +## Error rates + +__False positive rate__ - The rate at which false results ($\beta = 0$) are called significant: $E\left[\frac{V}{m_0}\right]$* + +__Family wise error rate (FWER)__ - The probability of at least one false positive ${\rm Pr}(V \geq 1)$ + +__False discovery rate (FDR)__ - The rate at which claims of significance are false $E\left[\frac{V}{R}\right]$ + +* The false positive rate is closely related to the type I error rate [http://en.wikipedia.org/wiki/False_positive_rate](http://en.wikipedia.org/wiki/False_positive_rate) + +--- + +## Controlling the false positive rate + +If P-values are correctly calculated calling all $P < \alpha$ significant will control the false positive rate at level $\alpha$ on average. + +Problem: Suppose that you perform 10,000 tests and $\beta = 0$ for all of them. + +Suppose that you call all $P < 0.05$ significant. + +The expected number of false positives is: $10,000 \times 0.05 = 500$ false positives. + +__How do we avoid so many false positives?__ + + +--- + +## Controlling family-wise error rate (FWER) + + +The [Bonferroni correction](http://en.wikipedia.org/wiki/Bonferroni_correction) is the oldest multiple testing correction. + +__Basic idea__: +* Suppose you do $m$ tests +* You want to control FWER at level $\alpha$ so $Pr(V \geq 1) < \alpha$ +* Calculate P-values normally +* Set $\alpha_{fwer} = \alpha/m$ +* Call all $P$-values less than $\alpha_{fwer}$ significant + +__Pros__: Easy to calculate, conservative +__Cons__: May be very conservative + + +--- + +## Controlling false discovery rate (FDR) + +This is the most popular correction when performing _lots_ of tests say in genomics, imaging, astronomy, or other signal-processing disciplines. 
+ +__Basic idea__: +* Suppose you do $m$ tests +* You want to control FDR at level $\alpha$ so $E\left[\frac{V}{R}\right]$ +* Calculate P-values normally +* Order the P-values from smallest to largest $P_{(1)},...,P_{(m)}$ +* Call any $P_{(i)} \leq \alpha \times \frac{i}{m}$ significant + +__Pros__: Still pretty easy to calculate, less conservative (maybe much less) + +__Cons__: Allows for more false positives, may behave strangely under dependence + +--- + +## Example with 10 P-values + + + +Controlling all error rates at $\alpha = 0.20$ + +--- + +## Adjusted P-values + +* One approach is to adjust the threshold $\alpha$ +* A different approach is to calculate "adjusted p-values" +* They _are not p-values_ anymore +* But they can be used directly without adjusting $\alpha$ + +__Example__: +* Suppose P-values are $P_1,\ldots,P_m$ +* You could adjust them by taking $P_i^{fwer} = \max{m \times P_i,1}$ for each P-value. +* Then if you call all $P_i^{fwer} < \alpha$ significant you will control the FWER. 
+ +--- + +## Case study I: no true positives + +```{r createPvals,cache=TRUE} +set.seed(1010093) +pValues <- rep(NA,1000) +for(i in 1:1000){ + y <- rnorm(20) + x <- rnorm(20) + pValues[i] <- summary(lm(y ~ x))$coeff[2,4] +} + +# Controls false positive rate +sum(pValues < 0.05) +``` + +--- + +## Case study I: no true positives + +```{r, dependson="createPvals"} +# Controls FWER +sum(p.adjust(pValues,method="bonferroni") < 0.05) +# Controls FDR +sum(p.adjust(pValues,method="BH") < 0.05) +``` + + +--- + +## Case study II: 50% true positives + +```{r createPvals2,cache=TRUE} +set.seed(1010093) +pValues <- rep(NA,1000) +for(i in 1:1000){ + x <- rnorm(20) + # First 500 beta=0, last 500 beta=2 + if(i <= 500){y <- rnorm(20)}else{ y <- rnorm(20,mean=2*x)} + pValues[i] <- summary(lm(y ~ x))$coeff[2,4] +} +trueStatus <- rep(c("zero","not zero"),each=500) +table(pValues < 0.05, trueStatus) +``` + +--- + + +## Case study II: 50% true positives + +```{r, dependson="createPvals2"} +# Controls FWER +table(p.adjust(pValues,method="bonferroni") < 0.05,trueStatus) +# Controls FDR +table(p.adjust(pValues,method="BH") < 0.05,trueStatus) +``` + + +--- + + +## Case study II: 50% true positives + +__P-values versus adjusted P-values__ +```{r, dependson="createPvals2",fig.height=4,fig.width=8} +par(mfrow=c(1,2)) +plot(pValues,p.adjust(pValues,method="bonferroni"),pch=19) +plot(pValues,p.adjust(pValues,method="BH"),pch=19) +``` + + +--- + + +## Notes and resources + +__Notes__: +* Multiple testing is an entire subfield +* A basic Bonferroni/BH correction is usually enough +* If there is strong dependence between tests there may be problems + * Consider method="BY" + +__Further resources__: +* [Multiple testing procedures with applications to genomics](http://www.amazon.com/Multiple-Procedures-Applications-Genomics-Statistics/dp/0387493166/ref=sr_1_2/102-3292576-129059?ie=UTF8&s=books&qid=1187394873&sr=1-2) +* [Statistical significance for genome-wide 
studies](http://www.pnas.org/content/100/16/9440.full) +* [Introduction to multiple testing](http://ies.ed.gov/ncee/pubs/20084018/app_b.asp) + diff --git a/06_StatisticalInference/03_05_MultipleTesting/index.html b/06_StatisticalInference/12_MultipleTesting/index.html similarity index 88% rename from 06_StatisticalInference/03_05_MultipleTesting/index.html rename to 06_StatisticalInference/12_MultipleTesting/index.html index bb3a271db..dfc498eeb 100644 --- a/06_StatisticalInference/03_05_MultipleTesting/index.html +++ b/06_StatisticalInference/12_MultipleTesting/index.html @@ -1,581 +1,585 @@ - - - - Multiple testing - - - - - - - - - - - - - - - - - - - - - - - - - - -
-

Multiple testing

-

Statistical Inference

-

Brian Caffo, Jeffrey Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

-
-
-
- - - - -
-

Key ideas

-
-
-
    -
  • Hypothesis testing/significance analysis is commonly overused
  • -
  • Correcting for multiple testing avoids false positives or discoveries
  • -
  • Two key components - -
      -
    • Error measure
    • -
    • Correction
    • -
  • -
- -
- -
- - -
-

Three eras of statistics

-
-
-

The age of Quetelet and his successors, in which huge census-level data sets were brought to bear on simple but important questions: Are there more male than female births? Is the rate of insanity rising?

- -

The classical period of Pearson, Fisher, Neyman, Hotelling, and their successors, intellectual giants who developed a theory of optimal inference capable of wringing every drop of information out of a scientific experiment. The questions dealt with still tended to be simple Is treatment A better than treatment B?

- -

The era of scientific mass production, in which new technologies typified by the microarray allow a single team of scientists to produce data sets of a size Quetelet would envy. But now the flood of data is accompanied by a deluge of questions, perhaps thousands of estimates or hypothesis tests that the statistician is charged with answering together; not at all what the classical masters had in mind. Which variables matter among the thousands measured? How do you relate unrelated information?

- -

http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf

- -
- -
- - -
-

Reasons for multiple testing

-
-
-

- -
- -
- - -
-

Why correct for multiple tests?

-
- - -
- - -
-

Why correct for multiple tests?

-
- - -
- - -
-

Types of errors

-
-
-

Suppose you are testing a hypothesis that a parameter \(\beta\) equals zero versus the alternative that it does not equal zero. These are the possible outcomes. -

- - - - - - - - - - - - - - - - - - - - - - - - - - - -
\(\beta=0\)\(\beta\neq0\)Hypotheses
Claim \(\beta=0\)\(U\)\(T\)\(m-R\)
Claim \(\beta\neq 0\)\(V\)\(S\)\(R\)
Claims\(m_0\)\(m-m_0\)\(m\)
- -



- -

Type I error or false positive (\(V\)) Say that the parameter does not equal zero when it does

- -

Type II error or false negative (\(T\)) Say that the parameter equals zero when it doesn't

- -
- -
- - -
-

Error rates

-
-
-

False positive rate - The rate at which false results (\(\beta = 0\)) are called significant: \(E\left[\frac{V}{m_0}\right]\)*

- -

Family wise error rate (FWER) - The probability of at least one false positive \({\rm Pr}(V \geq 1)\)

- -

False discovery rate (FDR) - The rate at which claims of significance are false \(E\left[\frac{V}{R}\right]\)

- - - -
- -
- - -
-

Controlling the false positive rate

-
-
-

If P-values are correctly calculated calling all \(P < \alpha\) significant will control the false positive rate at level \(\alpha\) on average.

- -

Problem: Suppose that you perform 10,000 tests and \(\beta = 0\) for all of them.

- -

Suppose that you call all \(P < 0.05\) significant.

- -

The expected number of false positives is: \(10,000 \times 0.05 = 500\) false positives.

- -

How do we avoid so many false positives?

- -
- -
- - -
-

Controlling family-wise error rate (FWER)

-
-
-

The Bonferroni correction is the oldest multiple testing correction.

- -

Basic idea:

- -
    -
  • Suppose you do \(m\) tests
  • -
  • You want to control FWER at level \(\alpha\) so \(Pr(V \geq 1) < \alpha\)
  • -
  • Calculate P-values normally
  • -
  • Set \(\alpha_{fwer} = \alpha/m\)
  • -
  • Call all \(P\)-values less than \(\alpha_{fwer}\) significant
  • -
- -

Pros: Easy to calculate, conservative -Cons: May be very conservative

- -
- -
- - -
-

Controlling false discovery rate (FDR)

-
-
-

This is the most popular correction when performing lots of tests say in genomics, imaging, astronomy, or other signal-processing disciplines.

- -

Basic idea:

- -
    -
  • Suppose you do \(m\) tests
  • -
  • You want to control FDR at level \(\alpha\) so \(E\left[\frac{V}{R}\right]\)
  • -
  • Calculate P-values normally
  • -
  • Order the P-values from smallest to largest \(P_{(1)},...,P_{(m)}\)
  • -
  • Call any \(P_{(i)} \leq \alpha \times \frac{i}{m}\) significant
  • -
- -

Pros: Still pretty easy to calculate, less conservative (maybe much less)

- -

Cons: Allows for more false positives, may behave strangely under dependence

- -
- -
- - -
-

Example with 10 P-values

-
-
-

- -

Controlling all error rates at \(\alpha = 0.20\)

- -
- -
- - -
-

Adjusted P-values

-
-
-
    -
  • One approach is to adjust the threshold \(\alpha\)
  • -
  • A different approach is to calculate "adjusted p-values"
  • -
  • They are not p-values anymore
  • -
  • But they can be used directly without adjusting \(\alpha\)
  • -
- -

Example:

- -
    -
  • Suppose P-values are \(P_1,\ldots,P_m\)
  • -
  • You could adjust them by taking \(P_i^{fwer} = \max{m \times P_i,1}\) for each P-value.
  • -
  • Then if you call all \(P_i^{fwer} < \alpha\) significant you will control the FWER.
  • -
- -
- -
- - -
-

Case study I: no true positives

-
-
-
set.seed(1010093)
-pValues <- rep(NA,1000)
-for(i in 1:1000){
-  y <- rnorm(20)
-  x <- rnorm(20)
-  pValues[i] <- summary(lm(y ~ x))$coeff[2,4]
-}
-
-# Controls false positive rate
-sum(pValues < 0.05)
-
- -
[1] 51
-
- -
- -
- - -
-

Case study I: no true positives

-
-
-
# Controls FWER 
-sum(p.adjust(pValues,method="bonferroni") < 0.05)
-
- -
[1] 0
-
- -
# Controls FDR 
-sum(p.adjust(pValues,method="BH") < 0.05)
-
- -
[1] 0
-
- -
- -
- - -
-

Case study II: 50% true positives

-
-
-
set.seed(1010093)
-pValues <- rep(NA,1000)
-for(i in 1:1000){
-  x <- rnorm(20)
-  # First 500 beta=0, last 500 beta=2
-  if(i <= 500){y <- rnorm(20)}else{ y <- rnorm(20,mean=2*x)}
-  pValues[i] <- summary(lm(y ~ x))$coeff[2,4]
-}
-trueStatus <- rep(c("zero","not zero"),each=500)
-table(pValues < 0.05, trueStatus)
-
- -
       trueStatus
-        not zero zero
-  FALSE        0  476
-  TRUE       500   24
-
- -
- -
- - -
-

Case study II: 50% true positives

-
-
-
# Controls FWER 
-table(p.adjust(pValues,method="bonferroni") < 0.05,trueStatus)
-
- -
       trueStatus
-        not zero zero
-  FALSE       23  500
-  TRUE       477    0
-
- -
# Controls FDR 
-table(p.adjust(pValues,method="BH") < 0.05,trueStatus)
-
- -
       trueStatus
-        not zero zero
-  FALSE        0  487
-  TRUE       500   13
-
- -
- -
- - -
-

Case study II: 50% true positives

-
-
-

P-values versus adjusted P-values

- -
par(mfrow=c(1,2))
-plot(pValues,p.adjust(pValues,method="bonferroni"),pch=19)
-plot(pValues,p.adjust(pValues,method="BH"),pch=19)
-
- -
plot of chunk unnamed-chunk-3
- -
- -
- - -
-

Notes and resources

-
- - -
- - -
- - - - - - - - - - - - - - + + + + Multiple testing + + + + + + + + + + + + + + + + + + + + + + + + + + +
+

Multiple testing

+

Statistical Inference

+

Brian Caffo, Jeffrey Leek, Roger Peng
Johns Hopkins Bloomberg School of Public Health

+
+
+
+ + + + +
+

Key ideas

+
+
+
    +
  • Hypothesis testing/significance analysis is commonly overused
  • +
      • Correcting for multiple testing avoids false positives or false discoveries
    
  • +
  • Two key components + +
      +
    • Error measure
    • +
    • Correction
    • +
  • +
+ +
+ +
+ + +
+

Three eras of statistics

+
+
+

The age of Quetelet and his successors, in which huge census-level data sets were brought to bear on simple but important questions: Are there more male than female births? Is the rate of insanity rising?

+ +

    
    The classical period of Pearson, Fisher, Neyman, Hotelling, and their successors, intellectual giants who developed a theory of optimal inference capable of wringing every drop of information out of a scientific experiment. The questions dealt with still tended to be simple: Is treatment A better than treatment B?
    

+ +

The era of scientific mass production, in which new technologies typified by the microarray allow a single team of scientists to produce data sets of a size Quetelet would envy. But now the flood of data is accompanied by a deluge of questions, perhaps thousands of estimates or hypothesis tests that the statistician is charged with answering together; not at all what the classical masters had in mind. Which variables matter among the thousands measured? How do you relate unrelated information?

+ +

http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf

+ +
+ +
+ + +
+

Reasons for multiple testing

+
+
+

+ +
+ +
+ + +
+

Why correct for multiple tests?

+
+ + +
+ + +
+

Why correct for multiple tests?

+
+ + +
+ + +
+

Types of errors

+
+
+

Suppose you are testing a hypothesis that a parameter \(\beta\) equals zero versus the alternative that it does not equal zero. These are the possible outcomes. +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + +
\(\beta=0\)\(\beta\neq0\)Hypotheses
Claim \(\beta=0\)\(U\)\(T\)\(m-R\)
Claim \(\beta\neq 0\)\(V\)\(S\)\(R\)
Claims\(m_0\)\(m-m_0\)\(m\)
+ +



+ +

    
    Type I error or false positive (\(V\)): Say that the parameter does not equal zero when it does
    

+ +

    
    Type II error or false negative (\(T\)): Say that the parameter equals zero when it doesn't
    

+ +
+ +
+ + +
+

Error rates

+
+
+

False positive rate - The rate at which false results (\(\beta = 0\)) are called significant: \(E\left[\frac{V}{m_0}\right]\)*

+ +

Family wise error rate (FWER) - The probability of at least one false positive \({\rm Pr}(V \geq 1)\)

+ +

False discovery rate (FDR) - The rate at which claims of significance are false \(E\left[\frac{V}{R}\right]\)

+ + + +
+ +
+ + +
+

Controlling the false positive rate

+
+
+

If P-values are correctly calculated calling all \(P < \alpha\) significant will control the false positive rate at level \(\alpha\) on average.

+ +

Problem: Suppose that you perform 10,000 tests and \(\beta = 0\) for all of them.

+ +

Suppose that you call all \(P < 0.05\) significant.

+ +

The expected number of false positives is: \(10,000 \times 0.05 = 500\) false positives.

+ +

How do we avoid so many false positives?

+ +
+ +
+ + +
+

Controlling family-wise error rate (FWER)

+
+
+

The Bonferroni correction is the oldest multiple testing correction.

+ +

Basic idea:

+ +
    +
  • Suppose you do \(m\) tests
  • +
  • You want to control FWER at level \(\alpha\) so \(Pr(V \geq 1) < \alpha\)
  • +
  • Calculate P-values normally
  • +
  • Set \(\alpha_{fwer} = \alpha/m\)
  • +
  • Call all \(P\)-values less than \(\alpha_{fwer}\) significant
  • +
+ +

Pros: Easy to calculate, conservative +Cons: May be very conservative

+ +
+ +
+ + +
+

Controlling false discovery rate (FDR)

+
+
+

    
    This is the most popular correction when performing lots of tests, say in genomics, imaging, astronomy, or other signal-processing disciplines.
    

+ +

Basic idea:

+ +
    +
  • Suppose you do \(m\) tests
  • +
      • You want to control FDR at level \(\alpha\) so \(E\left[\frac{V}{R}\right] \leq \alpha\)
    
  • +
  • Calculate P-values normally
  • +
  • Order the P-values from smallest to largest \(P_{(1)},...,P_{(m)}\)
  • +
  • Call any \(P_{(i)} \leq \alpha \times \frac{i}{m}\) significant
  • +
+ +

Pros: Still pretty easy to calculate, less conservative (maybe much less)

+ +

Cons: Allows for more false positives, may behave strangely under dependence

+ +
+ +
+ + +
+

Example with 10 P-values

+
+
+

+ +

Controlling all error rates at \(\alpha = 0.20\)

+ +
+ +
+ + +
+

Adjusted P-values

+
+
+
    +
  • One approach is to adjust the threshold \(\alpha\)
  • +
  • A different approach is to calculate "adjusted p-values"
  • +
  • They are not p-values anymore
  • +
  • But they can be used directly without adjusting \(\alpha\)
  • +
+ +

Example:

+ +
    +
  • Suppose P-values are \(P_1,\ldots,P_m\)
  • +
      • You could adjust them by taking \(P_i^{fwer} = \min(m \times P_i, 1)\) for each P-value.
    
  • +
  • Then if you call all \(P_i^{fwer} < \alpha\) significant you will control the FWER.
  • +
+ +
+ +
+ + +
+

Case study I: no true positives

+
+
+
set.seed(1010093)
+pValues <- rep(NA, 1000)
+for (i in 1:1000) {
+    y <- rnorm(20)
+    x <- rnorm(20)
+    pValues[i] <- summary(lm(y ~ x))$coeff[2, 4]
+}
+
+# Controls false positive rate
+sum(pValues < 0.05)
+
+ +
## [1] 51
+
+ +
+ +
+ + +
+

Case study I: no true positives

+
+
+
# Controls FWER
+sum(p.adjust(pValues, method = "bonferroni") < 0.05)
+
+ +
## [1] 0
+
+ +
# Controls FDR
+sum(p.adjust(pValues, method = "BH") < 0.05)
+
+ +
## [1] 0
+
+ +
+ +
+ + +
+

Case study II: 50% true positives

+
+
+
set.seed(1010093)
+pValues <- rep(NA, 1000)
+for (i in 1:1000) {
+    x <- rnorm(20)
+    # First 500 beta=0, last 500 beta=2
+    if (i <= 500) {
+        y <- rnorm(20)
+    } else {
+        y <- rnorm(20, mean = 2 * x)
+    }
+    pValues[i] <- summary(lm(y ~ x))$coeff[2, 4]
+}
+trueStatus <- rep(c("zero", "not zero"), each = 500)
+table(pValues < 0.05, trueStatus)
+
+ +
##        trueStatus
+##         not zero zero
+##   FALSE        0  476
+##   TRUE       500   24
+
+ +
+ +
+ + +
+

Case study II: 50% true positives

+
+
+
# Controls FWER
+table(p.adjust(pValues, method = "bonferroni") < 0.05, trueStatus)
+
+ +
##        trueStatus
+##         not zero zero
+##   FALSE       23  500
+##   TRUE       477    0
+
+ +
# Controls FDR
+table(p.adjust(pValues, method = "BH") < 0.05, trueStatus)
+
+ +
##        trueStatus
+##         not zero zero
+##   FALSE        0  487
+##   TRUE       500   13
+
+ +
+ +
+ + +
+

Case study II: 50% true positives

+
+
+

P-values versus adjusted P-values

+ +
par(mfrow = c(1, 2))
+plot(pValues, p.adjust(pValues, method = "bonferroni"), pch = 19)
+plot(pValues, p.adjust(pValues, method = "BH"), pch = 19)
+
+ +

plot of chunk unnamed-chunk-3

+ +
+ +
+ + +
+

Notes and resources

+
+ + +
+ + +
+ + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/12_MultipleTesting/index.md b/06_StatisticalInference/12_MultipleTesting/index.md new file mode 100644 index 000000000..08f1afa2f --- /dev/null +++ b/06_StatisticalInference/12_MultipleTesting/index.md @@ -0,0 +1,308 @@ +--- +title : Multiple testing +subtitle : Statistical Inference +author : Brian Caffo, Jeffrey Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- +## Key ideas + +* Hypothesis testing/significance analysis is commonly overused +* Correcting for multiple testing avoids false positives or discoveries +* Two key components + * Error measure + * Correction + + +--- + +## Three eras of statistics + +__The age of Quetelet and his successors, in which huge census-level data sets were brought to bear on simple but important questions__: Are there more male than female births? Is the rate of insanity rising? + +The classical period of Pearson, Fisher, Neyman, Hotelling, and their successors, intellectual giants who __developed a theory of optimal inference capable of wringing every drop of information out of a scientific experiment__. The questions dealt with still tended to be simple Is treatment A better than treatment B? + +__The era of scientific mass production__, in which new technologies typified by the microarray allow a single team of scientists to produce data sets of a size Quetelet would envy. 
But now the flood of data is accompanied by a deluge of questions, perhaps thousands of estimates or hypothesis tests that the statistician is charged with answering together; not at all what the classical masters had in mind. Which variables matter among the thousands measured? How do you relate unrelated information? + +[http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf](http://www-stat.stanford.edu/~ckirby/brad/papers/2010LSIexcerpt.pdf) + +--- + +## Reasons for multiple testing + + + + +--- + +## Why correct for multiple tests? + + + + +[http://xkcd.com/882/](http://xkcd.com/882/) + +--- + +## Why correct for multiple tests? + + + +[http://xkcd.com/882/](http://xkcd.com/882/) + + +--- + +## Types of errors + +Suppose you are testing a hypothesis that a parameter $\beta$ equals zero versus the alternative that it does not equal zero. These are the possible outcomes. +

+
+                    | $\beta=0$ | $\beta\neq0$ | Hypotheses
+--------------------|-----------|--------------|-----------
+Claim $\beta=0$     | $U$       | $T$          | $m-R$
+Claim $\beta\neq 0$ | $V$       | $S$          | $R$
+Claims              | $m_0$     | $m-m_0$      | $m$
+

+ +__Type I error or false positive ($V$)__ Say that the parameter does not equal zero when it does + +__Type II error or false negative ($T$)__ Say that the parameter equals zero when it doesn't + + +--- + +## Error rates + +__False positive rate__ - The rate at which false results ($\beta = 0$) are called significant: $E\left[\frac{V}{m_0}\right]$* + +__Family-wise error rate (FWER)__ - The probability of at least one false positive ${\rm Pr}(V \geq 1)$ + +__False discovery rate (FDR)__ - The rate at which claims of significance are false $E\left[\frac{V}{R}\right]$ + +* The false positive rate is closely related to the type I error rate [http://en.wikipedia.org/wiki/False_positive_rate](http://en.wikipedia.org/wiki/False_positive_rate) + +--- + +## Controlling the false positive rate + +If P-values are correctly calculated, calling all $P < \alpha$ significant will control the false positive rate at level $\alpha$ on average. + +Problem: Suppose that you perform 10,000 tests and $\beta = 0$ for all of them. + +Suppose that you call all $P < 0.05$ significant. + +The expected number of false positives is: $10,000 \times 0.05 = 500$ false positives. + +__How do we avoid so many false positives?__ + + +--- + +## Controlling family-wise error rate (FWER) + + +The [Bonferroni correction](http://en.wikipedia.org/wiki/Bonferroni_correction) is the oldest multiple testing correction. + +__Basic idea__: +* Suppose you do $m$ tests +* You want to control FWER at level $\alpha$ so $Pr(V \geq 1) < \alpha$ +* Calculate P-values normally +* Set $\alpha_{fwer} = \alpha/m$ +* Call all $P$-values less than $\alpha_{fwer}$ significant + +__Pros__: Easy to calculate, conservative +__Cons__: May be very conservative + + +--- + +## Controlling false discovery rate (FDR) + +This is the most popular correction when performing _lots_ of tests, say in genomics, imaging, astronomy, or other signal-processing disciplines.
+ +__Basic idea__: +* Suppose you do $m$ tests +* You want to control FDR at level $\alpha$ so $E\left[\frac{V}{R}\right] \leq \alpha$ +* Calculate P-values normally +* Order the P-values from smallest to largest $P_{(1)},...,P_{(m)}$ +* Call any $P_{(i)} \leq \alpha \times \frac{i}{m}$ significant + +__Pros__: Still pretty easy to calculate, less conservative (maybe much less) + +__Cons__: Allows for more false positives, may behave strangely under dependence + +--- + +## Example with 10 P-values + + + +Controlling all error rates at $\alpha = 0.20$ + +--- + +## Adjusted P-values + +* One approach is to adjust the threshold $\alpha$ +* A different approach is to calculate "adjusted p-values" +* They _are not p-values_ anymore +* But they can be used directly without adjusting $\alpha$ + +__Example__: +* Suppose P-values are $P_1,\ldots,P_m$ +* You could adjust them by taking $P_i^{fwer} = \min(m \times P_i, 1)$ for each P-value. +* Then if you call all $P_i^{fwer} < \alpha$ significant, you will control the FWER.
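Both adjustments above are simple enough to compute by hand. A minimal Python sketch of the Bonferroni and Benjamini-Hochberg adjusted p-values, mirroring what R's `p.adjust` returns (the function names here are ours, not from the slides):

```python
def bonferroni_adjust(pvalues):
    """Bonferroni adjusted p-values: min(m * p, 1) for each p-value."""
    m = len(pvalues)
    return [min(m * p, 1.0) for p in pvalues]


def bh_adjust(pvalues):
    """Benjamini-Hochberg (step-up) adjusted p-values.

    Scale the i-th smallest p-value by m/i, then enforce monotonicity
    from the largest rank downward and cap everything at 1.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):       # walk ranks m, m-1, ..., 1
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = min(running_min, 1.0)
    return adjusted
```

Calling all adjusted values below $\alpha$ significant then controls the FWER (Bonferroni) or the FDR (BH), as on the slide.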
+ +--- + +## Case study I: no true positives + + +```r +set.seed(1010093) +pValues <- rep(NA, 1000) +for (i in 1:1000) { + y <- rnorm(20) + x <- rnorm(20) + pValues[i] <- summary(lm(y ~ x))$coeff[2, 4] +} + +# Controls false positive rate +sum(pValues < 0.05) +``` + +``` +## [1] 51 +``` + + +--- + +## Case study I: no true positives + + +```r +# Controls FWER +sum(p.adjust(pValues, method = "bonferroni") < 0.05) +``` + +``` +## [1] 0 +``` + +```r +# Controls FDR +sum(p.adjust(pValues, method = "BH") < 0.05) +``` + +``` +## [1] 0 +``` + + + +--- + +## Case study II: 50% true positives + + +```r +set.seed(1010093) +pValues <- rep(NA, 1000) +for (i in 1:1000) { + x <- rnorm(20) + # First 500 beta=0, last 500 beta=2 + if (i <= 500) { + y <- rnorm(20) + } else { + y <- rnorm(20, mean = 2 * x) + } + pValues[i] <- summary(lm(y ~ x))$coeff[2, 4] +} +trueStatus <- rep(c("zero", "not zero"), each = 500) +table(pValues < 0.05, trueStatus) +``` + +``` +## trueStatus +## not zero zero +## FALSE 0 476 +## TRUE 500 24 +``` + + +--- + + +## Case study II: 50% true positives + + +```r +# Controls FWER +table(p.adjust(pValues, method = "bonferroni") < 0.05, trueStatus) +``` + +``` +## trueStatus +## not zero zero +## FALSE 23 500 +## TRUE 477 0 +``` + +```r +# Controls FDR +table(p.adjust(pValues, method = "BH") < 0.05, trueStatus) +``` + +``` +## trueStatus +## not zero zero +## FALSE 0 487 +## TRUE 500 13 +``` + + + +--- + + +## Case study II: 50% true positives + +__P-values versus adjusted P-values__ + +```r +par(mfrow = c(1, 2)) +plot(pValues, p.adjust(pValues, method = "bonferroni"), pch = 19) +plot(pValues, p.adjust(pValues, method = "BH"), pch = 19) +``` + +![plot of chunk unnamed-chunk-3](assets/fig/unnamed-chunk-3.png) + + + +--- + + +## Notes and resources + +__Notes__: +* Multiple testing is an entire subfield +* A basic Bonferroni/BH correction is usually enough +* If there is strong dependence between tests there may be problems + * Consider method="BY" + +__Further 
resources__: +* [Multiple testing procedures with applications to genomics](http://www.amazon.com/Multiple-Procedures-Applications-Genomics-Statistics/dp/0387493166/ref=sr_1_2/102-3292576-129059?ie=UTF8&s=books&qid=1187394873&sr=1-2) +* [Statistical significance for genome-wide studies](http://www.pnas.org/content/100/16/9440.full) +* [Introduction to multiple testing](http://ies.ed.gov/ncee/pubs/20084018/app_b.asp) + diff --git a/06_StatisticalInference/12_MultipleTesting/index.pdf b/06_StatisticalInference/12_MultipleTesting/index.pdf new file mode 100644 index 000000000..88d17ad14 Binary files /dev/null and b/06_StatisticalInference/12_MultipleTesting/index.pdf differ diff --git a/06_StatisticalInference/03_06_resampledInference/Bootstrapping.pdf b/06_StatisticalInference/13_Resampling/Bootstrapping.pdf similarity index 100% rename from 06_StatisticalInference/03_06_resampledInference/Bootstrapping.pdf rename to 06_StatisticalInference/13_Resampling/Bootstrapping.pdf diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-1.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-1.png new file mode 100644 index 000000000..d1942083a Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-1.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-10.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-10.png new file mode 100644 index 000000000..7a15bddea Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-10.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-11.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-11.png new file mode 100644 index 000000000..9e8c3415a Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-11.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-12.png 
b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-12.png new file mode 100644 index 000000000..7dac51d10 Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-12.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-2.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-2.png new file mode 100644 index 000000000..e352482dd Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-2.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-3.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-3.png new file mode 100644 index 000000000..8c870b65b Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-3.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-4.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-4.png new file mode 100644 index 000000000..bedda710c Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-4.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-5.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-5.png new file mode 100644 index 000000000..8c870b65b Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-5.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-6.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-6.png new file mode 100644 index 000000000..f6dfe3779 Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-6.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-7.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-7.png new file mode 100644 index 000000000..bad3b9ea5 Binary files /dev/null and 
b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-7.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-8.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-8.png new file mode 100644 index 000000000..46e951432 Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-8.png differ diff --git a/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-9.png b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-9.png new file mode 100644 index 000000000..09ea388cd Binary files /dev/null and b/06_StatisticalInference/13_Resampling/assets/fig/unnamed-chunk-9.png differ diff --git a/06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-4.png b/06_StatisticalInference/13_Resampling/fig/unnamed-chunk-4.png similarity index 100% rename from 06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-4.png rename to 06_StatisticalInference/13_Resampling/fig/unnamed-chunk-4.png diff --git a/06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-5.png b/06_StatisticalInference/13_Resampling/fig/unnamed-chunk-5.png similarity index 100% rename from 06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-5.png rename to 06_StatisticalInference/13_Resampling/fig/unnamed-chunk-5.png diff --git a/06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-6.png b/06_StatisticalInference/13_Resampling/fig/unnamed-chunk-6.png similarity index 100% rename from 06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-6.png rename to 06_StatisticalInference/13_Resampling/fig/unnamed-chunk-6.png diff --git a/06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-7.png b/06_StatisticalInference/13_Resampling/fig/unnamed-chunk-7.png similarity index 100% rename from 06_StatisticalInference/03_06_resampledInference/fig/unnamed-chunk-7.png rename to 
06_StatisticalInference/13_Resampling/fig/unnamed-chunk-7.png diff --git a/06_StatisticalInference/03_06_resampledInference/figure/unnamed-chunk-4.png b/06_StatisticalInference/13_Resampling/figure/unnamed-chunk-4.png similarity index 100% rename from 06_StatisticalInference/03_06_resampledInference/figure/unnamed-chunk-4.png rename to 06_StatisticalInference/13_Resampling/figure/unnamed-chunk-4.png diff --git a/06_StatisticalInference/13_Resampling/index.Rmd b/06_StatisticalInference/13_Resampling/index.Rmd new file mode 100644 index 000000000..027b2fa97 --- /dev/null +++ b/06_StatisticalInference/13_Resampling/index.Rmd @@ -0,0 +1,222 @@ +--- +title : Resampled inference +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## The bootstrap + +- The bootstrap is a tremendously useful tool for constructing confidence intervals and calculating standard errors for difficult statistics +- For example, how would one derive a confidence interval for the median? 
+- The bootstrap procedure follows from the so-called bootstrap principle + +--- +## Sample of 50 die rolls + +```{r, echo = FALSE, fig.width=12, fig.height = 6, fig.align='center'} +library(ggplot2) +library(gridExtra) +nosim <- 1000 + +cfunc <- function(x, n) mean(x) +g1 = ggplot(data.frame(y = rep(1/6, 6), x = 1 : 6), aes(y = y, x = x)) +g1 = g1 + geom_bar(stat = "identity", fill = "lightblue", colour = "black") + +dat <- data.frame(x = apply(matrix(sample(1 : 6, nosim * 50, replace = TRUE), + nosim), 1, mean)) +g2 <- ggplot(dat, aes(x = x)) + geom_histogram(binwidth=.2, colour = "black", fill = "salmon", aes(y = ..density..)) + +grid.arrange(g1, g2, ncol = 2) + +``` + + +--- +## What if we only had one sample? +```{r, echo = FALSE, fig.width=9, fig.height = 6, fig.align='center'} +n = 50 +B = 1000 +## our data +x = sample(1 : 6, n, replace = TRUE) +## bootstrap resamples +resamples = matrix(sample(x, + n * B, + replace = TRUE), + B, n) +resampledMeans = apply(resamples, 1, mean) +g1 <- ggplot(as.data.frame(prop.table(table(x))), aes(x = x, y = Freq)) + geom_bar(colour = "black", fill = "lightblue", stat = "identity") +g2 <- ggplot(data.frame(x = resampledMeans), aes(x = x)) + geom_histogram(binwidth=.2, colour = "black", fill = "salmon", aes(y = ..density..)) +grid.arrange(g1, g2, ncol = 2) +``` + + +--- +## Consider a data set +```{r} +library(UsingR) +data(father.son) +x <- father.son$sheight +n <- length(x) +B <- 10000 +resamples <- matrix(sample(x, + n * B, + replace = TRUE), + B, n) +resampledMedians <- apply(resamples, 1, median) +``` + +--- +## A plot of the histogram of the resamples +```{r, fig.align='center', fig.height=6, fig.width=6, echo=FALSE, warning=FALSE} +g = ggplot(data.frame(x = resampledMedians), aes(x = x)) +g = g + geom_density(size = 2, fill = "red") +#g = g + geom_histogram(alpha = .20, binwidth=.3, colour = "black", fill = "blue", aes(y = ..density..)) +g = g + geom_vline(xintercept = median(x), size = 2) +g +``` + +--- + + +## The
bootstrap principle + +- Suppose that I have a statistic that estimates some population parameter, but I don't know its sampling distribution +- The bootstrap principle suggests using the distribution defined by the data to approximate its sampling distribution + +--- + +## The bootstrap in practice + +- In practice, the bootstrap principle is always carried out using simulation +- We will cover only a few aspects of bootstrap resampling +- The general procedure follows by first simulating complete data sets from the observed data with replacement + + - This is approximately drawing from the sampling distribution of that statistic, at least as far as the data is able to approximate the true population distribution + +- Calculate the statistic for each simulated data set +- Use the simulated statistics to either define a confidence interval or take the standard deviation to calculate a standard error + + +--- +## Nonparametric bootstrap algorithm example + +- Bootstrap procedure for calculating confidence interval for the median from a data set of $n$ observations + + i. Sample $n$ observations **with replacement** from the observed data resulting in one simulated complete data set + + ii. Take the median of the simulated data set + + iii. Repeat these two steps $B$ times, resulting in $B$ simulated medians + + iv. 
These medians are approximately drawn from the sampling distribution of the median of $n$ observations; therefore we can + + - Draw a histogram of them + - Calculate their standard deviation to estimate the standard error of the median + - Take the $2.5^{th}$ and $97.5^{th}$ percentiles as a confidence interval for the median + +--- + +## Example code + +```{r} +B <- 10000 +resamples <- matrix(sample(x, + n * B, + replace = TRUE), + B, n) +medians <- apply(resamples, 1, median) +sd(medians) +quantile(medians, c(.025, .975)) +``` + +--- +## Histogram of bootstrap resamples + +```{r, fig.height=6, fig.width=6, echo=TRUE,fig.align='center', warning=FALSE} +g = ggplot(data.frame(medians = medians), aes(x = medians)) +g = g + geom_histogram(color = "black", fill = "lightblue", binwidth = 0.05) +g +``` + +--- + +## Notes on the bootstrap + +- The bootstrap is non-parametric +- Better percentile bootstrap confidence intervals correct for bias +- There are lots of variations on bootstrap procedures; the book "An Introduction to the Bootstrap" by Efron and Tibshirani is a great place to start for both bootstrap and jackknife information + + +--- +## Group comparisons +- Consider comparing two independent groups.
+- Example, comparing sprays B and C + +```{r, fig.height=6, fig.width=8, echo=FALSE, fig.align='center'} +data(InsectSprays) +g = ggplot(InsectSprays, aes(spray, count, fill = spray)) +g = g + geom_boxplot() +g +``` + +--- +## Permutation tests +- Consider the null hypothesis that the distribution of the observations from each group is the same +- Then, the group labels are irrelevant +- Consider a data frame with count and spray +- Permute the spray (group) labels +- Recalculate the statistic + - Mean difference in counts + - Geometric means + - T statistic +- Calculate the percentage of simulations where +the simulated statistic was more extreme (toward +the alternative) than the observed + +--- +## Variations on permutation testing +Data type | Statistic | Test name +---|---|---| +Ranks | rank sum | rank sum test +Binary | hypergeometric prob | Fisher's exact test +Raw data | | ordinary permutation test + +- Also, so-called *randomization tests* are exactly permutation tests, with a different motivation.
+- For matched data, one can randomize the signs + - For ranks, this results in the signed rank test +- Permutation strategies work for regression as well + - Permuting a regressor of interest +- Permutation tests work very well in multivariate settings + +--- +## Permutation test B v C +```{r} +subdata <- InsectSprays[InsectSprays$spray %in% c("B", "C"),] +y <- subdata$count +group <- as.character(subdata$spray) +testStat <- function(w, g) mean(w[g == "B"]) - mean(w[g == "C"]) +observedStat <- testStat(y, group) +permutations <- sapply(1 : 10000, function(i) testStat(y, sample(group))) +observedStat +mean(permutations > observedStat) +``` + +--- +## Histogram of permutations B v C +```{r, echo= FALSE, fig.width=6, fig.height=6, fig.align='center'} +g = ggplot(data.frame(permutations = permutations), + aes(permutations)) +g = g + geom_histogram(fill = "lightblue", color = "black", binwidth = 1) +g = g + geom_vline(xintercept = observedStat, size = 2) +g +``` diff --git a/06_StatisticalInference/13_Resampling/index.html b/06_StatisticalInference/13_Resampling/index.html new file mode 100644 index 000000000..e7e900e93 --- /dev/null +++ b/06_StatisticalInference/13_Resampling/index.html @@ -0,0 +1,549 @@ + + + + Resampled inference + + + + + + + + + + + + + + + + + + + + + + + + + + +
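The label-permutation procedure in the chunk above translates directly to other languages. A Python sketch of the same two-group test; the counts below are made-up stand-ins for the spray B and C data, and the helper name is ours:

```python
import random


def permutation_test(y, group, label_a, label_b, n_perm=10000, seed=0):
    """Two-group permutation test for a difference in means.

    Returns the observed mean difference and the fraction of label
    permutations whose statistic is at least as large (including ties
    is the conservative convention).
    """
    rng = random.Random(seed)

    def stat(labels):
        a = [v for v, g in zip(y, labels) if g == label_a]
        b = [v for v, g in zip(y, labels) if g == label_b]
        return sum(a) / len(a) - sum(b) / len(b)

    observed = stat(group)
    labels = list(group)
    count = 0
    for _ in range(n_perm):
        rng.shuffle(labels)            # permute the group labels
        if stat(labels) >= observed:
            count += 1
    return observed, count / n_perm


# Hypothetical counts standing in for sprays B and C
b = [11, 17, 21, 11, 16, 14]
c = [3, 5, 12, 6, 4, 3]
obs, pval = permutation_test(b + c, ["B"] * 6 + ["C"] * 6, "B", "C")
# obs is 9.5 for these made-up counts; pval is the permutation p-value
```

As in the R version, a small `pval` says the observed mean difference is rarely matched when the group labels are scrambled.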
+ + + + + + + + + + + + + + + \ No newline at end of file diff --git a/06_StatisticalInference/13_Resampling/index.md b/06_StatisticalInference/13_Resampling/index.md new file mode 100644 index 000000000..00c62cfb0 --- /dev/null +++ b/06_StatisticalInference/13_Resampling/index.md @@ -0,0 +1,268 @@ +--- +title : Resampled inference +subtitle : Statistical Inference +author : Brian Caffo, Jeff Leek, Roger Peng +job : Johns Hopkins Bloomberg School of Public Health +logo : bloomberg_shield.png +framework : io2012 # {io2012, html5slides, shower, dzslides, ...} +highlighter : highlight.js # {highlight.js, prettify, highlight} +hitheme : tomorrow # +url: + lib: ../../librariesNew + assets: ../../assets +widgets : [mathjax] # {mathjax, quiz, bootstrap} +mode : selfcontained # {standalone, draft} +--- + +## The bootstrap + +- The bootstrap is a tremendously useful tool for constructing confidence intervals and calculating standard errors for difficult statistics +- For example, how would one derive a confidence interval for the median? +- The bootstrap procedure follows from the so called bootstrap principle + +--- +## Sample of 50 die rolls + + +``` +## Error: there is no package called 'gridExtra' +``` + +``` +## Error: could not find function "grid.arrange" +``` + + +--- +## What if we only had one sample? 
+ +``` +## Error: could not find function "grid.arrange" +``` + + +--- +## Consider a data set + +```r +library(UsingR) +``` + +``` +## Loading required package: MASS +## Loading required package: HistData +## Loading required package: Hmisc +## Loading required package: grid +## Loading required package: lattice +## Loading required package: survival +## Loading required package: splines +## Loading required package: Formula +## +## Attaching package: 'Hmisc' +## +## The following objects are masked from 'package:base': +## +## format.pval, round.POSIXt, trunc.POSIXt, units +## +## Loading required package: aplpack +## Loading required package: tcltk +## Loading required package: quantreg +## Loading required package: SparseM +## +## Attaching package: 'SparseM' +## +## The following object is masked from 'package:base': +## +## backsolve +## +## +## Attaching package: 'quantreg' +## +## The following object is masked from 'package:Hmisc': +## +## latex +## +## The following object is masked from 'package:survival': +## +## untangle.specials +## +## +## Attaching package: 'UsingR' +## +## The following object is masked from 'package:survival': +## +## cancer +## +## The following object is masked from 'package:ggplot2': +## +## movies +``` + +```r +data(father.son) +x <- father.son$sheight +n <- length(x) +B <- 10000 +resamples <- matrix(sample(x, + n * B, + replace = TRUE), + B, n) +resampledMedians <- apply(resamples, 1, median) +``` + +--- +## A plot of the histogram of the resamples +plot of chunk unnamed-chunk-4 + +--- + + +## The bootstrap principle + +- Suppose that I have a statistic that estimates some population parameter, but I don't know its sampling distribution +- The bootstrap principle suggests using the distribution defined by the data to approximate its sampling distribution + +--- + +## The bootstrap in practice + +- In practice, the bootstrap principle is always carried out using simulation +- We will cover only a few aspects of bootstrap
resampling +- The general procedure follows by first simulating complete data sets from the observed data with replacement + + - This is approximately drawing from the sampling distribution of that statistic, at least as far as the data is able to approximate the true population distribution + +- Calculate the statistic for each simulated data set +- Use the simulated statistics to either define a confidence interval or take the standard deviation to calculate a standard error + + +--- +## Nonparametric bootstrap algorithm example + +- Bootstrap procedure for calculating confidence interval for the median from a data set of $n$ observations + + i. Sample $n$ observations **with replacement** from the observed data resulting in one simulated complete data set + + ii. Take the median of the simulated data set + + iii. Repeat these two steps $B$ times, resulting in $B$ simulated medians + + iv. These medians are approximately drawn from the sampling distribution of the median of $n$ observations; therefore we can + + - Draw a histogram of them + - Calculate their standard deviation to estimate the standard error of the median + - Take the $2.5^{th}$ and $97.5^{th}$ percentiles as a confidence interval for the median + +--- + +## Example code + + +```r +B <- 10000 +resamples <- matrix(sample(x, + n * B, + replace = TRUE), + B, n) +medians <- apply(resamples, 1, median) +sd(medians) +``` + +``` +## [1] 0.08424 +``` + +```r +quantile(medians, c(.025, .975)) +``` + +``` +## 2.5% 97.5% +## 68.43 68.81 +``` + +--- +## Histogram of bootstrap resamples + + +```r +g = ggplot(data.frame(medians = medians), aes(x = medians)) +g = g + geom_histogram(color = "black", fill = "lightblue", binwidth = 0.05) +g +``` + +plot of chunk unnamed-chunk-6 + +--- + +## Notes on the bootstrap + +- The bootstrap is non-parametric +- Better percentile bootstrap confidence intervals correct for bias +- There are lots of variations on bootstrap procedures; the book "An Introduction to the 
Bootstrap" by Efron and Tibshirani is a great place to start for both bootstrap and jackknife information
+
+
+---
+## Group comparisons
+- Consider comparing two independent groups.
+- For example, comparing sprays B and C
+
+plot of chunk unnamed-chunk-7
+
+---
+## Permutation tests
+- Consider the null hypothesis that the distribution of the observations from each group is the same
+- Then, the group labels are irrelevant
+- Consider a data frame with count and spray
+- Permute the spray (group) labels
+- Recalculate the statistic
+  - Mean difference in counts
+  - Geometric means
+  - T statistic
+- Calculate the percentage of simulations where
+the simulated statistic was more extreme (toward
+the alternative) than the observed
+
+---
+## Variations on permutation testing
+Data type | Statistic | Test name
+---|---|---|
+Ranks | rank sum | rank sum test
+Binary | hypergeometric prob | Fisher's exact test
+Raw data | | ordinary permutation test
+
+- Also, so-called *randomization tests* are exactly permutation tests, with a different motivation.
+- For matched data, one can randomize the signs
+  - For ranks, this results in the signed rank test
+- Permutation strategies work for regression as well
+  - Permuting a regressor of interest
+- Permutation tests work very well in multivariate settings
+
+---
+## Permutation test B v C
+
+```r
+subdata <- InsectSprays[InsectSprays$spray %in% c("B", "C"),]
+y <- subdata$count
+group <- as.character(subdata$spray)
+testStat <- function(w, g) mean(w[g == "B"]) - mean(w[g == "C"])
+observedStat <- testStat(y, group)
+permutations <- sapply(1 : 10000, function(i) testStat(y, sample(group)))
+observedStat
+```
+
+```
+## [1] 13.25
+```
+
+```r
+mean(permutations > observedStat)
+```
+
+```
+## [1] 0
+```
+
+---
+## Histogram of permutations B v C
+plot of chunk unnamed-chunk-9
diff --git a/06_StatisticalInference/13_Resampling/index.pdf b/06_StatisticalInference/13_Resampling/index.pdf
new file mode 100644
index 000000000..ce8822a3c
Binary files /dev/null and b/06_StatisticalInference/13_Resampling/index.pdf differ
diff --git a/06_StatisticalInference/03_06_resampledInference/lecture12.tex b/06_StatisticalInference/13_Resampling/lecture12.tex
similarity index 100%
rename from 06_StatisticalInference/03_06_resampledInference/lecture12.tex
rename to 06_StatisticalInference/13_Resampling/lecture12.tex
diff --git a/06_StatisticalInference/Random Formulae/Random Formulae.pdf b/06_StatisticalInference/Random Formulae/Random Formulae.pdf
deleted file mode 100644
index 1d5418411..000000000
Binary files a/06_StatisticalInference/Random Formulae/Random Formulae.pdf and /dev/null differ
diff --git a/06_StatisticalInference/Random Formulae/index.Rmd b/06_StatisticalInference/Random Formulae/index.Rmd
deleted file mode 100644
index 3ca20a730..000000000
--- a/06_StatisticalInference/Random Formulae/index.Rmd
+++ /dev/null
@@ -1,141 +0,0 @@
----
-title : Random Formulae
-subtitle : Mathematical Biostatistics Boot Camp
-author : Brian Caffo, PhD
-job : Johns Hopkins Bloomberg School of 
Public Health -logo : bloomberg_shield.png -framework : io2012 # {io2012, html5slides, shower, dzslides, ...} -highlighter : highlight.js # {highlight.js, prettify, highlight} -hitheme : tomorrow # -url: - lib: ../../libraries - assets: ../../assets -widgets : [mathjax] # {mathjax, quiz, bootstrap} -mode : selfcontained # {standalone, draft} ---- - -## About this document - -This document contains random formulae images I used in the notes. - ---- - -$$A = \{1, 2\}$$ -$$B = \{1, 2, 3\}$$ - ---- - -$$ -\begin{eqnarray} -E[X^2] & = & \int_0^1 x^2 dx \\ - & = & \left. \frac{x^3}{3} \right|_0^1 = \frac{1}{3} -\end{eqnarray} -$$ - ---- - -$$\frac{|x - \mu|}{k\sigma} > 1$$ -Over the set $\{x : |x - \mu | > k\sigma\}$ -$$\frac{(x - \mu)^2}{k^2\sigma^2} > 1$$ -$$\frac{1}{k^2\sigma^2} \int_{-\infty}^\infty (x - \mu)^2 f(x) dx$$ -$$\frac{1}{k^2\sigma^2} E[(X - \mu)^2] = \frac{1}{k^2\sigma^2} Var(X)$$ - ---- - -$$P(A_1 \cup A_2 \cup A_3) = P\{A_1 \cup (A_2 \cup A_3)\} = P(A_1) + P(A_2 \cup A_3)$$ -$$P(A_1) + P(A_2 \cup A_3) = P(A_1) + P(A_2) + P(A_3)$$ - ---- - -$$P(\cup_{i=1}^n E_i) = P\left\{E_n \cup \left(\cup_{i=1}^{n-1} E_i \right) \right\}$$ - ---- - -$$ -(x_1, x_2, x_3, x_4) = (1, 0, 1, 1) -$$ -$$ -p^{(1 + 0 + 1 + 1)}(1 - p)^{\{4 - (1 + 0 + 1 + 1)\}} = p^3 (1 - p)^1 -$$ -$$ -\mathrm{SD}(X) \mathrm{SD}(Y) -$$ -$$ -Var(X) -$$ -$$ -Var(X) = E[X^2] - E[X]^2 \rightarrow E[X^2] = Var(X) + E[X]^2 = \sigma^2 + \mu^2 -$$ -$$ -Var(\bar X) = E[\bar X^2] - E[\bar X]^2 \rightarrow E[\bar X^2] = Var(\bar X) + E[\bar X]^2 = \sigma^2/n + \mu^2 -$$ -$$ -f(x | y = 5) = \frac{f_{xy}(x, 5)}{f_y(5)} -$$ - ---- - -$$ -P(A\cap B) -$$ -$$ -P(A) -$$ -$$ -P(A\cap B^c) -$$ - ---- - -$$ -\frac{10!}{1!9!} = \frac{10\times 9 \times 8 \times \ldots \times 1}{9 \times 8 \times \ldots \times 1} = 10 -$$ - -$$ -\frac{10!}{2!8!} = \frac{10\times 9 \times 8 \times \ldots \times 1}{2 \times 1 \times 8 \times 7 \times \ldots \times 1} = 45 -$$ - -In general - -$\left(\begin{array}{c}n \\ 
2\end{array}\right)= \frac{n \times (n - 1)}{2}$ - -$$ -\mu -$$ - -$$ -\sigma^2 -$$ - -$$ -E[Z] = E\left[\frac{X - \mu}{\sigma} \right] = \frac{E[X] - \mu}{\sigma} = 0 -$$ - ---- - -$$ -Var(Z) = Var\left(\frac{X - \mu}{\sigma}\right) = \frac{1}{\sigma^2} Var(X - \mu) = \frac{1}{\sigma^2} Var(X) = 1 -$$ - ---- - -$$ -E[X_i^2] = E[Y_i] = \sigma^2 + \mu^2 -$$ -$$ -\sum_{i=1}^n (X_i - \bar X)^2 = \sum_{i=1}^2 X_i^2 - n \bar X ^ 2 -$$ - ---- - -$$ -E[\chi^2_{df}] = df -$$ -$$ -E[S^2] = \sigma^2 -\rightarrow -E\left[\frac{(n-1)S^2}{\sigma^2}\right] = (n-1) -$$ -$$ -Var(\chi^2_{df}) = 2df -$$ \ No newline at end of file diff --git a/06_StatisticalInference/Random Formulae/index.html b/06_StatisticalInference/Random Formulae/index.html deleted file mode 100644 index 7ab6f0bff..000000000 --- a/06_StatisticalInference/Random Formulae/index.html +++ /dev/null @@ -1,288 +0,0 @@ - - - - Random Formulae - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-

Random Formulae

-

Mathematical Biostatistics Boot Camp

-

Brian Caffo, PhD
Johns Hopkins Bloomberg School of Public Health

-
-
- - - -
-

About this document

-
-
-

This document contains random formulae images I used in the notes.

- -
- -
- - -
- -
-
-

\[A = \{1, 2\}\] -\[B = \{1, 2, 3\}\]

- -
- -
- - -
- -
-
-

\[ -\begin{eqnarray} -E[X^2] & = & \int_0^1 x^2 dx \\ - & = & \left. \frac{x^3}{3} \right|_0^1 = \frac{1}{3} -\end{eqnarray} -\]

- -
- -
- - -
- -
-
-

\[\frac{|x - \mu|}{k\sigma} > 1\] -Over the set \(\{x : |x - \mu | > k\sigma\}\) -\[\frac{(x - \mu)^2}{k^2\sigma^2} > 1\] -\[\frac{1}{k^2\sigma^2} \int_{-\infty}^\infty (x - \mu)^2 f(x) dx\] -\[\frac{1}{k^2\sigma^2} E[(X - \mu)^2] = \frac{1}{k^2\sigma^2} Var(X)\]

- -
- -
- - -
- -
-
-

\[P(A_1 \cup A_2 \cup A_3) = P\{A_1 \cup (A_2 \cup A_3)\} = P(A_1) + P(A_2 \cup A_3)\] -\[P(A_1) + P(A_2 \cup A_3) = P(A_1) + P(A_2) + P(A_3)\]

- -
- -
- - -
- -
-
-

\[P(\cup_{i=1}^n E_i) = P\left\{E_n \cup \left(\cup_{i=1}^{n-1} E_i \right) \right\}\]

- -
- -
- - -
- -
-
-

\[ -(x_1, x_2, x_3, x_4) = (1, 0, 1, 1) -\] -\[ -p^{(1 + 0 + 1 + 1)}(1 - p)^{\{4 - (1 + 0 + 1 + 1)\}} = p^3 (1 - p)^1 -\] -\[ -\mathrm{SD}(X) \mathrm{SD}(Y) -\] -\[ -Var(X) -\] -\[ -Var(X) = E[X^2] - E[X]^2 \rightarrow E[X^2] = Var(X) + E[X]^2 = \sigma^2 + \mu^2 -\] -\[ -Var(\bar X) = E[\bar X^2] - E[\bar X]^2 \rightarrow E[\bar X^2] = Var(\bar X) + E[\bar X]^2 = \sigma^2/n + \mu^2 -\] -\[ -f(x | y = 5) = \frac{f_{xy}(x, 5)}{f_y(5)} -\]

- -
- -
- - -
- -
-
-

\[ -P(A\cap B) -\] -\[ -P(A) -\] -\[ -P(A\cap B^c) -\]

- -
- -
- - -
- -
-
-

\[ -\frac{10!}{1!9!} = \frac{10\times 9 \times 8 \times \ldots \times 1}{9 \times 8 \times \ldots \times 1} = 10 -\]

- -

\[ -\frac{10!}{2!8!} = \frac{10\times 9 \times 8 \times \ldots \times 1}{2 \times 1 \times 8 \times 7 \times \ldots \times 1} = 45 -\]

- -

In general

- -

\(\left(\begin{array}{c}n \\ 2\end{array}\right)= \frac{n \times (n - 1)}{2}\)

- -

\[ -\mu -\]

- -

\[ -\sigma^2 -\]

- -

\[ -E[Z] = E\left[\frac{X - \mu}{\sigma} \right] = \frac{E[X] - \mu}{\sigma} = 0 -\]

- -
- -
- - -
- -
-
-

\[ -Var(Z) = Var\left(\frac{X - \mu}{\sigma}\right) = \frac{1}{\sigma^2} Var(X - \mu) = \frac{1}{\sigma^2} Var(X) = 1 -\]

- -
- -
- - -
- -
-
-

\[ -E[X_i^2] = E[Y_i] = \sigma^2 + \mu^2 -\] -\[ -\sum_{i=1}^n (X_i - \bar X)^2 = \sum_{i=1}^2 X_i^2 - n \bar X ^ 2 -\]

- -
- -
- - -
- -
-
-

\[ -E[\chi^2_{df}] = df -\] -\[ -E[S^2] = \sigma^2 -\rightarrow -E\left[\frac{(n-1)S^2}{\sigma^2}\right] = (n-1) -\] -\[ -Var(\chi^2_{df}) = 2df -\]

- -
- -
- - -
- - - - - - - - - - - - - - - - - \ No newline at end of file diff --git a/06_StatisticalInference/Random Formulae/index.md b/06_StatisticalInference/Random Formulae/index.md deleted file mode 100644 index 4262bd6b5..000000000 --- a/06_StatisticalInference/Random Formulae/index.md +++ /dev/null @@ -1,141 +0,0 @@ ---- -title : Random Formulae -subtitle : Mathematical Biostatistics Boot Camp -author : Brian Caffo, PhD -job : Johns Hopkins Bloomberg School of Public Health -logo : bloomberg_shield.png -framework : io2012 # {io2012, html5slides, shower, dzslides, ...} -highlighter : highlight.js # {highlight.js, prettify, highlight} -hitheme : tomorrow # -url: - lib: ../../libraries - assets: ../../assets -widgets : [mathjax] # {mathjax, quiz, bootstrap} -mode : selfcontained # {standalone, draft} ---- - -## About this document - -This document contains random formulae images I used in the notes. - ---- - -$$A = \{1, 2\}$$ -$$B = \{1, 2, 3\}$$ - ---- - -$$ -\begin{eqnarray} -E[X^2] & = & \int_0^1 x^2 dx \\ - & = & \left. 
\frac{x^3}{3} \right|_0^1 = \frac{1}{3} -\end{eqnarray} -$$ - ---- - -$$\frac{|x - \mu|}{k\sigma} > 1$$ -Over the set $\{x : |x - \mu | > k\sigma\}$ -$$\frac{(x - \mu)^2}{k^2\sigma^2} > 1$$ -$$\frac{1}{k^2\sigma^2} \int_{-\infty}^\infty (x - \mu)^2 f(x) dx$$ -$$\frac{1}{k^2\sigma^2} E[(X - \mu)^2] = \frac{1}{k^2\sigma^2} Var(X)$$ - ---- - -$$P(A_1 \cup A_2 \cup A_3) = P\{A_1 \cup (A_2 \cup A_3)\} = P(A_1) + P(A_2 \cup A_3)$$ -$$P(A_1) + P(A_2 \cup A_3) = P(A_1) + P(A_2) + P(A_3)$$ - ---- - -$$P(\cup_{i=1}^n E_i) = P\left\{E_n \cup \left(\cup_{i=1}^{n-1} E_i \right) \right\}$$ - ---- - -$$ -(x_1, x_2, x_3, x_4) = (1, 0, 1, 1) -$$ -$$ -p^{(1 + 0 + 1 + 1)}(1 - p)^{\{4 - (1 + 0 + 1 + 1)\}} = p^3 (1 - p)^1 -$$ -$$ -\mathrm{SD}(X) \mathrm{SD}(Y) -$$ -$$ -Var(X) -$$ -$$ -Var(X) = E[X^2] - E[X]^2 \rightarrow E[X^2] = Var(X) + E[X]^2 = \sigma^2 + \mu^2 -$$ -$$ -Var(\bar X) = E[\bar X^2] - E[\bar X]^2 \rightarrow E[\bar X^2] = Var(\bar X) + E[\bar X]^2 = \sigma^2/n + \mu^2 -$$ -$$ -f(x | y = 5) = \frac{f_{xy}(x, 5)}{f_y(5)} -$$ - ---- - -$$ -P(A\cap B) -$$ -$$ -P(A) -$$ -$$ -P(A\cap B^c) -$$ - ---- - -$$ -\frac{10!}{1!9!} = \frac{10\times 9 \times 8 \times \ldots \times 1}{9 \times 8 \times \ldots \times 1} = 10 -$$ - -$$ -\frac{10!}{2!8!} = \frac{10\times 9 \times 8 \times \ldots \times 1}{2 \times 1 \times 8 \times 7 \times \ldots \times 1} = 45 -$$ - -In general - -$\left(\begin{array}{c}n \\ 2\end{array}\right)= \frac{n \times (n - 1)}{2}$ - -$$ -\mu -$$ - -$$ -\sigma^2 -$$ - -$$ -E[Z] = E\left[\frac{X - \mu}{\sigma} \right] = \frac{E[X] - \mu}{\sigma} = 0 -$$ - ---- - -$$ -Var(Z) = Var\left(\frac{X - \mu}{\sigma}\right) = \frac{1}{\sigma^2} Var(X - \mu) = \frac{1}{\sigma^2} Var(X) = 1 -$$ - ---- - -$$ -E[X_i^2] = E[Y_i] = \sigma^2 + \mu^2 -$$ -$$ -\sum_{i=1}^n (X_i - \bar X)^2 = \sum_{i=1}^2 X_i^2 - n \bar X ^ 2 -$$ - ---- - -$$ -E[\chi^2_{df}] = df -$$ -$$ -E[S^2] = \sigma^2 -\rightarrow -E\left[\frac{(n-1)S^2}{\sigma^2}\right] = (n-1) -$$ -$$ -Var(\chi^2_{df}) = 2df 
-$$
diff --git a/06_StatisticalInference/cp.R b/06_StatisticalInference/cp.R
new file mode 100644
index 000000000..2365695e8
--- /dev/null
+++ b/06_StatisticalInference/cp.R
@@ -0,0 +1,26 @@
+## A program for copying the index.pdf files and naming them
+## appropriately in the lectures directory
+## Brian Caffo
+##
+## Has to be run within the directory and won't overwrite
+## unless you change this to TRUE
+overwrite = FALSE
+
+## Get the directory names (they all start with 0)
+dirNames <- dir(pattern = "^[0-1][0-9]_[a-zA-Z]")
+
+## Loop over them and copy the pdf files
+sapply(dirNames, function(x)
+    file.copy(from = paste(x, "/index.pdf", sep = ""),
+              to = paste("lectures/", x, ".pdf", sep = ""),
+              overwrite = overwrite
+              )
+    )
+
+## Loop over them and copy the RMD files
+sapply(dirNames, function(x)
+    file.copy(from = paste(x, "/index.Rmd", sep = ""),
+              to = paste("rmd/", x, ".Rmd", sep = ""),
+              overwrite = overwrite
+              )
+)
diff --git a/06_StatisticalInference/grading.md b/06_StatisticalInference/grading.md
deleted file mode 100644
index c846b9967..000000000
--- a/06_StatisticalInference/grading.md
+++ /dev/null
@@ -1,17 +0,0 @@
-## Grading and logistics
-
-The grading in this class is very straightforward.
-
-1. There are four quizzes, each containing in the neighborhood of 10 questions.
-2. Each question is equally weighted as 1 point.
-3. Some require two answers, each giving half of a point (for a maximum total of 1 point for those questions).
-4. Your total points is the sum of the points questions across all quizzes that you answered correctly (using all of your quiz attempts).
-5. 70% or more of the total points is a pass for the class.
-6. 80% or more of the total points is a pass with distinction.
- - - - - - - diff --git a/06_StatisticalInference/homework/hw1.Rmd b/06_StatisticalInference/homework/hw1.Rmd index f5476f5c7..edba182ee 100644 --- a/06_StatisticalInference/homework/hw1.Rmd +++ b/06_StatisticalInference/homework/hw1.Rmd @@ -1,187 +1,188 @@ ---- -title : Homework 1 for Stat Inference -subtitle : Extra problems for Stat Inference -author : Brian Caffo -job : Johns Hopkins Bloomberg School of Public Health -framework : io2012 -highlighter : highlight.js -hitheme : tomorrow -#url: -# lib: ../../librariesNew #Remove new if using old slidify -# assets: ../../assets -widgets : [mathjax, quiz, bootstrap] -mode : selfcontained # {standalone, draft} ---- -```{r setup, cache = F, echo = F, message = F, warning = F, tidy = F, results='hide'} -# make this an external chunk that can be included in any file -library(knitr) -options(width = 100) -opts_chunk$set(message = F, error = F, warning = F, comment = NA, fig.align = 'center', dpi = 100, tidy = F, cache.path = '.cache/', fig.path = 'fig/') - -options(xtable.type = 'html') -knit_hooks$set(inline = function(x) { - if(is.numeric(x)) { - round(x, getOption('digits')) - } else { - paste(as.character(x), collapse = ', ') - } -}) -knit_hooks$set(plot = knitr:::hook_plot_html) -runif(1) -``` - -## About these slides -- These are some practice problems for Statistical Inference Quiz 1 -- They were created using slidify interactive which you will learn in -Creating Data Products -- Please help improve this with pull requests here -(https://github.com/bcaffo/courses) - - ---- &radio - -Consider influenza epidemics for two parent heterosexual families. Suppose that the probability is 15% that at least one of the parents has contracted the disease. The probability that the father has contracted influenza is 10% while that the mother contracted the disease is 9%. What is the probability that both contracted influenza expressed as a whole number percentage? - -1. 15% -2. 10% -3. 9% -4. 
_4%_ - -*** .hint -$A = Father$, $P(A) = .10$, $B = Mother$, $P(B) = .09$ -$P(A\cup B) = .15$, - -*** .explanation -$P(A\cup B) = P(A) + P(B) - P(AB)$ thus -$$.15 = .10 + .09 - P(AB)$$ -```{r} -.10 + .09 - .15 -``` - ---- &radio - -A random variable, $X$, is uniform, a box from $0$ to $1$ of height $1$. (So that it's density is $f(x) = 1$ for $0\leq x \leq 1$.) What is it's median expressed to two decimal places?

- -1. 1.00 -2. 0.75 -3. _0.50_ -4. 0.25 - -*** .hint -The median is the point so that 50% of the density lies below it. - -*** .explanation -This density looks like a box. So, notice that $P(X \leq x) = width\times height = x$. -We want $.5 = P(X\leq x) = x$. - ---- &radio - -You are playing a game with a friend where you flip a coin and if it comes up heads you give her $X$ dollars and if it comes up tails she gives you $Y$ dollars. The odds that the coin is heads in $d$. What is your expected earnings? - -1. _$-X \frac{d}{1 + d} + Y \frac{1}{1+d} $_ -2. $X \frac{d}{1 + d} + Y \frac{1}{1+d} $ -3. $X \frac{d}{1 + d} - Y \frac{1}{1+d} $ -4. $-X \frac{d}{1 + d} - Y \frac{1}{1+d} $ - -*** .hint -The probability that you win on a given round is given by $p / (1 - p) = d$ which implies -that $p = d / (1 + d)$. - -*** .explanation -You lose $X$ with probability $p = d/(1 +d)$ and you win $Y$ with probability $1-p = 1/(1 + d)$. So your answer is -$$ --X \frac{d}{1 + d} + Y \frac{1}{1+d} -$$ - ---- &radio -A random variable takes the value -4 with probabability .2 and 1 with proabability .8. What -is the variance of this random variable? - -1. 0 -2. _4_ -3. 8 -4. 16 - -*** .hint -This random variable has mean 0. The variance would be given by $E[X^2]$ then. - -*** .explanation -$$E[X] = 0$$ -$$ -Var(X) = E[X^2] = (-4)^2 * .2 + (1)^2 * .8 -$$ -```{r} --4 * .2 + 1 * .8 -(-4)^2 * .2 + (1)^2 * .8 -``` - - ---- &radio -If $\bar X$ and $\bar Y$ are comprised of $n$ iid random variables arising from distributions -having means $\mu_x$ and $\mu_y$, respectively and common variance $\sigma^2$ -what is the variance $\bar X - \bar Y$? - -1. 0 -2. _$2\sigma^2/n$_ -3. $\mu_x$ - $\mu_y$ -4. $2\sigma^2$ - -*** .hint -Remember that $Var(\bar X) = Var(\bar Y) = \sigma^2 / n$. - -*** .explanation -$$ -Var(\bar X - \bar Y) = Var(\bar X) + Var(\bar Y) = \sigma^2 / n + \sigma^2 / n -$$ - ---- &radio -Let $X$ be a random variable having standard deviation $\sigma$. 
What can -be said about $X /\sigma$? - -1. Nothing -2. _It must have variance 1._ -3. It must have mean 0. -4. It must have variance 0. - -*** .hint -$Var(aX) = a^2 Var(X)$ - -*** .explanation -$$Var(X / \sigma) = Var(X) / \sigma^2 = 1$$ - - ---- &radio -If a continuous density that never touches the horizontal axis is symmetric about zero, can we say that its associated median is zero? - -1. _Yes_ -2. No. -3. It can not be determined given the information given. - -*** .explanation -This is a surprisingly hard problem. The easy explanation is that 50% of the probability -is below 0 and 50% is above so yes. However, it is predicated on the density not being -a flat line at 0 around 0. That's why the caveat that it never touches the horizontal axis -is important. - - ---- &radio - -Consider the following pmf given in R -```{r} -p <- c(.1, .2, .3, .4) -x <- 2 : 5 -``` -What is the variance expressed to 1 decimal place? - -1. _1.0_ -2. 4.0 -3. 6.0 -4. 17.0 - -*** .hint -The variance is $E[X^2] - E[X^2]$ - -*** .explanation -```{r} -sum(x ^ 2 * p) - sum(x * p) ^ 2 -``` +--- +title : Homework 1 for Stat Inference +subtitle : (Use arrow keys to navigate) +author : Brian Caffo +job : Johns Hopkins Bloomberg School of Public Health +framework : io2012 +highlighter : highlight.js +hitheme : tomorrow +#url: +# lib: ../../librariesNew #Remove new if using old slidify +# assets: ../../assets +widgets : [mathjax, quiz, bootstrap] +mode : selfcontained # {standalone, draft} +--- +```{r setup, cache = F, echo = F, message = F, warning = F, tidy = F, results='hide'} +# make this an external chunk that can be included in any file +library(knitr) +options(width = 100) +opts_chunk$set(message = F, error = F, warning = F, comment = NA, fig.align = 'center', dpi = 100, tidy = F, cache.path = '.cache/', fig.path = 'fig/') + +options(xtable.type = 'html') +knit_hooks$set(inline = function(x) { + if(is.numeric(x)) { + round(x, getOption('digits')) + } else { + paste(as.character(x), 
collapse = ', ')
+  }
+})
+knit_hooks$set(plot = knitr:::hook_plot_html)
+runif(1)
+```
+
+## About these slides
+- These are some practice problems for Statistical Inference Quiz 1
+- They were created using slidify interactive which you will learn in
+Creating Data Products
+- Please help improve this with pull requests here
+(https://github.com/bcaffo/courses)
+
+--- &radio
+
+Consider influenza epidemics for two parent heterosexual families. Suppose that the probability is 15% that at least one of the parents has contracted the disease. The probability that the father has contracted influenza is 10% and the probability that the mother contracted the disease is 9%. What is the probability that both contracted influenza expressed as a whole number percentage?
+[Watch a video solution](https://www.youtube.com/watch?v=CvnmoCuIN08&index=1&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L)
+
+1. 15%
+2. 10%
+3. 9%
+4. _4%_
+
+*** .hint
+$A = Father$, $P(A) = .10$, $B = Mother$, $P(B) = .09$
+$P(A\cup B) = .15$,
+
+*** .explanation
+$P(A\cup B) = P(A) + P(B) - P(AB)$ thus
+$$.15 = .10 + .09 - P(AB)$$
+```{r}
+.10 + .09 - .15
+```
+
+--- &radio
+
+A random variable, $X$, is uniform, a box from $0$ to $1$ of height $1$. (So that its density is $f(x) = 1$ for $0\leq x \leq 1$.) What is its median expressed to two decimal places?
+[Watch a video solution.](https://www.youtube.com/watch?v=UXcarD-1xAM&index=2&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L)

+
+1. 1.00
+2. 0.75
+3. _0.50_
+4. 0.25
+
+*** .hint
+The median is the point so that 50% of the density lies below it.
+
+*** .explanation
+This density looks like a box. So, notice that $P(X \leq x) = width\times height = x$.
+We want $.5 = P(X\leq x) = x$.
+
+--- &radio
+
+You are playing a game with a friend where you flip a coin and if it comes up heads you give her $X$ dollars and if it comes up tails she gives you $Y$ dollars. The odds that the coin is heads is $d$. What are your expected earnings? [Watch a video solution.](https://www.youtube.com/watch?v=5J88Zq0q81o&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L&index=3)
+
+1. _$-X \frac{d}{1 + d} + Y \frac{1}{1+d} $_
+2. $X \frac{d}{1 + d} + Y \frac{1}{1+d} $
+3. $X \frac{d}{1 + d} - Y \frac{1}{1+d} $
+4. $-X \frac{d}{1 + d} - Y \frac{1}{1+d} $
+
+*** .hint
+The odds that you lose on a given round are given by $p / (1 - p) = d$ which implies
+that $p = d / (1 + d)$.
+
+*** .explanation
+You lose $X$ with probability $p = d/(1 +d)$ and you win $Y$ with probability $1-p = 1/(1 + d)$. So your answer is
+$$
+-X \frac{d}{1 + d} + Y \frac{1}{1+d}
+$$
+
+--- &radio
+A random variable takes the value -4 with probability .2 and 1 with probability .8. What
+is the variance of this random variable? [Watch a video solution.](https://www.youtube.com/watch?v=Em-xJeQO1rc&index=4&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L)
+
+1. 0
+2. _4_
+3. 8
+4. 16
+
+*** .hint
+This random variable has mean 0. The variance would be given by $E[X^2]$ then.
+
+*** .explanation
+$$E[X] = 0$$
+$$
+Var(X) = E[X^2] = (-4)^2 * .2 + (1)^2 * .8
+$$
+```{r}
+-4 * .2 + 1 * .8
+(-4)^2 * .2 + (1)^2 * .8
+```
+
+
+--- &radio
+If $\bar X$ and $\bar Y$ are comprised of $n$ iid random variables arising from distributions
+having means $\mu_x$ and $\mu_y$, respectively and common variance $\sigma^2$
+what is the variance of $\bar X - \bar Y$? 
[Watch a video solution of this problem.](https://www.youtube.com/watch?v=7zJhPzX6jns&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L&index=5)
+
+1. 0
+2. _$2\sigma^2/n$_
+3. $\mu_x - \mu_y$
+4. $2\sigma^2$
+
+*** .hint
+Remember that $Var(\bar X) = Var(\bar Y) = \sigma^2 / n$.
+
+*** .explanation
+$$
+Var(\bar X - \bar Y) = Var(\bar X) + Var(\bar Y) = \sigma^2 / n + \sigma^2 / n
+$$
+
+--- &radio
+Let $X$ be a random variable having standard deviation $\sigma$. What can
+be said about $X /\sigma$? [Watch a video solution of this problem.](https://www.youtube.com/watch?v=0WUj18_BUPA&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L&index=6)
+
+1. Nothing
+2. _It must have variance 1._
+3. It must have mean 0.
+4. It must have variance 0.
+
+*** .hint
+$Var(aX) = a^2 Var(X)$
+
+*** .explanation
+$$Var(X / \sigma) = Var(X) / \sigma^2 = 1$$
+
+
+--- &radio
+If a continuous density that never touches the horizontal axis is symmetric about zero, can we say that its associated median is zero? [Watch a video solution.](https://www.youtube.com/watch?v=sn48CGH_TXI&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L&index=7)
+
+1. _Yes_
+2. No.
+3. It cannot be determined from the information given.
+
+*** .explanation
+This is a surprisingly hard problem. The easy explanation is that 50% of the probability
+is below 0 and 50% is above so yes. However, it is predicated on the density not being
+a flat line at 0 around 0. That's why the caveat that it never touches the horizontal axis
+is important.
+
+
+--- &radio
+
+Consider the following pmf given in R
+```{r}
+p <- c(.1, .2, .3, .4)
+x <- 2 : 5
+```
+What is the variance expressed to 1 decimal place? [Watch a solution to this problem.](https://www.youtube.com/watch?v=sn48CGH_TXI&list=PLpl-gQkQivXhHOcVeU3bSJg78zaDYbP9L&index=7)
+
+1. _1.0_
+2. 4.0
+3. 6.0
+4. 
17.0
+
+*** .hint
+The variance is $E[X^2] - E[X]^2$
+
+*** .explanation
+```{r}
+sum(x ^ 2 * p) - sum(x * p) ^ 2
+```
diff --git a/06_StatisticalInference/homework/hw1.html b/06_StatisticalInference/homework/hw1.html
index 3db02f1ae..ee036ef46 100644
--- a/06_StatisticalInference/homework/hw1.html
+++ b/06_StatisticalInference/homework/hw1.html
@@ -34,7 +34,7 @@

Homework 1 for Stat Inference

-

Extra problems for Stat Inference

+

(Use arrow keys to navigate)

Brian Caffo
Johns Hopkins Bloomberg School of Public Health

@@ -49,7 +49,7 @@

About these slides