You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: abundance-measurement.Rmd
+4-2Lines changed: 4 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -122,7 +122,7 @@ We can rewrite the species' density measurements to account for the effect of ta
122
122
\end{align}
123
123
The variable fold error that bias creates in the measured proportions directly transfers to the density measurements.
124
124
125
-
Taxonomic bias can also create systematic error in the total-density measurements (Appendix \@ref(appendix-total-density)).
125
+
Taxonomic bias can also create systematic error in the total-density measurements (Appendix \@ref(total-density-bias)).
126
126
For now, we suppose that the total density can be measured accurately;
127
127
later, Section \@ref(solutions) describes how experiments might be designed such that taxonomic bias in the total density measurement offsets that in the MGS proportions, leading to a (more) consistent fold error in the measured densities.
128
128
@@ -147,9 +147,11 @@ It follows from Equation \@ref(eq:ratio-error) that the error caused by taxonomi
147
147
Here, it is the constant error in the measured ratio of species $i$ to $r$ that propagates into the density measurement.
148
148
There will generally be systematic error in the density of the reference species; however, if the systematic fold error is constant across samples, so will be that of the densities of other species.
149
149
150
+
<!-- NOTE: the above doesn't quite make sense for DNA spike-ins -->
151
+
150
152
**Differences between the two approaches:**
151
153
Equations \@ref(eq:density-prop-error) and \@ref(eq:density-ratio-error) show that the manner in which taxonomic bias in the sequencing measurement impacts the two approaches mirrors that of proportions and ratios.
152
154
If the fold error in the total-density or reference species measurement is negligible (or at least constant) then the fold errors in species densities by the total-density approach vary across samples with the mean efficiency, whereas the fold errors by the reference-species approach are constant.
153
155
The fundamental determinant of whether constant error is obtained is not which of these two equations is ultimately used, but rather whether densities of individual species, which (we assume) have constant efficiency, or the total community, which has varying efficiency, provide the absolute-abundance information.
154
156
Studies using spike-ins as constant reference species will often, instead of using Equation \@ref(eq:density-ratio-meas), first use the spike-in species to estimate the total density, and then estimate the density of focal species with Equation \@ref(eq:density-prop-meas).
155
-
If the calculation is done carefully, however, this method still yields constant fold errors so long as the species efficiencies (including those of the spike-in) are constant across samples (Appendix \@ref(appendix-total-density)).
157
+
If the calculation is done carefully, however, this method still yields constant fold errors so long as the species efficiencies (including those of the spike-in) are constant across samples (Appendix \@ref(total-density-ref)).
Copy file name to clipboardExpand all lines: solutions.Rmd
+2-2Lines changed: 2 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -69,7 +69,7 @@ Consider the popular method of assessing bacterial abundance with total-16S qPCR
69
69
Even an ideal qPCR protocol that perfectly amplifies all species remains a biased measurement of cell density, since the total 16S copies in the extracted DNA contributed by each species will be proportional to its (species-specific) lysis efficiency and 16S copy-number.
70
70
16S qPCR is commonly paired with 16S amplicon sequencing, with which these sources of bias are shared, perhaps along with variation in primer binding and amplification efficiency.
71
71
Methods like using flow cytometry that directly measures cell density lacks these biases but likely have their own that are more likely to be orthogonal to the 16S sequencing measurement.
72
-
By extending the analysis of Section \@ref(differential-abundance) to include consistent taxonomic bias in the total-density measurement, we find that pairings of total-density and sequencing measurements that share large sources of taxonomic bias can lead to an offsetting of errors that reduces the error in fold-change measurement and DA inference (Appendix \@ref(appendix-total-density)).
72
+
By extending the analysis of Section \@ref(differential-abundance) to include consistent taxonomic bias in the total-density measurement, we find that pairings of total-density and sequencing measurements that share large sources of taxonomic bias can lead to an offsetting of errors that reduces the error in fold-change measurement and DA inference (Appendix \@ref(total-density-bias)).
73
73
This finding suggests that methods of total-density measurement that are more accurate as measures of total cell density may actually perform worse than less accurate methods for the purposes of absolute DA inference.
74
74
75
75
We illustrate with a hypothetical example of two vaginal communities of equal density, but which are dominated either by _Lactobacillus iners_ or _Gardnerella vaginalis_.
@@ -84,7 +84,7 @@ If the (log) efficiencies of the total and sequencing measurement are positively
84
84
These observations suggest that qPCR of a marker gene may be the ideal pairing for amplicon sequencing measurements, despite being a poor measure of changes in total cell density.
85
85
Similarly, bulk DNA quantification may be an ideal pairing for shotgun sequencing, making it possible to account for variation in lysis efficiency and genome length.
86
86
These pairings even make it possible to account for error caused by variation among samples in the fraction of unclassified reads, which can form a major fraction of amplicon and shotgun data.
87
-
Importantly, maximizing the offsetting of errors requires thoughtful choices during bioinformatic analysis, perhaps eschewing the filtering and normalization steps used in many software packages and workflows (Appendix \@ref(appendix-total-density)).
87
+
Importantly, maximizing the offsetting of errors requires thoughtful choices during bioinformatic analysis, perhaps eschewing the filtering and normalization steps used in many software packages and workflows (Appendix \@ref(total-density-bias)).
0 commit comments