<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Public Health</journal-id>
<journal-title>Frontiers in Public Health</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Public Health</abbrev-journal-title>
<issn pub-type="epub">2296-2565</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpubh.2014.00249</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Public Health</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Comparison of Two Methods &#x02013; Regression Predictive Model and Intake Shift Model &#x02013; For Adjusting Self-Reported Dietary Recall of Total Energy Intake of Populations</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Lankester</surname> <given-names>Joanna</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/153895"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Perry</surname> <given-names>Sharon</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Parsonnet</surname> <given-names>Julie</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/193769"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Electrical Engineering, Stanford University</institution>, <addr-line>Stanford, CA</addr-line>, <country>USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Medicine, Stanford University School of Medicine</institution>, <addr-line>Stanford, CA</addr-line>, <country>USA</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Health Research and Policy, Stanford University School of Medicine</institution>, <addr-line>Stanford, CA</addr-line>, <country>USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Zhiwei Zhang, Food and Drug Administration, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Yong Ma, George Washington University, USA; Brian H. Chen, National Heart, Lung, and Blood Institute, USA</p></fn>
<corresp content-type="corresp" id="cor1">&#x0002A;Correspondence: Joanna Lankester and Julie Parsonnet, Stanford University School of Medicine, 300 Pasteur Dr, Grant Building S-131, Stanford, CA 94305, USA e-mail: <email>lankester&#x00040;stanfordalumni.org</email>; <email>parsonnt&#x00040;stanford.edu</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Epidemiology, a section of the journal Frontiers in Public Health.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>27</day>
<month>11</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>2</volume>
<elocation-id>249</elocation-id>
<history>
<date date-type="received">
<day>20</day>
<month>06</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>07</day>
<month>11</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Lankester, Perry and Parsonnet.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Daily dietary intake data derived from self-reported dietary recall surveys are widely considered inaccurate. In this study, methods were developed for adjusting these dietary recalls to more plausible values. In a simulation model of two National Health and Nutrition Examination Surveys (NHANES), NHANES I and NHANES 2007&#x02013;2008, a predicted one-third of raw data fell outside a range of physiologically plausible bounds for dietary intake (designated a 33% failure rate baseline). To explore the nature and magnitude of this bias, primary data obtained from an observational study were used to derive models that predicted more plausible dietary intake. Two models were then applied for correcting dietary recall bias in the NHANES datasets: (a) a linear regression to model percent under-reporting as a function of subject characteristics and (b) a shift of dietary intake reports to align with experimental data on energy expenditure. After adjustment, the failure rates improved to &#x0003C;2% with the regression model and 4&#x02013;9% with the intake shift model &#x02013; both substantial improvements over the raw data. Both methods gave more reliable estimates of plausible dietary intake based on dietary recall and have the potential for more far-reaching application in correction of self-reported exposures.</p>
</abstract>
<kwd-group>
<kwd>bias (epidemiology)</kwd>
<kwd>computer simulation</kwd>
<kwd>diet surveys</kwd>
<kwd>energy intake</kwd>
<kwd>NHANES</kwd>
<kwd>questionnaires</kwd>
</kwd-group>
<counts>
<fig-count count="3"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="26"/>
<page-count count="7"/>
<word-count count="5232"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="introduction">
<title>Introduction</title>
<p>Research on diet and obesity has long been hampered by inaccurate estimates of how much we, as individuals and as a population, eat. Individual daily food consumption is often gathered through 24-h dietary recalls, in which individuals report foods and quantities consumed in a given day and total energy intake (EI) is derived.</p>
<p>Dietary recall bias is a well-studied problem in nutrition research (<xref ref-type="bibr" rid="B1">1</xref>&#x02013;<xref ref-type="bibr" rid="B6">6</xref>). In general, individuals tend to under-report their food consumption (<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B8">8</xref>). Substantial efforts are made to minimize biases in primary data collection. For example, dietary surveys such as the National Health and Nutrition Examination Survey (NHANES) use visual aids to help participants recall food quantities, and guidelines detail how to ask questions in a neutral manner (<xref ref-type="bibr" rid="B9">9</xref>). Despite these efforts, individuals continue to misreport.</p>
<p>Some error also arises from the assumption that a dietary recall reflects the average EI value, whereas in reality intake varies from day to day within individuals, so that a recall from a particular day could indicate an intake above or below the value researchers are trying to measure. To adjust for this variation &#x02013; i.e., to improve precision &#x02013; researchers have applied a variety of corrections (<xref ref-type="bibr" rid="B10">10</xref>) such as the NRC (<xref ref-type="bibr" rid="B11">11</xref>), NRC-B (<xref ref-type="bibr" rid="B4">4</xref>), and Iowa State University methods (<xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B13">13</xref>). Models have also been developed to quantify relationships between EI and intake of particular nutrients (<xref ref-type="bibr" rid="B14">14</xref>, <xref ref-type="bibr" rid="B15">15</xref>) or to infer EI from biomarkers in urine (<xref ref-type="bibr" rid="B16">16</xref>). Still, no method exists to reduce bias in the distribution of total EI from dietary recalls in a population using easily measured characteristics of survey participants.</p>
<p>To gain better insight from dietary recall data, especially in epidemiological nutrition models, a method of correcting for misreporting of total EI in a population would be beneficial. Using an observational dataset from the Observing Protein and Energy Nutrition (OPEN) study, we developed two methods to predict corrected dietary recall data. We then tested these methods on NHANES I (<xref ref-type="bibr" rid="B17">17</xref>) and NHANES 2007&#x02013;2008 (<xref ref-type="bibr" rid="B18">18</xref>) datasets by adjusting dietary recall for a population sample. We chose NHANES because of its thorough survey methodology in capturing population health data and its collection over several decades, and we chose these particular NHANES datasets because they were the least and most recent datasets available at the time of this analysis.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<p>We used a dataset from the OPEN study to develop models of misreporting and used two different NHANES datasets to test them (Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<fig position="float" id="F1">
<label>Figure 1</label>
<caption><p><bold>Flow chart for the development and implementation of dietary recall correction models</bold>. Left side, regression model; right side, intake shift model. Two separate analyses on the OPEN dataset produced output models that were then used in simulation models with National Health and Nutrition Examination Survey (NHANES) datasets. The simulations gave final outputs that were used to estimate a corrected energy intake (EI). Physical activity level (PAL)&#x02009;&#x0003D;&#x02009;energy expenditure/resting metabolic rate (RMR); IndEI<sub>claimed</sub>&#x02009;&#x0003D;&#x02009;EI<sub>claimed</sub>/RMR; OPEN&#x02009;&#x0003D;&#x02009;observing protein and energy nutrition.</p></caption>
<graphic xlink:href="fpubh-02-00249-g001.tif"/>
</fig>
<sec id="S2-1">
<title>Training dataset (OPEN study) and test datasets (NHANES)</title>
<p>We requested the dataset from the OPEN study author (<xref ref-type="bibr" rid="B19">19</xref>). The OPEN study compared self-reported caloric dietary intake with measurements of energy expenditure (EE) in 484 participants aged 40&#x02013;69&#x02009;years. The study excluded individuals on a weight loss diet. Subjects self-reported 24-h dietary intake. The investigators then measured their EE using doubly labeled water.</p>
<p>In our analysis, we excluded OPEN subjects who did not complete the doubly labeled water test, leaving 451 individuals in the dataset. Of those remaining, participants were predominantly White non-Hispanic (82% of the participants). Because initial analysis noted a significant difference in reporting between Blacks and non-Blacks, we excluded Blacks from the modeling analysis, leaving a total of 423 subjects.</p>
<p>The publicly available NHANES datasets were downloaded from the Centers for Disease Control and Prevention NHANES website; survey method descriptions are also available (<xref ref-type="bibr" rid="B20">20</xref>). In brief, variables included in our analysis were collected through either anthropometric measurements or self-reported survey. The dietary survey included a listing of all foods consumed within a 24-h period and an estimate of their quantities.</p>
<p>We excluded individuals from the NHANES datasets who reported that they were on a special diet or did not eat their typical diet that day, as well as those who did not report at all. We constrained the minimum age to 18; the maximum age recorded was 74 in NHANES I and 79 in NHANES 2007&#x02013;2008. After these exclusions, the dataset sizes were 8,006 individuals for NHANES I and 4,830 for NHANES 2007&#x02013;2008.</p>
<p>For both datasets, univariate analyses included means and standard deviations for continuous variables and counts and percentages for categorical variables as appropriate.</p>
<sec id="S2-1-1">
<title>Parameters created</title>
<p>We used reported dietary intake information for each eligible OPEN record along with the estimated EE to estimate the accuracy of dietary recall, using the principle that in energy balance, EE should approximate dietary intake.</p>
<p>Variables defined for this analysis included:
<list list-type="order">
<list-item><p>Resting metabolic rate (RMR). We calculated theoretical RMR for individuals using the Schofield equations (<xref ref-type="bibr" rid="B21">21</xref>).</p></list-item>
<list-item><p>Energy expenditure and EI. In energy homeostasis, EE is expected to equal EI (thus, body weight is not changing). The term EI<sub>claimed</sub> denotes self-reported EI in the OPEN dataset, while the term EI refers to the true EI that the models later estimated.</p></list-item>
<list-item><p>Physical activity level (PAL). PAL is the ratio of EE/RMR. EE cannot biologically be less than RMR, so PAL cannot be &#x0003C;1.</p></list-item>
<list-item><p>Index of energy intake (IndEI). IndEI is the ratio of EI/RMR. In energy homeostasis, EI&#x02009;&#x0003D;&#x02009;EE, so IndEI will equal PAL and will then also have a lower bound of 1. IndEI<sub>claimed</sub> denotes IndEI measures derived from self-reported EI in the OPEN dataset, i.e., IndEI<sub>claimed</sub>&#x02009;&#x0003D;&#x02009;EI<sub>claimed</sub>/RMR. This definition allows comparison of EI<sub>claimed</sub> between two individuals with drastically different RMR and therefore different EI needs. It also allows comparison between IndEI<sub>claimed</sub> from the OPEN study and figures for PAL from other sources.</p></list-item>
<list-item><p>Percent misreporting. Because subjects reported eating normally, we assumed homeostasis (EI&#x02009;&#x0003D;&#x02009;EE) and defined percent under-reporting as (EE&#x02009;&#x02212;&#x02009;EI<sub>claimed</sub>)/EE.</p></list-item>
</list></p>
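<p>The derived quantities above can be illustrated with a minimal Python sketch. It is not the authors&#x02019; code: the class and function names are hypothetical, RMR is taken as an input rather than computed from the Schofield equations, and the example participant is invented for illustration.</p>
<preformat><![CDATA[
```python
from dataclasses import dataclass


@dataclass
class EnergyRecord:
    """One participant; all energies in kcal/day (hypothetical container)."""
    rmr: float         # resting metabolic rate, e.g., from a predictive equation
    ee: float          # measured energy expenditure
    ei_claimed: float  # self-reported energy intake


def pal(rec):
    """Physical activity level: EE / RMR."""
    return rec.ee / rec.rmr


def ind_ei_claimed(rec):
    """Index of claimed energy intake: EI_claimed / RMR."""
    return rec.ei_claimed / rec.rmr


def percent_misreporting(rec):
    """100 * (EE - EI_claimed) / EE; positive values mean under-reporting."""
    return 100.0 * (rec.ee - rec.ei_claimed) / rec.ee


# Invented participant, not taken from OPEN or NHANES.
example = EnergyRecord(rmr=1600.0, ee=2600.0, ei_claimed=2080.0)
print(pal(example))                   # 1.625
print(ind_ei_claimed(example))        # 1.3
print(percent_misreporting(example))  # 20.0
```
]]></preformat>
<p>In this invented case, the participant expends 1.625 times RMR but claims an intake of only 1.3 times RMR, which corresponds to 20% under-reporting under the homeostasis assumption.</p>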
</sec>
</sec>
<sec id="S2-2">
<title>Statistical modeling algorithms</title>
<p>We developed several methods for adjusting a distribution of dietary recall to correct for both under- and over-reporting. All methods assumed energy homeostasis, i.e., that body weight is not changing, so that EE equals EI. The methods were: (a) a linear regression, (b) a shift of the population&#x02019;s reported intake by an average caloric offset, (c) a scaling of the population&#x02019;s intake by a multiplicative value, and (d) the random selection of a dietary intake bias for each individual. The two models that yielded biologically plausible results in terms of ratio to RMR, (a) and (b), are presented here. We used SAS 9.2 (SAS Institute, Cary, NC, USA) for statistical analyses.</p>
<sec id="S2-2-2">
<title>Estimation of individual misreporting error (OPEN indicators regression method)</title>
<p>Using a multiple linear regression on the OPEN data, we estimated the bias in reported EI from individuals&#x02019; characteristics and self-reported data. The outcome for this model was percent energy misreporting [i.e., (EE&#x02009;&#x02212;&#x02009;EI<sub>claimed</sub>)/EE, where positive values represent under-reporting and negative values represent over-reporting]. We created a quantile&#x02013;quantile plot of percent energy misreporting and found that this variable was approximately normally distributed, with slight deviations at the most extreme over-reporting end. Because these deviations involve only a small amount of data, and because most individuals under-report, we considered the normality of the data sufficient for a linear model. Parameter estimates from this model allowed later prediction of true dietary intake. This method relies on the assumption that misreporting patterns in the OPEN study were similar to those in test datasets.</p>
<p>Predictors considered for the regression model included age, sex, weight, height, body mass index (BMI), Ponderal Index, EI<sub>claimed</sub>, log(EI<sub>claimed</sub>), IndEI<sub>claimed</sub>, and log(IndEI<sub>claimed</sub>). We considered day of the week by checking for trends in mean percent misreporting for each day. We also examined interaction effects for race and ethnicity. Backward stepwise elimination was used to remove, one at a time, the variable with the highest two-sided <italic>P</italic>-value above a cut-off of 0.05 and to identify collinear terms. We checked residual plots for each variable in the final model for reasonable homogeneity of variance.</p>
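<p>Once the regression predicts a percent misreporting value for an individual, the definition of the outcome can be inverted to recover an estimate of true EI. Writing the predicted percent misreporting as m&#x02009;&#x0003D;&#x02009;100&#x02009;&#x000D7;&#x02009;(EE&#x02009;&#x02212;&#x02009;EI<sub>claimed</sub>)/EE and setting EE&#x02009;&#x0003D;&#x02009;EI gives EI&#x02009;&#x0003D;&#x02009;EI<sub>claimed</sub>/(1&#x02009;&#x02212;&#x02009;m/100). The Python sketch below shows only this algebraic step; the function name is hypothetical, and the inversion is our reading of the definition rather than a description of the authors&#x02019; SAS code.</p>
<preformat><![CDATA[
```python
def corrected_ei(ei_claimed, pct_misreporting):
    """Invert m = 100 * (EE - EI_claimed) / EE, with EE = EI, to recover EI.

    pct_misreporting is the predicted percent misreporting
    (positive = under-reporting, negative = over-reporting).
    """
    if pct_misreporting >= 100.0:
        # m = 100% would imply a claimed intake of zero or less.
        raise ValueError("predicted misreporting must be below 100%")
    return ei_claimed / (1.0 - pct_misreporting / 100.0)


# Invented inputs: a 2,000 kcal claim with 20% predicted under-reporting
# is adjusted upward; 25% predicted over-reporting is adjusted downward.
print(round(corrected_ei(2000.0, 20.0)))
print(round(corrected_ei(2000.0, -25.0)))
```
]]></preformat>
<p>Note that the adjustment is multiplicative in EI<sub>claimed</sub>, so the same predicted percentage produces a larger caloric change for a larger claimed intake.</p>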
</sec>
<sec id="S2-2-3">
<title>Estimation of population-level energy intake bias (intake shift method)</title>
<p>The intake shift method&#x02019;s outcome is the population-average caloric shift in EI, stratified by gender, which should be applied to each individual in a test dataset in order to align the self-reported EI<sub>claimed</sub> with the measured EE from the OPEN dataset, normalizing both for RMR. This method relies on the assumption that the EE distribution in the OPEN dataset was similar to that of test datasets.</p>
<p>We first analyzed the distributions of PAL in the OPEN study by sex and by weight status. Weight status categories were defined using CDC and WHO definitions: BMI &#x0003C;18.5 underweight, &#x0003E;30 obese, &#x0003E;25 overweight, and otherwise normal weight (<xref ref-type="bibr" rid="B22">22</xref>, <xref ref-type="bibr" rid="B23">23</xref>). Because PAL differed by gender but not by weight status (see Results), PAL was subsequently stratified by gender and not by weight status.</p>
<p>To find the shift in EI, we built a simulation with <italic>n</italic> individuals and interpolated the OPEN PAL distribution to <italic>n</italic> individuals. To do this, we first fit the PAL distributions for men and women in OPEN to a log-normal curve. A new curve was created with the mean and standard deviation from that fit and the number of individuals in the simulation of the corresponding sex. After ranking the PAL and the IndEI<sub>claimed</sub> values, this interpolation allows, for example, the 28th woman in the simulation of <italic>n</italic> individuals to correspond to the would-be placement of the 28th woman in the OPEN data if the OPEN data had <italic>n</italic> individuals. The difference between PAL and IndEI<sub>claimed</sub> and corresponding change in EI was then calculated for each person in the simulation. These &#x00394;EI were averaged to find a number of calories by which the EI<sub>claimed</sub> of each person in the simulation should be shifted.</p>
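<p>The rank-matching step described above can be sketched in Python as follows. This is an illustrative reconstruction, not the authors&#x02019; MATLAB code: the function names are hypothetical, the log-normal fit uses the mean and standard deviation of log(PAL), and the midpoint plotting positions (i&#x02009;+&#x02009;0.5)/n used to place n individuals on the fitted curve are our assumption. The sketch would be run once per sex.</p>
<preformat><![CDATA[
```python
import math
from statistics import NormalDist, fmean, stdev


def fit_lognormal(pal_values):
    """Return the mean and SD of log(PAL) (a simple fit on the log scale)."""
    logs = [math.log(v) for v in pal_values]
    return fmean(logs), stdev(logs)


def mean_intake_shift(mu_log, sd_log, ind_ei_claimed, rmr):
    """Average kcal shift aligning ranked IndEI_claimed with PAL quantiles.

    ind_ei_claimed and rmr are parallel lists for one sex in the simulation.
    """
    n = len(ind_ei_claimed)
    dist = NormalDist(mu_log, sd_log)
    # Target PAL for each rank, using midpoint plotting positions (assumed).
    target_pal = [math.exp(dist.inv_cdf((i + 0.5) / n)) for i in range(n)]
    # Rank individuals by claimed index; each person keeps their own RMR.
    order = sorted(range(n), key=lambda i: ind_ei_claimed[i])
    deltas = [
        (target_pal[rank] - ind_ei_claimed[i]) * rmr[i]
        for rank, i in enumerate(order)
    ]
    return fmean(deltas)


# Invented values for illustration only.
mu, sd = fit_lognormal([1.4, 1.6, 1.7, 1.9, 2.2])
shift = mean_intake_shift(mu, sd, [1.0, 1.5, 0.9, 1.3],
                          [1500.0, 1700.0, 1400.0, 1600.0])
print(round(shift))
```
]]></preformat>
<p>The returned value is a single caloric offset that is added to every EI<sub>claimed</sub> in that sex group, so the shape of the claimed-intake distribution is preserved and only its location changes.</p>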
</sec>
<sec id="S2-2-4">
<title>Simulation of application of models to NHANES data</title>
<p>We applied these models to simulations of two test datasets, NHANES I and NHANES 2007&#x02013;2008. Individuals were initialized with the age&#x02013;sex distribution corresponding to the census population distribution for the year of each dataset. For each individual, characteristics of height, weight, and calories from the dietary recall (EI<sub>claimed</sub>) were drawn randomly from the corresponding age&#x02013;sex combination distribution in the NHANES test dataset.</p>
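<p>One way to build such a simulated population is sketched below in Python. The function name and data layout are hypothetical, and the sketch assumes that height, weight, and EI<sub>claimed</sub> are drawn jointly as one survey record from the matching age&#x02013;sex stratum; the text does not specify whether the characteristics were drawn jointly or independently.</p>
<preformat><![CDATA[
```python
import random


def simulate_population(census_weights, survey_records, n, seed=0):
    """Draw n simulated individuals.

    census_weights: {(age_group, sex): population share} from the census
    survey_records: {(age_group, sex): [record, ...]} from the test dataset
    Returns a list of (stratum, record) pairs.
    """
    rng = random.Random(seed)
    strata = list(census_weights)
    weights = [census_weights[s] for s in strata]
    people = []
    # Age-sex strata are sampled in proportion to the census distribution.
    for stratum in rng.choices(strata, weights=weights, k=n):
        # Assumption: one whole survey record is drawn per individual.
        people.append((stratum, rng.choice(survey_records[stratum])))
    return people


# Invented strata and records for illustration only.
census = {("18-39", "F"): 0.6, ("40-69", "M"): 0.4}
records = {
    ("18-39", "F"): [{"height_cm": 163, "weight_kg": 65, "ei_claimed": 1800}],
    ("40-69", "M"): [{"height_cm": 176, "weight_kg": 85, "ei_claimed": 2400},
                     {"height_cm": 180, "weight_kg": 90, "ei_claimed": 2100}],
}
print(len(simulate_population(census, records, n=10)))  # 10
```
]]></preformat>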
<p>Both correction models were run on simulations of three different populations from the NHANES I and NHANES 2007&#x02013;2008 populations: all adults in the population, non-Blacks only, and non-Blacks age 40&#x02013;69 only (i.e., same as those in the OPEN study), in order to analyze whether the age and race limitations of the OPEN dataset affected the final outcome.</p>
<p>We ran 1,000 simulations of 10,000 individuals each for each dataset-population-adjustment method combination (12 combinations). A &#x0201C;failure rate&#x0201D; designates the average from all 1,000 simulations of percent of individuals with a PAL out of bounds (i.e., &#x0003C;1 or &#x0003E; maximum value for gender). Simulations were built in MATLAB (Mathworks, Natick, MA, USA).</p>
</sec>
<sec id="S2-2-5">
<title>Definition of boundary values for estimating accuracy of self-reported dietary intake</title>
<p>By definition PAL can be no lower than 1, and the Goldberg equations &#x02013; which are based on measured RMR &#x02013; establish an even higher lower limit of normal PAL at 1.35 (<xref ref-type="bibr" rid="B24">24</xref>). However, Goldberg tabulates data with a PAL as low as 1.16 from a whole-body calorimetry study and acknowledges some variation in the accuracy of calculation of RMR for a given individual. In addition, in the OPEN study the lowest PAL derived from observed EE data was 1.2. Given these results, we used a minimum PAL, and thus a minimum IndEI, of 1. Experimentally derived PAL values from a variety of studies, including among professional athletes, are tabulated in Black et al. (<xref ref-type="bibr" rid="B25">25</xref>). In the application of these methods to the general US population, we chose cut-offs of maximum PAL&#x02009;&#x0003D;&#x02009;IndEI<sub>claimed</sub> as 2.8 for women and 3.5 for men as numbers corresponding to individuals who were extremely active yet were not undertaking long athletic feats such as Arctic exploration or the Tour de France. Values of IndEI<sub>claimed</sub> outside the specified range (&#x0003C;1.0 and &#x0003E;2.8 for women or 3.5 for men) represent IndEI values that are not physiologically plausible in energy homeostasis, so individuals with these values are considered to have misreported dietary intake.</p>
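<p>The plausibility check and the resulting failure rate for a single simulated population can be expressed as a short Python sketch. The bounds are those stated above; the function and constant names are hypothetical, and the example values are invented.</p>
<preformat><![CDATA[
```python
MIN_INDEX = 1.0                      # lower bound for PAL and IndEI
MAX_INDEX = {"F": 2.8, "M": 3.5}     # upper bounds for women and men


def is_plausible(index, sex):
    """True if an IndEI (or PAL) value lies within the bounds for that sex."""
    return MIN_INDEX <= index <= MAX_INDEX[sex]


def failure_rate(people):
    """Percent of (index, sex) pairs falling outside the plausible range."""
    failures = sum(1 for index, sex in people if not is_plausible(index, sex))
    return 100.0 * failures / len(people)


# Invented sample: 0.8 is below the minimum, and 3.0 exceeds the
# maximum for women but not for men.
sample = [(0.8, "F"), (1.5, "F"), (3.0, "F"), (3.0, "M")]
print(failure_rate(sample))  # 50.0
```
]]></preformat>
<p>The failure rates reported in this study are averages of this quantity over 1,000 simulated populations rather than a single evaluation.</p>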
</sec>
</sec>
</sec>
<sec id="S3">
<title>Results</title>
<p>Univariate summaries of each dataset are presented in Table <xref ref-type="table" rid="T1">1</xref>.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Characteristics of individuals in OPEN and NHANES datasets</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Variable</th>
<th align="center">Mean (standard deviation) or count (percentage), OPEN</th>
<th align="center">Mean (standard deviation) or count (percentage), NHANES I</th>
<th align="center">Mean (standard deviation) or count (percentage), NHANES 2007&#x02013;2008</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Count</td>
<td align="center">451</td>
<td align="center">8006</td>
<td align="center">4830</td>
</tr>
<tr>
<td align="left">Age (years)</td>
<td align="char" char="(" charoff="50">53.5 (8.4)</td>
<td align="char" char="(" charoff="50">47.0 (17.9)</td>
<td align="char" char="(" charoff="50">48.4 (8.4)</td>
</tr>
<tr>
<td align="left">Female</td>
<td align="char" char="(" charoff="50">206 (45.7)</td>
<td align="char" char="(" charoff="50">4680 (58.4)</td>
<td align="char" char="(" charoff="50">2358 (48.8)</td>
</tr>
<tr>
<td align="left">Race</td>
</tr>
<tr>
<td align="left">&#x02003;White non-Hispanic</td>
<td align="char" char="(" charoff="50">371 (82.3)</td>
<td align="char" char="(" charoff="50">6736 (84.1)</td>
<td align="char" char="(" charoff="50">2234 (46.3)</td>
</tr>
<tr>
<td align="left">&#x02003;Black non-Hispanic</td>
<td align="char" char="(" charoff="50">28 (6.2)</td>
<td align="char" char="(" charoff="50">1169 (14.6)</td>
<td align="char" char="(" charoff="50">1020 (21.1)</td>
</tr>
<tr>
<td align="left">&#x02003;Hispanic, any race</td>
<td align="char" char="(" charoff="50">18 (4.0)</td>
<td align="char" char="(" charoff="50"><xref ref-type="table-fn" rid="tfn1"><sup>a</sup></xref></td>
<td align="char" char="(" charoff="50">1387 (28.7)</td>
</tr>
<tr>
<td align="left">&#x02003;Other/unspecified</td>
<td align="char" char="(" charoff="50">34 (7.5)</td>
<td align="char" char="(" charoff="50">101 (1.3)</td>
<td align="char" char="(" charoff="50">189 (3.9)</td>
</tr>
<tr>
<td align="left">Weight (kg)</td>
<td align="char" char="(" charoff="50">81.0 (17.6)</td>
<td align="char" char="(" charoff="50">69.2 (15.3)</td>
<td align="char" char="(" charoff="50">80.0 (20.7)</td>
</tr>
<tr>
<td align="left">Body mass index (kg/m<sup>2</sup>)</td>
<td align="char" char="(" charoff="50">27.8 (5.2)</td>
<td align="char" char="(" charoff="50">24.9 (4.8)</td>
<td align="char" char="(" charoff="50">28.4 (6.5)</td>
</tr>
<tr>
<td align="left">EI<sub>claimed</sub> (kcal)</td>
<td align="char" char="(" charoff="50">2346 (808)</td>
<td align="char" char="(" charoff="50">1876 (884)</td>
<td align="char" char="(" charoff="50">2120 (1050)</td>
</tr>
<tr>
<td align="left">TEE (kcal)</td>
<td align="char" char="(" charoff="50">2627 (556)</td>
<td align="center">n/a</td>
<td align="center">n/a</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1"><p><italic><sup>a</sup>NHANES I did not track ethnicity (Hispanic vs. non-Hispanic); numbers reflect categories White, Black, and Other</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<sec id="S3-3">
<title>Final OPEN indicators regression model of percent misreporting</title>
<p>In the regression model for misreporting using the OPEN data, younger age, greater weight, and lower claimed EI, both absolute and adjusted for RMR [i.e., log(EI<sub>claimed</sub>) and IndEI<sub>claimed</sub>], were significantly associated with higher dietary under-reporting; male sex was also associated with higher under-reporting, but not significantly so (Table <xref ref-type="table" rid="T2">2</xref>, all <italic>P</italic>-values two-sided). We found no significant interaction among these terms and no systematic variation in mean under-reporting by day of the week of the dietary recall.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Parameter estimates for non-Blacks in the OPEN study for outcome variable % misreporting<xref ref-type="table-fn" rid="tfn3"><sup>a</sup></xref></bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Variable</th>
<th align="center">Parameter estimate</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Intercept</td>
<td align="char" char="." charoff="50">298.19<xref ref-type="table-fn" rid="tfn2">&#x0002A;&#x0002A;&#x0002A;</xref></td>
</tr>
<tr>
<td align="left">Sex (1&#x02009;&#x0003D;&#x02009;male, 2&#x02009;&#x0003D;&#x02009;female)</td>
<td align="char" char="." charoff="50">&#x02212;2.30<xref ref-type="table-fn" rid="tfn3"><sup>a</sup></xref></td>
</tr>
<tr>
<td align="left">Age (years)</td>
<td align="char" char="." charoff="50">&#x02212;0.36<xref ref-type="table-fn" rid="tfn2">&#x0002A;&#x0002A;&#x0002A;</xref></td>
</tr>
<tr>
<td align="left">Weight (kg)</td>
<td align="char" char="." charoff="50">0.21<xref ref-type="table-fn" rid="tfn2">&#x0002A;&#x0002A;&#x0002A;</xref></td>
</tr>
<tr>
<td align="left">log (EI<sub>claimed</sub>) (log kcal)</td>
<td align="char" char="." charoff="50">&#x02212;35.84<xref ref-type="table-fn" rid="tfn2">&#x0002A;&#x0002A;&#x0002A;</xref></td>
</tr>
<tr>
<td align="left">IndEI<sub>claimed</sub> (unitless)</td>
<td align="char" char="." charoff="50">&#x02212;30.47<xref ref-type="table-fn" rid="tfn2">&#x0002A;&#x0002A;&#x0002A;</xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2"><p><italic>&#x0002A;&#x0002A;&#x0002A;<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001</italic>.</p></fn>
<fn id="tfn3"><p><italic><sup>a</sup>NS. <italic>R</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.84. EI<sub>claimed</sub>&#x02009;&#x0003D;&#x02009;energy intake according to dietary recall, IndEI<sub>claimed</sub>&#x02009;&#x0003D;&#x02009;EI<sub>claimed</sub>/resting metabolic rate. Positive outcome represents under-reporting</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="S3-4">
<title>Final intake shift model</title>
<p>The derived PAL distributions were found to differ significantly by gender [<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001; males mean 1.83 (SD 1.14); females mean 1.63 (SD 1.13)]. Consistent with other studies (<xref ref-type="bibr" rid="B26">26</xref>), the PAL did not differ by weight status (underweight, normal, overweight, and obese) (<italic>P</italic>&#x02009;&#x0003D;&#x02009;0.69). The PAL distributions for each gender were then used to adjust IndEI<sub>claimed</sub> in the test dataset.</p>
</sec>
<sec id="S3-5">
<title>Estimation of reporting bias in simulation of NHANES populations</title>
<p>Prior to correction of dietary reporting, a simulation of 10,000 individuals predicted a baseline failure rate (i.e., IndEI<sub>claimed</sub> out of range) of 31.9% in NHANES I and 32.5% in NHANES 2007&#x02013;2008. Following the incorporation of the regression or intake shift models, the same simulations demonstrated substantial improvement over the unadjusted results, with diminished failure rates (Table <xref ref-type="table" rid="T3">3</xref>; Figure <xref ref-type="fig" rid="F2">2</xref>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Failure rate (percent outside of defined IndEI<xref ref-type="table-fn" rid="tfn4"><sup>a</sup></xref> range after adjustment) of the regression and the intake shift models applied to NHANES I and NHANES 2007&#x02013;2008 datasets for all adults, non-Blacks only, and non-Blacks age 40&#x02013;69</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Dataset</th>
<th align="center">Regression (%)</th>
<th align="center">Intake shift (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">NHANES I all</td>
<td align="center">0.67</td>
<td align="center">5.95</td>
</tr>
<tr>
<td align="left">NHANES I non-Blacks</td>
<td align="center">0.70</td>
<td align="center">5.64</td>
</tr>
<tr>
<td align="left">NHANES I non-Blacks age 40&#x02013;69<xref ref-type="table-fn" rid="tfn5"><sup>b</sup></xref></td>
<td align="center">0.53</td>
<td align="center">4.06</td>
</tr>
<tr>
<td align="left">NHANES 2007&#x02013;2008 all</td>
<td align="center">1.48</td>
<td align="center">8.87</td>
</tr>
<tr>
<td align="left">NHANES 2007&#x02013;2008 non-Blacks</td>
<td align="center">1.59</td>
<td align="center">8.14</td>
</tr>
<tr>
<td align="left">NHANES 2007&#x02013;2008 non-Blacks age 40&#x02013;69<xref ref-type="table-fn" rid="tfn5"><sup>b</sup></xref></td>
<td align="center">0.96</td>
<td align="center">6.75</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn4"><p><italic><sup>a</sup>IndEI, index of energy intake&#x02009;&#x0003D;&#x02009;energy intake/resting metabolic rate; NHANES, National Health and Nutrition Examination Survey. Some small variation exists between the datasets for each method. All results show a much lower failure rate than the 32&#x02013;33% in the raw data</italic>.</p></fn>
<fn id="tfn5"><p><italic><sup>b</sup>Ages 40&#x02013;69 represent the ages of participants in the training dataset observing protein and energy nutrition (OPEN)</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<fig position="float" id="F2">
<label>Figure 2</label>
<caption><p><bold>Results from a simulation of non-Blacks from National Health and Nutrition Examination Survey (NHANES) 2007&#x02013;2008 data</bold>. Top row: histogram of energy intake (EI); bottom row: histogram of index of energy intake, i.e., energy intake divided by resting metabolic rate (RMR). Left column: unadjusted data, middle column: adjusted using the regression method (values outside bounds not shown for better clarity), and right column: adjusted using the intake shift method.</p></caption>
<graphic xlink:href="fpubh-02-00249-g002.tif"/>
</fig>
<p>The regression method showed a failure rate of &#x0003C;2%. After adjustment, few individuals (e.g., in one test run, 23 of 10,000) remained in the range of 0&#x02009;&#x0003C;&#x02009;IndEI&#x02009;&#x0003C;&#x02009;1, the range in which under-reporters fall in the crude, unadjusted data (Figure <xref ref-type="fig" rid="F2">2</xref>). Most misreporting was under-reporting (Figure <xref ref-type="fig" rid="F3">3</xref>, left). Using the OPEN regression model, the 25th and 75th percentiles of the caloric adjustment for misreporting were &#x02212;9 and 895 calories. Although most of the population was adjusted upward because of under-reporting, some individuals had their EI adjusted downward, indicating a small amount of over-reporting bias (Figure <xref ref-type="fig" rid="F3">3</xref>).</p>
<fig position="float" id="F3">
<label>Figure 3</label>
<caption><p><bold>Left: density of individuals according to adjusted EI vs. initial EI<sub>claimed</sub> (energy intake claimed)</bold>. Points represent data; darker regions indicate higher density of data points in that region. A line indicates unity. Most of the dense region lies above the line, indicating that most dietary intake was under-reported and shifted upward. A much less dense region below the line represents over-reporters. The majority of individuals claimed an intake in the range of 1,000&#x02013;2,000 calories, and most of this group has been shifted to a range of about 1,900&#x02013;2,400 calories. Another smaller group claimed between 2,000 and 2,500 calories and was shifted to around 2,700 calories. Dataset used includes only non-Blacks from National Health and Nutrition Examination Survey (NHANES) 2007&#x02013;2008. Right: boxplot of calorie difference applied to individuals in the regression model. The interquartile range shows that most individuals under-reported and had EI (energy intake) adjusted upward. Some individuals still over-reported and had EI adjusted downward; however, the asymmetry of the box plot shows that many more individuals under-reported than over-reported. Outliers exist in both directions.</p></caption>
<graphic xlink:href="fpubh-02-00249-g003.tif"/>
</fig>
<p>The intake shift method, with a failure rate of 4&#x02013;9%, performed less well than the regression method but still showed a substantial improvement over the unadjusted data, which had a failure rate of 32&#x02013;33% (Figure <xref ref-type="fig" rid="F2">2</xref>). Using the intake shift model, we found that, on average, adding 905 calories to men&#x02019;s intake and 600 calories to women&#x02019;s intake best fit the 2007 population data for non-Blacks. For the 1971 population data, the best fit came from adding 758 calories to men&#x02019;s intake and 613 calories to women&#x02019;s intake. For both genders in both NHANES datasets, the caloric shift applied represented approximately one quarter of the final EI for the gender group within that dataset.</p>
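<p>The arithmetic of the intake shift can be sketched in a few lines of Python. The shift values are those reported above; the dictionary keys, the function names, and the claimed intakes are hypothetical and were chosen only so that the shift works out to one quarter of the adjusted intake. How the best-fitting shift is found is not shown here.</p>
<preformat>
```python
# Sex-specific shifts (calories) reported in the text.
SHIFT_2007 = {"male": 905.0, "female": 600.0}
SHIFT_1971 = {"male": 758.0, "female": 613.0}


def apply_intake_shift(ei_claimed: float, sex: str, shifts: dict) -> float:
    """Add the population-level shift for the given sex to a claimed intake."""
    return ei_claimed + shifts[sex]


def shift_share(ei_claimed: float, sex: str, shifts: dict) -> float:
    """Fraction of the adjusted intake that comes from the shift itself."""
    return shifts[sex] / apply_intake_shift(ei_claimed, sex, shifts)


# Hypothetical claimed intakes, chosen only to illustrate the arithmetic.
adjusted_man = apply_intake_shift(2715.0, "male", SHIFT_2007)      # 3620.0
adjusted_woman = apply_intake_shift(1800.0, "female", SHIFT_2007)  # 2400.0
share_man = shift_share(2715.0, "male", SHIFT_2007)                # 0.25
share_woman = shift_share(1800.0, "female", SHIFT_2007)            # 0.25
```
</preformat>
<p>Because every individual of a given sex receives the same shift, the shape of the claimed intake distribution within each sex is unchanged and only its location moves.</p>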
<p>The sensitivity analysis (Table <xref ref-type="table" rid="T3">3</xref>) showed that while the best results occur with the age group corresponding to that of the OPEN study, the failure rates are nevertheless similar for all three groups in both cohorts.</p>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>In this study, we developed a new method to correct for EI under-reporting bias in a population, yielding a more plausible population distribution of total calories consumed daily. We created two models with different basic assumptions, and both greatly decrease the number of individuals in a test dataset whose claimed daily dietary intake falls outside a physically possible range.</p>
<p>The variables in the regression on OPEN data had coefficients in the expected directions. Intuitively, those with a low EI<sub>claimed</sub> or IndEI<sub>claimed</sub> are more likely to have under-reported by a greater amount. Our finding of increased under-reporting with weight agrees with previous findings that under-reporting increases with BMI (<xref ref-type="bibr" rid="B1">1</xref>). This correlation could arise from fear of stigma. It could also reflect the difficulty of recalling more or greater amounts of food for heavier individuals, who require a higher caloric intake. In the OPEN data, the extent of under-reporting increases with EE, again suggesting that greater amounts of food intake are more difficult to recall (<xref ref-type="bibr" rid="B19">19</xref>).</p>
<p>The regression method reduces variance in the EI<sub>claimed</sub> and IndEI<sub>claimed</sub> ranges. The adjusted results appear close to the mean PAL values in a tabulation of doubly labeled water studies (<xref ref-type="bibr" rid="B25">25</xref>).</p>
<p>The strong performance of the regression model suggests that the reasons for under-reporting may be fairly consistent among individuals, and that people respond rather predictably to a standardized self-reported dietary recall.</p>
<p>The intake shift method may have performed less well than the regression method because it accounts only for average offset and not for individual variation. However, it still provides valuable insight in the case where the test dataset is believed to have participants with an EE distribution similar to that of participants in the training dataset.</p>
<p>Instead of adjusting, we could simply discard data that do not conform to normal food intake as determined by known PAL values from experimental data. However, this would require discarding between a third and a half of the data, depending on the PAL value chosen. In addition, applying a single cut-off makes no provision for those who under-reported significantly yet remained just above the cut-off, e.g., a subject with a true IndEI of 1.85 who recalled only enough for an IndEI<sub>claimed</sub> of 1.4. Excluding only the data that fall below a particular cut-off therefore introduces bias.</p>
<p>The analysis of race showed a difference in under-reporting patterns, in particular among Blacks, whose under-reporting does not seem to be influenced by the factors that influence under-reporting in other racial/ethnic groups. With only 28 Black participants in OPEN, it is difficult to draw conclusions about these data. With a larger dataset of other racial/ethnic groups, it may be possible to make similar models for these groups.</p>
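<p>The example of a subject with a true IndEI of 1.85 and an IndEI<sub>claimed</sub> of 1.4 can be worked through in a short Python sketch. The cut-off of 1.35 and the function names are hypothetical and serve only to show that a record can pass a single cut-off while still missing roughly a quarter of the true intake.</p>
<preformat>
```python
def relative_under_reporting(true_ind_ei: float, claimed_ind_ei: float) -> float:
    """Fraction of the true intake that is missing from the claimed intake."""
    return (true_ind_ei - claimed_ind_ei) / true_ind_ei


def passes_cutoff(claimed_ind_ei: float, cutoff: float) -> bool:
    """True when a record would be retained under a single lower cut-off."""
    return claimed_ind_ei >= cutoff


# Values from the example in the text.
true_ind_ei = 1.85
claimed_ind_ei = 1.4

# Hypothetical cut-off, chosen for illustration only.
cutoff = 1.35

retained = passes_cutoff(claimed_ind_ei, cutoff)                  # True
missing = relative_under_reporting(true_ind_ei, claimed_ind_ei)   # about 0.24
```
</preformat>
<p>In this sketch the record is retained even though about 24% of the true intake went unreported, which is the situation a single cut-off cannot detect.</p>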
<sec id="S4-6">
<title>Limitations</title>
<p>The methods we have developed are meant to correct for total EI dietary recall bias in a population.</p>
<p>They cannot be used to accurately predict dietary intake for a particular individual on a particular date because day-to-day variations in individuals&#x02019; intake are averaged out over the population. For any individual in steady state over a prolonged period of constant weight, it is expected that dietary intake and EE should balance perfectly. This balance is not perfect over a short period such as the duration of the OPEN study. While a longer study would improve this balance, it would also, as Subar et al. note in the OPEN study paper, introduce more error from daily fluctuation in EI (<xref ref-type="bibr" rid="B19">19</xref>). In large populations such as NHANES/OPEN, fluctuations in the balance of intake and expenditure average out, making the error unimportant in the overall interpretation of the model.</p>
<p>The methods were derived under the assumption of energy homeostasis, that is, EI equals EE, so that no weight change occurs. If the assumption did not hold, EE could not be assumed equal to EI. For example, EI of a participant losing weight would be lower than EE. Because OPEN exclusions included diets to alter weight, and because we excluded NHANES subjects who had reported being on a special diet or not eating typically that day, our study remains valid. However, it cannot be applied to populations whose subjects are altering their body weight during the study.</p>
<p>The training dataset included individuals of age 40&#x02013;69, but the test dataset included all adults in the NHANES data. A training dataset with a wider age range may be able to better characterize the variation in reporting with age. Still, our results show that while the models achieve a slightly lower failure rate in a population of age 40&#x02013;69, the results are in a similar range to those of a population of all adults.</p>
</sec>
</sec>
<sec id="S5">
<title>Conclusion</title>
<p>Under-reporting can be a serious problem in dietary studies. We have presented two methods for adjusting for under-reporting, and both showed a substantial improvement in biological plausibility over the raw data. These methods can better inform quantitative nutritional research in populations.</p>
<p>In addition to providing a tool for other studies of diet in populations, these methods may be useful in predicting under-reporting for many other self-reported habits, such as smoking or alcohol consumption. Correcting for these would require an experimental dataset with biomarkers of actual consumption, in addition to a self-reported collection from the participants.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>Joanna Lankester designed the research; Joanna Lankester conducted the research; Joanna Lankester, Sharon Perry, and Julie Parsonnet analyzed and interpreted data; Joanna Lankester drafted the paper; and Joanna Lankester, Sharon Perry, and Julie Parsonnet made substantial revisions to the paper. All authors read and approved the final manuscript.</p>
</sec>
<sec id="S7">
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>We thank Amy Subar for providing access to the OPEN dataset. We thank Catherine Ley and Margaret Brandeau for helpful comments on the manuscript. This work was supported by the National Institutes of Health (R01 HD063142).</p>
</ack>
<sec id="S8">
<title>Abbreviations</title>
<p>BMI, body mass index; EE, energy expenditure; EI, energy intake (actual, not claimed); EI<sub>claimed</sub>, energy intake claimed in dietary recall (in calories); IndEI, index of energy intake (EI/RMR); IndEI<sub>claimed</sub>, EI<sub>claimed</sub>/RMR; NHANES, National Health and Nutrition Examination Survey; OPEN, Observing Protein and Energy Nutrition; PAL, physical activity level (energy expenditure divided by resting metabolic rate); RMR, calculated resting metabolic rate.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Livingstone</surname> <given-names>MB</given-names></name> <name><surname>Black</surname> <given-names>AE</given-names></name></person-group>. <article-title>Markers of the validity of reported energy intake</article-title>. <source>J Nutr</source> (<year>2003</year>) <volume>133</volume>(<issue>Suppl 3</issue>):<fpage>895S</fpage>&#x02013;<lpage>920S</lpage>.</citation></ref>
<ref id="B2"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Briefel</surname> <given-names>RR</given-names></name> <name><surname>Sempos</surname> <given-names>CT</given-names></name> <name><surname>McDowell</surname> <given-names>MA</given-names></name> <name><surname>Chien</surname> <given-names>S</given-names></name> <name><surname>Alaimo</surname> <given-names>K</given-names></name></person-group>. <article-title>Dietary methods research in the third National Health and Nutrition Examination Survey: underreporting of energy intake</article-title>. <source>Am J Clin Nutr</source> (<year>1997</year>) <volume>65</volume>:<fpage>1203S</fpage>&#x02013;<lpage>9S</lpage>.<pub-id pub-id-type="pmid">9094923</pub-id></citation></ref>
<ref id="B3"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poslusna</surname> <given-names>K</given-names></name> <name><surname>Ruprich</surname> <given-names>J</given-names></name> <name><surname>de Vries</surname> <given-names>JH</given-names></name> <name><surname>Jakubikova</surname> <given-names>M</given-names></name> <name><surname>van&#x02019;t Veer</surname> <given-names>P</given-names></name></person-group>. <article-title>Misreporting of energy and micronutrient intake estimated by food records and 24 hour recalls, control and adjustment methods in practice</article-title>. <source>Br J Nutr</source> (<year>2009</year>) <volume>101</volume>(<issue>Suppl 2</issue>):<fpage>S73</fpage>&#x02013;<lpage>85</lpage>.<pub-id pub-id-type="doi">10.1017/S0007114509990602</pub-id><pub-id pub-id-type="pmid">19594967</pub-id></citation></ref>
<ref id="B4"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yanetz</surname> <given-names>R</given-names></name> <name><surname>Kipnis</surname> <given-names>V</given-names></name> <name><surname>Carroll</surname> <given-names>RJ</given-names></name> <name><surname>Dodd</surname> <given-names>KW</given-names></name> <name><surname>Subar</surname> <given-names>AF</given-names></name> <name><surname>Schatzkin</surname> <given-names>A</given-names></name> <etal/></person-group> <article-title>Using biomarker data to adjust estimates of the distribution of usual intakes for misreporting: application to energy intake in the US population</article-title>. <source>J Am Diet Assoc</source> (<year>2008</year>) <volume>108</volume>:<fpage>455</fpage>&#x02013;<lpage>64</lpage>.<pub-id pub-id-type="doi">10.1016/j.jada.2007.12.004</pub-id><pub-id pub-id-type="pmid">18313427</pub-id></citation></ref>
<ref id="B5"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lof</surname> <given-names>M</given-names></name> <name><surname>Forsum</surname> <given-names>E</given-names></name></person-group>. <article-title>Validation of energy intake by dietary recall against different methods to assess energy expenditure</article-title>. <source>J Hum Nutr Diet</source> (<year>2004</year>) <volume>17</volume>:<fpage>471</fpage>&#x02013;<lpage>80</lpage>.<pub-id pub-id-type="doi">10.1111/j.1365-277X.2004.00554.x</pub-id><pub-id pub-id-type="pmid">15357701</pub-id></citation></ref>
<ref id="B6"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lissner</surname> <given-names>L</given-names></name> <name><surname>Heitmann</surname> <given-names>BL</given-names></name> <name><surname>Lindroos</surname> <given-names>AK</given-names></name></person-group>. <article-title>Measuring intake in free-living human subjects: a question of bias</article-title>. <source>Proc Nutr Soc</source> (<year>1998</year>) <volume>57</volume>:<fpage>333</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1079/PNS19980048</pub-id></citation></ref>
<ref id="B7"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Livingstone</surname> <given-names>MB</given-names></name></person-group>. <article-title>Assessment of food intakes: are we measuring what people eat?</article-title> <source>Br J Biomed Sci</source> (<year>1995</year>) <volume>52</volume>:<fpage>58</fpage>&#x02013;<lpage>67</lpage>.<pub-id pub-id-type="pmid">7549607</pub-id></citation></ref>
<ref id="B8"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schoeller</surname> <given-names>DA</given-names></name></person-group>. <article-title>How accurate is self-reported dietary energy intake?</article-title> <source>Nutr Rev</source> (<year>1990</year>) <volume>48</volume>:<fpage>373</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1111/j.1753-4887.1990.tb02882.x</pub-id><pub-id pub-id-type="pmid">2082216</pub-id></citation></ref>
<ref id="B9"><label>9</label><citation citation-type="web"><collab>National Center for Health Statistics &#x02013; Centers for Disease Control and Prevention</collab>. <source>MEC In-Person Dietary Interviewers Procedures Manual</source>. (<year>2008</year>). Available from: <uri xlink:href="http://www.cdc.gov/nchs/data/nhanes/nhanes_07_08/manual_dietarymec.pdf">http://www.cdc.gov/nchs/data/nhanes/nhanes_07_08/manual_dietarymec.pdf</uri></citation></ref>
<ref id="B10"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dodd</surname> <given-names>KW</given-names></name> <name><surname>Guenther</surname> <given-names>PM</given-names></name> <name><surname>Freedman</surname> <given-names>LS</given-names></name> <name><surname>Subar</surname> <given-names>AF</given-names></name> <name><surname>Kipnis</surname> <given-names>V</given-names></name> <name><surname>Midthune</surname> <given-names>D</given-names></name> <etal/></person-group> <article-title>Statistical methods for estimating usual intake of nutrients and foods: a review of the theory</article-title>. <source>J Am Diet Assoc</source> (<year>2006</year>) <volume>106</volume>:<fpage>1640</fpage>&#x02013;<lpage>50</lpage>.<pub-id pub-id-type="doi">10.1016/j.jada.2006.07.011</pub-id><pub-id pub-id-type="pmid">17000197</pub-id></citation></ref>
<ref id="B11"><label>11</label><citation citation-type="book"><collab>National Research Council Subcommittee on Criteria for Dietary Evaluation</collab>. <source>Nutrient Adequacy: Assessment Using Food Consumption Surveys</source>. <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>National Academy Press</publisher-name> (<year>1986</year>).</citation></ref>
<ref id="B12"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guenther</surname> <given-names>PM</given-names></name> <name><surname>Kott</surname> <given-names>PS</given-names></name> <name><surname>Carriquiry</surname> <given-names>AL</given-names></name></person-group>. <article-title>Development of an approach for estimating usual nutrient intake distributions at the population level</article-title>. <source>J Nutr</source> (<year>1997</year>) <volume>127</volume>:<fpage>1106</fpage>&#x02013;<lpage>12</lpage>.<pub-id pub-id-type="pmid">9187624</pub-id></citation></ref>
<ref id="B13"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nusser</surname> <given-names>SM</given-names></name> <name><surname>Carriquiry</surname> <given-names>AL</given-names></name> <name><surname>Dodd</surname> <given-names>KW</given-names></name> <name><surname>Fuller</surname> <given-names>WA</given-names></name></person-group>. <article-title>Semiparametric transformation approach to estimating usual daily intake distributions</article-title>. <source>J Am Stat Assoc</source> (<year>1996</year>) <volume>91</volume>:<fpage>1440</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1080/01621459.1996.10476712</pub-id></citation></ref>
<ref id="B14"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>CC</given-names></name> <name><surname>Kipnis</surname> <given-names>V</given-names></name> <name><surname>Freedman</surname> <given-names>LS</given-names></name> <name><surname>Hartman</surname> <given-names>AM</given-names></name> <name><surname>Schatzkin</surname> <given-names>A</given-names></name> <name><surname>Wacholder</surname> <given-names>S</given-names></name></person-group>. <article-title>Energy adjustment methods for nutritional epidemiology: the effect of categorization</article-title>. <source>Am J Epidemiol</source> (<year>1994</year>) <volume>139</volume>:<fpage>323</fpage>&#x02013;<lpage>38</lpage>.<pub-id pub-id-type="pmid">8116608</pub-id></citation></ref>
<ref id="B15"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Willett</surname> <given-names>W</given-names></name> <name><surname>Stampfer</surname> <given-names>MJ</given-names></name></person-group>. <article-title>Total energy intake: implications for epidemiologic analyses</article-title>. <source>Am J Epidemiol</source> (<year>1986</year>) <volume>124</volume>:<fpage>17</fpage>&#x02013;<lpage>27</lpage>.</citation></ref>
<ref id="B16"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kipnis</surname> <given-names>V</given-names></name> <name><surname>Subar</surname> <given-names>AF</given-names></name> <name><surname>Midthune</surname> <given-names>D</given-names></name> <name><surname>Freedman</surname> <given-names>LS</given-names></name> <name><surname>Ballard-Barbash</surname> <given-names>R</given-names></name> <name><surname>Troiano</surname> <given-names>RP</given-names></name> <etal/></person-group> <article-title>Structure of dietary measurement error: results of the OPEN biomarker study</article-title>. <source>Am J Epidemiol</source> (<year>2003</year>) <volume>158</volume>:<fpage>14</fpage>&#x02013;<lpage>21</lpage>.<pub-id pub-id-type="doi">10.1093/aje/kwg091</pub-id><pub-id pub-id-type="pmid">12835281</pub-id></citation></ref>
<ref id="B17"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tehrani</surname> <given-names>AB</given-names></name> <name><surname>Nezami</surname> <given-names>BG</given-names></name> <name><surname>Gewirtz</surname> <given-names>A</given-names></name> <name><surname>Srinivasan</surname> <given-names>S</given-names></name></person-group>. <article-title>Obesity and its associated disease: a role for microbiota?</article-title> <source>Neurogastroenterol Motil</source> (<year>2012</year>) <volume>24</volume>:<fpage>305</fpage>&#x02013;<lpage>11</lpage>.<pub-id pub-id-type="doi">10.1111/j.1365-2982.2012.01895.x</pub-id><pub-id pub-id-type="pmid">22339979</pub-id></citation></ref>
<ref id="B18"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tsai</surname> <given-names>F</given-names></name> <name><surname>Coyle</surname> <given-names>WJ</given-names></name></person-group>. <article-title>The microbiome and obesity: is obesity linked to our gut flora?</article-title> <source>Curr Gastroenterol Rep</source> (<year>2009</year>) <volume>11</volume>:<fpage>307</fpage>&#x02013;<lpage>13</lpage>.<pub-id pub-id-type="doi">10.1007/s11894-009-0045-z</pub-id><pub-id pub-id-type="pmid">19615307</pub-id></citation></ref>
<ref id="B19"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Subar</surname> <given-names>AF</given-names></name> <name><surname>Kipnis</surname> <given-names>V</given-names></name> <name><surname>Troiano</surname> <given-names>RP</given-names></name> <name><surname>Midthune</surname> <given-names>D</given-names></name> <name><surname>Schoeller</surname> <given-names>DA</given-names></name> <name><surname>Bingham</surname> <given-names>S</given-names></name> <etal/></person-group> <article-title>Using intake biomarkers to evaluate the extent of dietary misreporting in a large sample of adults: the OPEN study</article-title>. <source>Am J Epidemiol</source> (<year>2003</year>) <volume>158</volume>:<fpage>1</fpage>&#x02013;<lpage>13</lpage>.<pub-id pub-id-type="doi">10.1093/aje/kwg092</pub-id><pub-id pub-id-type="pmid">12835280</pub-id></citation></ref>
<ref id="B20"><label>20</label><citation citation-type="web"><collab>Centers for Disease Control and Prevention</collab>. <source>NHANES Questionnaires, Datasets, and Documentation</source>. (<year>2014</year>). Available from: <uri xlink:href="http://www.cdc.gov/nchs/nhanes/nhanes_questionnaires.htm">http://www.cdc.gov/nchs/nhanes/nhanes_questionnaires.htm</uri></citation></ref>
<ref id="B21"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schofield</surname> <given-names>WN</given-names></name></person-group>. <article-title>Predicting basal metabolic rate, new standards and review of previous work</article-title>. <source>Hum Nutr Clin Nutr</source> (<year>1985</year>) <volume>39</volume>(<issue>Suppl 1</issue>):<fpage>5</fpage>&#x02013;<lpage>41</lpage>.<pub-id pub-id-type="pmid">4044297</pub-id></citation></ref>
<ref id="B22"><label>22</label><citation citation-type="web"><collab>World Health Organization</collab>. <source>BMI Classification</source>. (<year>2006</year>). Available from: <uri xlink:href="http://apps.who.int/bmi/index.jsp?introPage=intro3.html">http://apps.who.int/bmi/index.jsp?introPage&#x0003D;intro3.html</uri></citation></ref>
<ref id="B23"><label>23</label><citation citation-type="web"><collab>Centers for Disease Control and Prevention</collab>. <source>Defining Overweight and Obesity</source>. (<year>2010</year>). Available from: <uri xlink:href="http://www.cdc.gov/obesity/defining.html">http://www.cdc.gov/obesity/defining.html</uri></citation></ref>
<ref id="B24"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goldberg</surname> <given-names>GR</given-names></name> <name><surname>Black</surname> <given-names>AE</given-names></name> <name><surname>Jebb</surname> <given-names>SA</given-names></name> <name><surname>Cole</surname> <given-names>TJ</given-names></name> <name><surname>Murgatroyd</surname> <given-names>PR</given-names></name> <name><surname>Coward</surname> <given-names>WA</given-names></name> <etal/></person-group> <article-title>Critical evaluation of energy intake data using fundamental principles of energy physiology: 1. Derivation of cut-off limits to identify under-recording</article-title>. <source>Eur J Clin Nutr</source> (<year>1991</year>) <volume>45</volume>:<fpage>569</fpage>&#x02013;<lpage>81</lpage>.<pub-id pub-id-type="pmid">1810719</pub-id></citation></ref>
<ref id="B25"><label>25</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Black</surname> <given-names>AE</given-names></name> <name><surname>Coward</surname> <given-names>WA</given-names></name> <name><surname>Cole</surname> <given-names>TJ</given-names></name> <name><surname>Prentice</surname> <given-names>AM</given-names></name></person-group>. <article-title>Human energy expenditure in affluent societies: an analysis of 574 doubly-labelled water measurements</article-title>. <source>Eur J Clin Nutr</source> (<year>1996</year>) <volume>50</volume>:<fpage>72</fpage>&#x02013;<lpage>92</lpage>.<pub-id pub-id-type="pmid">8641250</pub-id></citation></ref>
<ref id="B26"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butte</surname> <given-names>NF</given-names></name> <name><surname>Treuth</surname> <given-names>MS</given-names></name> <name><surname>Mehta</surname> <given-names>NR</given-names></name> <name><surname>Wong</surname> <given-names>WW</given-names></name> <name><surname>Hopkinson</surname> <given-names>JM</given-names></name> <name><surname>Smith</surname> <given-names>EO</given-names></name></person-group>. <article-title>Energy requirements of women of reproductive age</article-title>. <source>Am J Clin Nutr</source> (<year>2003</year>) <volume>77</volume>:<fpage>630</fpage>&#x02013;<lpage>8</lpage>.<pub-id pub-id-type="pmid">12600853</pub-id></citation></ref>
</ref-list>
</back>
</article>