Assessing daily energy intake in adult women: validity of a food-recognition mobile application compared to doubly labelled water

Serra, Michele; Alceste, Daniela; Hauser, Florian; Hulshof, Paul J. M.; Meijer, Harro A. J.; Thalheimer, Andreas; Steinert, Robert E.; Gerber, Philipp A.; Spector, Alan C.; Gero, Daniel; Bueter, Marco

doi:10.3389/fnut.2023.1255499

ORIGINAL RESEARCH article

Front. Nutr., 22 September 2023

Sec. Nutrition Methodology

Volume 10 - 2023 | https://doi.org/10.3389/fnut.2023.1255499

This article is part of the Research TopicAdvancements in Real-Time Technology for Dietary AssessmentView all 3 articles

Assessing daily energy intake in adult women: validity of a food-recognition mobile application compared to doubly labelled water

Michele Serra^1,2^†

Daniela Alceste^1,2^†

Florian Hauser²

Paul J. M. Hulshof³

Harro A. J. Meijer⁴

Andreas Thalheimer¹

Robert E. Steinert¹

Philipp A. Gerber⁵

Alan C. Spector⁶

Daniel Gero¹^‡

Marco Bueter¹^*^‡

¹Department of Surgery and Transplantation, University Hospital Zurich, Zurich, Switzerland
²Faculty of Medicine, University of Zurich (UZH), Zurich, Switzerland
³Division of Human Nutrition, Wageningen University, Wageningen, Netherlands
⁴Centre for Isotope Research (CIO), Energy and Sustainability Research Institute Groningen, University of Groningen, Groningen, Netherlands
⁵Department of Endocrinology, Diabetology and Clinical Nutrition, University Hospital Zurich, Zurich, Switzerland
⁶Department of Psychology and Program in Neuroscience, Florida State University, Tallahassee, FL, United States

Accurate dietary assessment is crucial for nutrition and health research. Traditional methods, such as food records, food frequency questionnaires, and 24-hour dietary recalls (24HR), have limitations, such as the need for trained interviewers, time-consuming procedures, and inaccuracies in estimations. Novel technologies, such as image-based dietary assessment apps, have been developed to overcome these limitations. SNAQ is a novel image-based food-recognition app which, based on computer vision, assesses food type and volume, and provides nutritional information about dietary intake. This cross-sectional observational study aimed to investigate the validity of SNAQ as a dietary assessment tool for measuring energy and macronutrient intake in adult women with normal body weight (n = 30), compared to doubly labeled water (DLW), a reference method for total daily energy expenditure (TDEE). Energy intake was also estimated using a one-day 24HR for direct comparison. Bland–Altman plots, paired difference tests, and Pearson’s correlation coefficient were used to assess agreement and relationships between the methods. SNAQ showed a slightly higher agreement (bias = −329.6 kcal/day) with DLW for total daily energy intake (TDEI) compared to 24HR (bias = −543.0 kcal/day). While both SNAQ and 24HR tended to underestimate TDEI, only 24HR significantly differed from DLW in this regard (p < 0.001). There was no significant relationship between estimated TDEI and TDEE using SNAQ (R² = 27%, p = 0.50) or 24HR (R² = 34%, p = 0.20) and there were no significant differences in energy and macronutrient intake estimates between SNAQ and 24HR (Δ = 213.4 kcal/day). In conclusion, these results indicate that SNAQ provides a closer representation of energy intake in adult women with normal body weight than 24HR when compared to DLW, but no relationship was found between the energy estimates of DLW and of the two dietary assessment tools. Further research is needed to determine the clinical relevance and support the implementation of SNAQ in research and clinical settings.

Clinical trial registration: This study is registered on ClinicalTrials.gov with the unique identifier NCT04600596 (https://clinicaltrials.gov/ct2/show/NCT04600596).

1. Introduction

Accurate assessment of dietary intake plays a crucial role in nutrition and health research, enabling the examination of the relationship between diet and human health (1, 2).

Traditional dietary assessment tools, including food records, food frequency questionnaires, and 24-hour dietary recalls (24HR), have been widely used in nutrition research to capture valuable information on the types and quantities of foods, beverages, and supplements consumed (3–5). One of the most popular and commonly used tools for assessing individuals’ dietary intake in research is the 24HR. This tool, which evaluates an individual nutritional intake over a 24-hour period, can be interviewer-administered or self-administered, providing a relatively simple and participant-friendly approach (6). Notably, the use of 24HR has demonstrated lower misreporting incidence, as well as reduced degrees and variations of underreporting compared to other dietary assessment tools (7).

However, researchers encounter various challenges when measuring energy and nutrient intake in humans. Current dietary assessment tools may require trained interviewers, are time-consuming, and may exhibit limitations in accuracy and qualitative aspects of dietary habits (7–9). Factors such as dieting status, recall bias, social desirability bias, challenges in estimating portion sizes, and omissions of food items can influence the validity of these tools (2, 3, 7). Furthermore, reporting accuracy differs among various populations. Specifically, evidence suggests that individuals with a higher body mass index (BMI) are more likely to underreport their dietary intake (1). Additionally, variations in the accuracy of reporting energy intake have been noted based on sex (7). These findings highlight the importance of considering such factors when assessing self-reported dietary data.

It is crucial to consider not only the reduction of misreporting but also the impact of time-consuming tasks or functions on energy intake when evaluating a dietary assessment tool. Study participants should not be discouraged or inclined to decrease their (reported) energy intake due to the tool’s complexity or time requirements (10, 11). These factors might influence compliance and accuracy, ultimately affecting the validity and reliability of the collected dietary data.

Novel technologies, including web-assisted intake assessments, digital photography, and mobile applications (apps), have been developed to enhance the effectiveness of dietary assessments, aiming to minimize estimation errors, reduce the burden on participants and investigators, and improve the accuracy and efficiency of dietary intake assessments (12–14). These advancements in technology have offered researchers an opportunity to use novel and accurate tools for evaluating dietary intake in real-life settings, contributing to a comprehensive understanding of the impact of diet on overall health and well-being.

Motivated by advancements in information and communication technology, researchers have explored innovative approaches to improve the accuracy of dietary assessment tools (3, 12, 15). Technology-based approaches have gained popularity over conventional tools in the past decade, demonstrating the ability to collect almost real-time data, reduce memory bias, gain user acceptance, and cater to individuals with low literacy levels (4).

Image-assisted dietary assessment tools, utilizing wearable cameras or smartphones, complement traditional tools by capturing images of food and beverages to enhance accuracy (16). On the other hand, image-based approaches rely primarily on captured images as the main source of information on dietary intake (17). Smartphone apps are used in image-based approaches for food recognition and volume and energy estimation, simplifying the recording of dietary intake for both researchers and study participants (18).

Smartphones equipped with advanced features, such as high-resolution cameras, ample memory capacity, strong network capabilities, and faster processors, make them valuable tools for collecting dietary information and capturing food images (19). However, utilizing smartphones for dietary assessment requires training, technical development, secure data transfer infrastructure, and accurate portion size estimations by trained researchers (4). Image-based approaches minimize errors from memory recall and portion size estimations (17) and have shown higher estimates of energy intake per eating occasion compared to proxy-assisted records (16), reducing under-reporting when recording food intake with smartphones.

Currently, several apps for image-based dietary assessment are available (18), and their use has become feasible due to the widespread ownership of smartphones. A novel image-based food-recognition app, SNAQ, utilizes depth-sensing hardware and computer vision to quantify the macronutrient content and quantity of photographed dietary items within the app (20). Designed for real-life scenarios, SNAQ assesses the type and volume of recognized food items, providing information on portion size, macro- and micronutrient intake, food type, eating time, and frequency. Notably, the app is user-friendly and requires less specialized training compared to traditional dietary assessment tools. Its potential as a dietary assessment tool extends to estimating energy, macro-, and micronutrient intake and understanding dietary behavior.

In a previous research work by Herzig et al., it was suggested that SNAQ allows highly accurate volume estimation across a wide range of food items, exhibiting efficient segmentation performance and fast processing time in a controlled setting (20). However, further research is needed to evaluate the validity, accuracy, and precision of these new tools in real-life scenarios (21).

The doubly labeled water (DLW) technique is commonly used to assess the validity of dietary assessment tools by estimating the energy expenditure of an individual during a defined period, usually 7–14 days (22). Based on the assumption of energy balance, hence of body weight stability during the observation period, the energy expenditure estimated with DLW can be compared to the energy intake estimated with the dietary assessment tool under investigation. If the body weight of the individual remains stable during the study period, then the estimated energy intake must equal the energy expenditure. Therefore, any measurement differences between the methods must either be explained by the changes in body weight or be imputed to the dietary assessment tool and the user. This biomarker technique can reveal misreported data of food intake (1, 3).

One approach to assess energy intake reported with a dietary assessment tool is to objectively measure energy intake based on the principle of energy balance by measuring total energy expenditure plus changes in body energy stores, such as fat mass (FM) and fat-free mass (FFM) (22), and to explore the strength of the linear relationship between reported energy intake and body composition (23). Indeed, previous findings suggest that total daily energy intake (TDEI) is proportional to total daily energy expenditure (TDEE) and FFM, but not FM (23–28). These findings suggest not only that energy expenditure itself may have an influence on energy intake (29), but also indicate the presence of a fundamental drive to eat, which serves to meet the energy requirements of vital tissues, organs, and metabolic processes through regular food intake (30–32).

When assessing the validity of an assessment tool for dietary intake, it is important to use multiple statistical tests to gain a comprehensive understanding of its performance. Different facets of validity, such as accuracy, reliability, and agreement, may require different statistical approaches (33). Each of these facets provides distinct information to obtain a more thorough assessment of the performance of the tool. These insights should then be carefully considered when formulating conclusions about the validity of the tool, ensuring they are appropriately aligned with the specific research question and context of the study. In the assessment of validity, it is essential to determine the extent and direction of measurement error, identify factors contributing to such errors, and explore methods to mitigate or address them in the data analyses (33, 34). Furthermore, it is necessary to identify over- and under-reporters by comparing TDEI with TDEE with respect to the basal metabolic rate (Goldberg cut-off points method) before drawing any conclusions regarding validity for a target population (33, 35).

This study aimed to investigate the validity of the SNAQ app as a dietary assessment tool for daily energy intake in adult women with normal weight. The research question was limited to this source population to account for possible confounding of sex and misreporting of individuals with higher BMI. The primary objective was to investigate the agreement between TDEI estimated with the SNAQ app and TDEE estimated with the DLW technique. The secondary objective was to investigate the agreement between TDEI estimated with 24HR and TDEE estimated with DLW. In addition, the agreement between SNAQ and 24HR for TDEI and daily macronutrient intake was also investigated.

2. Materials and methods

2.1. Setting and study design

A cross-sectional observational study was carried out in 30 adult women with a normal body weight [body mass index (BMI) ≥ 18.5 kg/m² and ≤ 24.9 kg/m²] and in free-living conditions. Recruitment and study sessions were conducted between April 2020 and August 2022 at the University Hospital Zurich, Department of Surgery and Transplantation. This investigation is part of a prospective cohort study conducted at the same study site, which explores changes of food intake and selection in patients after Roux-en-Y Gastric Bypass surgery.

The study was conducted according to the Declaration of Helsinki. Ethical approval for this study was received from the Cantonal Ethics Committee of Zurich (BASEC-Nr. 2019-00952) and the study protocol is a secondary analysis of a prospective observational cohort study registered on ClinicalTrials.gov with the unique identifier NCT04600596. Written informed consent was obtained from all the study participants. The results of this study are according to the STROBE-nut statement (36) (Supplementary Table S1).

2.2. Participants

Study participants were required to have self-reported literacy in smartphone technology and no history of metabolic and bariatric surgery. Recruitment of study participants was performed at both the University Hospital Zurich and at the University of Zurich, located in Zurich, Switzerland. Exclusion criteria included (1) systemic or gastrointestinal disorders that could impact food consumption or preferences, (2) type 1 or 2 diabetes mellitus, (3) current use of medication or dietary supplements with metabolic and absorptive effects, (4) pregnancy or lactation, (5) adherence to a diet limiting energy intake, (6) renal failure, (7) congestive heart failure, (8) malabsorption syndrome, (9) active and clinically significant psychiatric conditions, such as eating disorders, and (10) inability to comprehend instructions in either German or English (Supplementary Table S2). After inclusion in the study, the study participants received a study identifier and a user account for the study app (see Section 2.7).

The decision to include in this study only adult women with a normal body weight was based on the evidence suggesting that both women and men report different dietary behaviors (7). It remains uncertain whether this is an actual difference or caused by systematic misreporting. Therefore, to avoid any sex-related bias in our results, we decided to exclude men from our study. Furthermore, an increased BMI has been associated with underreporting of energy intake. Therefore, we included only adult women with healthy weight.

2.3. Calculation of sample size

The sample size was determined according to the primary outcome of the main study. For this study, using the most conservative assumptions, a difference in energy estimates of 350 ± 450 kcal/day (effect size 0.78) between TDEI reported with SNAQ and TDEE estimated with DLW was considered relevant in assessing the validity of SNAQ (37, 38). Therefore, it was estimated that 27 study participants would have been necessary to have 80% power to detect this difference at a 5% significance level. Accounting for a 10% dropout rate, a sample size of 30 was planned for this study.

In the context of research studies assessing validity of new dietary assessment tools for estimation of TDEI, some authors explicitly refrain from sample size calculation because of recruitment difficulties (39), do not state the calculation of their sample size (40), or simply refer to theoretical papers (41). Therefore, in the absence of a consensus and of thoroughly motivated a-priori sample size calculations, it is challenging to define on an individual basis what would be an appropriate accuracy threshold for assessing clinical or research validity of a new dietary assessment tool. Nevertheless, the average sample size of studies assessing the validity of dietary assessment tools by agreement with DLW is 27 (7). In these studies, the sample size was higher when the study design included a laboratory setting (42–44), and it decreased when the study design included free-living conditions (10, 43). Therefore, we considered the calculated sample size (n = 30) as appropriate for answering the research question of the main study.

2.4. Study outcomes

The primary outcome of this study was the level of agreement between TDEI reported with SNAQ and TDEE estimated with DLW.

The secondary outcomes of this study were the levels of agreement between TDEI reported with 24HR and TDEE estimated with DLW, and the levels of agreement between SNAQ and 24HR for (1) TDEI and (2) macronutrient intake.

Further outcomes were defined to investigated different facets of validity:

1. The strength of linear relationship between TDEE estimated with DLW and TDEI reported with SNAQ and 24HR,

2. The differences of TDEI reported with SNAQ and with 24HR in relation to TDEE estimated with DLW – hereby defined as measurement differences,

3. The comparison, in term of over- and underestimation of energy estimates, of SNAQ and 24HR, to DLW,

4. The classification of over- and under-reporters according to the Goldberg cut-off points,

5. The strength of linear relationship between body composition (FFM and FM) and TDEI reported with SNAQ and 24HR,

6. The strength of linear relationship between body weight changes during the study week and measurement differences of SNAQ and 24HR from DLW,

7. The strength of linear relationship between TDEE during the study week and measurement differences of SNAQ and 24HR from DLW,

8. The extent of stability of diet consumption over the study week,

9. The strength of linear relationship between TDEI reported with SNAQ and TDEI reported with 24HR,

10. The difference in TDEI between estimates of SNAQ and 24HR,

11. The difference in macronutrients intake between estimates of SNAQ and 24HR, and

12. The comparison, in term of over- and underestimation, of SNAQ and 24HR for TDEI and macronutrient intake.

2.5. Study protocol

At the screening visit, informed consent was provided by the study participants, inclusion and exclusion criteria were assessed, and data on demographics and anthropometrics were collected. After inclusion in the study, two study visits were set with the study participants at the beginning and at the end of a period of 8 days. This period was necessary to collect data on energy intake and energy expenditure for a period of seven complete days (168 h), where a day was defined as a period of 24 hours. The eight-day period will be hereafter referred to as study week. Participants were instructed to choose a week that reflected their typical daily routines, excluding any exceptional outings or activities.

At the first visit of the study week, the study participants arrived at the study location between 7:00 am and 8:00 am after an instructed overnight fast of at least 8 hours. Baseline anthropometric measurements were performed. The study participants self-reported their dietary habits (whether they adhered to any special diet and which one), smoking habits (whether they smoke or not, independently of the amount and type of cigarettes consumed), and physical activity levels (how many hours per week do they train, independently of the type of activity). Information on food intake in the day immediately preceding the study visit was collected with a 24HR (see Section 2.8). Body composition was measured with a bioelectrical impedance analysis. A pre-dose urine sample was collected. Subsequently, study participants were asked to ingest an individualized dose of DLW (Supplementary Table S3). A second and third urine sample was collected 3 and 4 hours after the dose was ingested, respectively. The overnight fast is observed until the collection of the last urine sample.

On the first day of the study week, before leaving the research facility, the study participants were provided by one of the two study co-investigators with standardized instructions on how to use the app, and a demonstration on how to record food items was given. The study investigator asked the study participant to perform a mock recording. This action was undertaken to verify the capability of the participant to accurately implement the instructions that were provided. Should any difficulties arise during this preliminary phase, supplementary guidance was furnished until the participant exhibited both a comprehensive understanding of the functionality of the app and a proficiency in adhering to the directives provided. In case a study participant did not own a suitable smartphone for the purpose of the study (see Section 2.8), a study smartphone was provided for the duration of the study week. Study participants started reporting their food intake for every eating occasion following the conclusion of the study visit. An “eating occasion” was defined as any occasion where any food or caloric beverage was ingested (45). Three semi-customized prompts were set within the app at the habitual times of breakfast, lunch, and dinner of each study participant. Study participants were also asked to refrain from intense physical activity during the study week or any activities which could cause significant loss of water as sweat. Additionally, study participants were asked not to travel abroad during the study week.

On the eighth day of the study week, study participants returned to the study location for a second study visit before 12 p.m. and did not have to observe an overnight fast. Body weight was measured, and a fourth and fifth urine sample was collected at the same hours of the day as the second and third urine samples, respectively. In case a study participant received a study smartphone, they handed it back at this study visit.

A diagram of the study protocol is shown in Figure 1.

FIGURE 1

Figure 1. Study design. 24HR, 24-hour dietary recall; DLW, doubly labelled water.

2.6. Anthropometric measurements and body composition

Participant height was measured to the nearest 0.01 m with the study subject standing with her back to a wall-mounted stadiometer in bare feet (Seca 274, Seca, Hamburg, Germany). Body weight was measured to the nearest 0.1 kg with a calibrated scale (Seca, Hamburg, Germany). BMI was calculated as weight in kilograms divided by height in squared meters (kg/m²). Body composition, including total body water (TBW), fat-free mass (FFM), fat mass (FM), and resting energy expenditure (REE), was measured using a multi-frequency 8-point stand-on bioelectrical impedance device (Seca mBCA 515, Seca, Hamburg, Germany). Additionally, TBW, FFM, and FM were determined by dilution of oxygen-18 (¹⁸O) and deuterium (²H). Isotope dilution spaces (kg) were calculated by the plateau method (46). TBW (kg) was determined by averaging the deuterium dilution space divided by 1.041 and the oxygen dilution space divided by 1.007, with corrections made for in vivo isotope exchange (47). FFM (kg) was estimated as TBW divided by 0.732, assuming a hydration factor of 0.732 and considering the body fat to be hydrophobic (48). FM (kg) was estimated by subtracting FFM from the body weight.

The Basal Metabolic Rate (BMR) was calculated with the Mifflin-St Jeor equation for females (49):

\begin{array}{l} B M R = (10 * weight (kg)) + (6.25 * height (cm)) - \\ (5 * age (years)) - 161. \end{array}

The Mifflin-St Jeor equation is considered more accurate than the Harris-Benedict equation, and is recommended for calculating BMR in adults (50).

2.7. Total daily energy expenditure

Total daily energy expenditure (TDEE) was measured over a seven-day period with the DLW technique (51). The body weight of the study participant was assumed to be stable in the 7-day period of the observation. Briefly, each study participant consumed a dose mixture of 1.8 g of 10 atom% oxygen-18-(¹⁸O)-labeled water (H₂¹⁸O, Cambridge Isotopes Laboratories, Inc. Tewksbury, Massachusetts, United States) per kg body water and of 0.12 g of 99.9 of atom% deuterium-(²H)-labeled water (²H₂O, Cambridge Isotopes Laboratories, Inc. Tewksbury, Massachusetts, Unites States) per kg of body water (Supplementary Table S4). Following the “two-point” method protocol described by Schoeller (46, 52), a total of five urine samples were collected for each study participant. One pre-dose and two post-dose urine samples collected at 3 and 4 hours after dosing were collected on the first day. The last two urine samples were collected after 7 days at the same time of the second and third samples, respectively. Isotope analyses of the urine samples were performed at the Centre for Isotope Research (CIO), University of Groningen, Netherlands using an optical spectrometer (LGR LWIA 912–0050, ABB Ltd. – Los Gatos Research, San Jose, California, United States). The measurement analysis (including the sample memory correction) and the calculation of isotopic abundances of ²H and ¹⁸O were done according to the isotope measurement method described by Wang et al. – although in the present study limited to DLW (53). Reference waters (DLW) of the International Atomic Energy Agency (IAEA) were used for calibration (54). Total production of respiratory carbon dioxide (rCO₂) for calculation of TDEE was calculated according to a new equation proposed by Speakman et al. (55):

{r C O}_{2} = 0.4554 * N * [(1.007 * k_{O}) - (1.043 * k_{d})] * 22.26,

where N is the total body water (TBW) estimated using the dilution spaces of both isotopes, and k_O and k_d are the two elimination constants for ¹⁸O and ²H, respectively. TDEE was calculated from rCO₂ with the Weir equation (56):

TDEE = {r C O}_{2} * (1.106 + (\frac{3.94}{R Q})),

where RQ is the respiratory quotient, defined as the proportion between the release of CO₂ and the O₂ consumption. TDEE gives an estimation of the daily energy intake under the hypothesis of energy balance, and therefore body weight stability, during the 7 days of the two-point protocol. The use of the coefficients used in the calculations have also been explained by Wang et al. (53).

2.8. Dietary assessment with 24HR

Information on dietary intake in the day immediately preceding the study visit was collected by trained study investigators with a 24HR. The study investigators used a standardized pen-and-paper interview, inquiring about all foods, beverages, and supplements consumed in the preceding 24 hours. During the interview, the type of food item was recorded within set eating occasions (breakfast, lunch, dinner, and snacks) together with the approximate self-reported portion size (e.g., tablespoon, cup, plate). Subsequently, the collected recall data were anonymized, transformed into digital format, and processed utilizing a dietary analysis software (Nutritics, version 5.77, 2021, Nutritics, Dublin, Ireland). The software includes food composition data from over 200,000 food items sourced from global food databases, including those from the United States (United States Department of Agriculture), United Kingdom (McCance and Widdowson), and other European countries. It also includes branded food products and raw ingredients. Total daily energy intake (TDEI, in kcal and kJ) and macronutrient intake (carbohydrates in g, fats in g, saturated fats in g, proteins in g, fibers in g, and dietary occasions in n) were calculated with Nutritics for each study participant. The estimation of energy intake with the 24HR was not the central focus of this study. Nevertheless, the findings could be instrumental for researchers exploring the congruence between the emergent image-based food-recognition technology and conventional tools.

2.9. Dietary assessment with SNAQ

The app SNAQ study (SNAQ AG, Zurich, Switzerland), a customized version of the commercial app SNAQ designed specifically for this study, was used by all study participants to self-report their food intake during seven consecutive days. SNAQ study was available in two language versions, English and German. Unlike the commercial version of SNAQ, SNAQ study features buttons for labeling images of meals as (1) before (English version) or vorher (German version), which referred to any image taken at the beginning of an eating occasion, (2) after (English version) or nachher (German version), which referred to any image taken at the end of a eating occasion where the meal was not completely consumed, and (3) ate everything (English version) or alles aufgegessen (German version), which referred to any image taken at the end of an eating occasion where the meal was completely consumed. Moreover, participants were not granted access to any data pertaining to energy and macronutrient content of the recorded meals which would be normally available on the commercial version of the app. Other attributes of SNAQ study were analogue to those of the commercial version. The customized version of the app will hereafter 650 be referred to as app or SNAQ for simplicity.

The food-recognition function of the app is based on computer vision, a type of artificial intelligence for identification and analysis of objects in images. The quality of the results provided by computer vision improves when a depth-sensing camera is used (20). Therefore, following the recommendations of SNAQ AG, only iPhone models X, 11, or 12 (Apple Inc., Cupertino, California), which all include depth-sensing camaras, were selected for this study. For iPhone models X and 11, photographs were taken using the front depth-sensing camera. For iPhone models 12 or above, the back depth-sensing camera was used.

If available, the study participants were asked to install SNAQ study on their personal smartphone. Otherwise, they were provided with a study smartphone (iPhone model 11; Apple Inc., Cupertino, California) solely for the purpose of using SNAQ during the study period.

The study participants were instructed to capture two images (“before,” and “after” or “eat everything”) for each eating occasion. Eating occasions included breakfast, lunch, dinner, snacks, and caloric beverages, except for water and any other energy-free beverages, and supplements. The first image had to be taken prior to consuming the food item(s), while the second image had to be taken at the conclusion of the eating occasion and needed to display any unconsumed food items or the empty plate or glass. Furthermore, in the event of any technical complications or non-adherence to the protocol, participants were instructed to contact a study investigator for instructions. Interactions between the study participants and the investigators were conducted either telephonically or through text messaging, utilizing the personal phone of the participant and the work phone of the study investigator. This communication was permitted at any time to address inquiries specifically related to the study app. Continuous, real-time support was systematically provided by one of the two co-investigators, ensuring availability around the clock to assist with any emergent questions or concerns.

To overcome a potential measurement error of energy intake due to missing inputs of food items the study protocol included the use of semi-customized prompts. Three daily push notifications were set on the study smartphones. The notifications would activate 15 min before three customized times selected by the study participants. These should correspond approximately to the three main eating occasions of the study participants, i.e., breakfast, lunch, and dinner.

The user interface for recording of the eating occasions is shown in Figure 2. An image of the food item(s) was captured with the built-in camera app by study participants as described above (Figure 2A). The food item(s) had to be placed on a white plate or in a glass and the image had to be taken from a distance of approximately 30 centimeters above the item (Figure 2B). The study participants were instructed to ensure appropriate illumination when the image was taken. If the image appeared blurred, the study participants were instructed to take a photograph with a better quality to improve the performance of the app. Each image had to be labelled, as described above, to indicate whether it was taken before or at the end of the dietary occasion (Figure 2C). The computer vision in the app would then automatically recognize the type and portion size of the food item. If distinct food items were recorded in the image, the computer vision would perform its analysis for these items separately (Figure 2D). The recognized type and portion sizes could be manually corrected by the study participants if the dietary item was not correctly recognized by the app (Figure 2E). The app is designed in such a way that it does not provide the study participants with additional information regarding the macronutrient composition or caloric content of the food items that have been recorded.

FIGURE 2

Figure 2. User interface for recording of the eating occasions. (A) An image of the food item(s) was captured with the built-in camera app by study participants. (B) The food item(s) had to be placed in a white plate or a glass and the image had to be taken from a distance of approximately 30 cm above the item. (C) Each image had to be labelled to indicate whether it was taken before or at the end of the dietary occasion. (D) If distinct food items were recorded in the image, the computer vision would perform its analysis for these items separately. (E) The recognized food type(s) and portion sizes could be manually corrected by the study participants if a food item was not correctly recognized by the app. Translation of the German words in the panels: alles aufgegessen, ate everything; Malzheit erfassen, register a meal; nacher, after; vorher, before.

Maintaining consistent compliance among study participants for each recording, in accordance with all the instructions delineated for the SNAQ app, presented an insurmountable challenge in a free-living setting. This limitation played a significant role in the decision to utilize DLW as a robust reference method. The concordance between the energy estimates obtained through SNAQ and those derived from DLW served a dual purpose. It not only evaluated the accuracy of the tool itself but also the capability and adherence of the participants as a cohesive entity. In doing so, this comparison effectively assesses the validity of the overall use of the app under the specific conditions defined in this study.

The app utilizes a food density database (SNAQ AG) to transform the estimated food volume into food weight. Then, the app utilizes the Swiss Food Composition Database (57) to estimate the energy content of the recorded food items (20). Given that the computer vision functionality of the app relies on a deep learning model specifically trained on an extensive image catalog of common Swiss foods, the model – at least in the version of the app dedicated to this study – lacks the capability to discern the nutritional values delineated on product labels or to identify product names through natural language processing. Nevertheless, it is important to note that the deep learning model has undergone training with images of packaged food, beverages, and mixed meals. Furthermore, comprehensive nutritional data on a wide array of Swiss food items, encompassing both commonly and rarely commercialized products, is accessible through the Swiss Food Composition Database. Consequently, the information retrieval of the app pertaining to packaged food items is based on this deep learning training, referencing the Swiss Food Composition Database, rather than utilizing natural language recognition techniques.

Data on time of the eating occasions, type of recorded food items, portion sizes, energy and macronutrient content were saved on the servers of SNAQ AG and were extracted at the end of the study for data analysis. Total daily energy intake (TDEI, in kcal and kJ) and macronutrient intake (carbohydrates in g, fats in g, saturated fats in g, proteins in g, fibers in g, and dietary occasions in n) were calculated from the extracted data.

2.10. Misreporting of total daily energy intake

The misreporting of energy intake was investigated according to the principles of the Goldberg cut-off points (58) following the approach proposed by Black (35). The cut-off points were used for the classification of study participant to comparative purposes with other studies and not for the detection of outliers to be excluded from the data analysis. Under-reporters were defined as having a TDEI:BMR ratio smaller than the lower 95% confidence limit, and over-reporters were defined as having a TDEI:BMR ratio greater than the upper 95% confidence limit. The following two relationships were assessed for the 30 study participants considered in this study with the values of TDEI reported with SNAQ and 24HR:

\frac{TDEI}{B M R} > PAL \times e^{[s . d ._{min} \times \frac{\frac{S}{100}}{\sqrt{n}}]},

\frac{TDEI}{B M R} < PAL \times e^{[s . d ._{max} \times \frac{\frac{S}{100}}{\sqrt{n}}]}

where BMR is the basal metabolic rate, PAL is the mean physical activity level for the study population, s.d._min is −2 for the 95% lower confidence limit, s.d._max is +2 for the 95% upper confidence limit, and n is the number of study participants. BMR was calculated with the Mifflin-St Jeor equation. PAL was calculated as TDEE:BMR ratio. S is the factor that incorporates the variation in energy intake, BMR, and energy requirements, and was determined by:

S = \sqrt{\frac{{C V}_{wTDEI}^{2}}{d} + {C V}_{w B}^{2} + {C V}_{t P}^{2}},

where CV_wTDEI is the mean within-subject coefficient of variation for TDEI reported with SNAQ, d is the number of days of dietary assessment, CV_wB is the coefficient of variation of repeated BMR measurements, and CV_tP is the total variation in PAL. CV_wTDEI was calculated for this study only for TDEI reported with SNAQ. As far as regard TDEI reported with 24HR, no repeated measurements were available according to the present study protocol. For this study, two values of CV_wTDEI and of CV_tP were used for the calculation of S: (1) the value of CV_wTDEI and CV_tP specific for this study population, and (2) the value of CV_wTDEI of 26% and the value of CV_tP of 16.5% suggested by Black for adult women with normal weight (35). This value of CV_tP was suggested for women between 18 and 29 years of age and a mean PAL of 1.70 (35). As far as regards CV_wB, the coefficient 4.1% was used. This value of CV_wB was suggested for women in free-living conditions (59).

The CV_wTDEI has been calculated for this study according to the formula (35):

{C V}_{wTDEI} = \sqrt{\sum_{i = 1}^{n} \frac{{C V}_{i}^{2}}{n}},

where CV_i is the coefficient of variation (CV) calculated for each study participant from the number of days of dietary assessment with SNAQ available for that study participant, and n is the number of study participants. The CV_i was calculated by dividing the standard deviation (SD) of the variable of interest by the mean for each study participant. The result was multiplied by 100 to express the CV as a percentage. This calculation provided a measure of the variability within the data of each study participant, expressed as a percentage of the mean value. It is commonly used to assess the reliability or consistency of measurements within individuals. CV_tP, being a coefficient of variation, was calculated with the same approach of the CV_i of CV_wTDEI.

2.11. Statistical analysis

Data analysis was performed using R (version 4.2.2) (60) through RStudio (version 2022.07.1) (61). Barplots, boxplots, scatter plots, violin plots, and Bland–Altman plots (Figures 3–7 and Supplementary Figures S1–S7) were generated with the ggplot2 R package (version 3.4.0). All figures were organized using Adobe Illustrator (version 27.4.1).

FIGURE 3

Figure 3. Bland–Altman plots for agreement of the SNAQ app and the 24-hour dietary recall (24HR) with the doubly labelled water (DLW) technique and correlation analysis of linear relationship between energy expenditure and the energy intake estimated with SNAQ and 24HR. (A) Bland–Altman plot for agreement on energy intake between SNAQ and DLW. (B) Bland–Altman plot for agreement on energy intake between 24HR and DLW. The bias is represented as a blue horizontal dotted line. The value of the bias is estimated by the mean difference in energy estimation between DLW and SNAQ or 24HR. The 95% limits of agreement (LoA) are represented as two brown dotted lines and are defined as mean difference ± 1.96 standard deviations. (C) Linear correlation between energy estimates of SNAQ and DLW. (D) Linear correlation between energy estimates of 24HR and DLW. 24HR, 24-hour dietary recall; CI, confidence interval; DLW, doubly labelled water; K, decimal unit suffix for thousand; LoA, limit of agreement, p, p-value of the coefficient of determination; R², coefficient of determination of the linear relationship.

FIGURE 4

Figure 4. Bland–Altman plots for agreement between the app and the 24HR. (A) Bland–Altman plot for total daily energy intake. (B) Bland–Altman plots for macronutrient intake. (C) Bland–Altman plot for number of eating occasions. The bias is represented as a black horizontal line. The value of the bias is estimated by the mean difference in intake estimation between 24HR and the app for total daily energy intake and macronutrient intake. The 95% limits of agreement (LoA) are represented as two dotted lines and are defined as mean difference ± 1.96 standard deviations.

FIGURE 5

Figure 5. Measurement differences of total daily energy intake of the app SNAQ and the 24-hour dietary recall (24HR) in relation to the total daily energy expenditure estimated with the doubly labelled water (DLW) technique. (A) Violin plots of the energy estimates of SNAQ, DLW, and 24HR. Mean of the groups are represented with full points in the middle of the violin plots. Standard deviations are represented with a vertical line in the middle of the violin plots. Pairwise comparisons of SNAQ and DLW, and 24HR and DLW are represented with horizontal lines between the violin plots. p-values of the pairwise comparisons are reported above the horizontal lines. Significance levels were adjusted for multiple comparison (3x) and are reported below the horizontal lines. They are expressed as follows: *when p-value < 0.05; **when p-value < 0.01; *** when p-value < 0.001. (B) Mean percentage differences for estimations of daily energy intake of SNAQ and 24HR in relation to DLW. (C) Mean percentage differences for over- and underestimations of total daily energy intake of SNAQ and 24HR in relation to total daily energy expenditure estimated by DLW. The number of over- and underestimations between study participants are reported as numbers below the bars, for overestimations, or above, for underestimation. 24HR, 24-hour dietary recall; DLW, doubly labelled water; K, decimal unit suffix for thousand; MPD, mean percentage difference; over, overestimation of SNAQ and 24HR in relation to DLW; under, underestimation of SNAQ and 24HR in relation to DLW.

FIGURE 6

Figure 6. Correlation analysis of linear relationship between body composition (fat mass, FM and fat-free mass, FFM) and the energy intake estimated with the SNAQ app and the 24-hour dietary recall (24HR). (A) Linear correlation between daily energy intake estimated with SNAQ and FM. (B) Linear correlation between daily energy intake estimated with SNAQ and FFM. (C) Linear correlation between daily energy intake estimated with 24HR SNAQ and FM. (D) Linear correlation between daily energy intake estimated with 24HR and FFM. 24HR, 24-hour dietary recall; p, p-value of the coefficient of determination; R², coefficient of determination of the linear relationship.

FIGURE 7

Figure 7. Changes in body weight during the study period and correlation analysis of linear relationships of measurement differences of the SNAQ app and the 24-hour dietary recall (24HR). (A) Changes of body weight of the study participants during the study week. Bars on the left of the y-axis represent body weight loss. Bars on the right of the y-axis represent body weight gain. (B) Linear correlation between changes of body weight during the study week and percentage of measurement differences of daily energy intake estimated with SNAQ in relation to DLW. (C) Linear correlation between estimations of daily energy expenditure during the study week and percentage of measurement differences of daily energy intake estimated with SNAQ in relation to DLW. (D) Linear correlation between changes of body weight during the study week and percentage of measurement differences of daily energy intake estimated with 24HR in relation to DLW. (E) Linear correlation between estimations of daily energy expenditure during the study week and percentage of measurement differences of daily energy intake estimated with 24HR in relation to DLW. Δ, difference from baseline; %Δ, percentage difference; 24HR, 24-hour dietary recall; DLW, doubly labelled water; K, decimal unit suffix for thousand; p, p-value of the coefficient of determination; R², coefficient of determination of the linear relationship.

A Bland–Altman-analysis (62) was performed to assess the level of agreement between the three methods of energy estimation. This analysis is widely used to visualize the differences between two methods of medical assessment by plotting the mean measurements of the two methods (x-axis) against the differences in measurements between the two methods (y-axis). Differences were expressed in relation to the DLW values – when DLW was part of the analysis – or 24HR values. Then, lower and upper limits of agreement (LoA) were derived from the mean difference between methods ±1.96 SD (63). Possible systematic bias was assessed visually on the Bland–Altman plots. However, any level-dependent bias identified through this visual inspection was not further addressed in the analysis.

The strength of the linear relationships stated as further outcomes was evaluated using a Pearson’s correlation coefficient. The correlation coefficient (r) was calculated to assess the magnitude of the relationship. The effect size was interpreted based on the magnitude of the coefficient of determination (R²), with values closer to +1 or −1 indicating a stronger relationship. The R² represents the proportion of variance in one variable that can be explained by the other variable in a linear regression model. The significance of the relationship was assessed using the corresponding p-value, with p < 0.05 considered statistically significant. The significance value of all coefficients of determination was set at 0.05, but the p-values were adjusted with a 3x Bonferroni correction to account for all correlation analyses performed between any variable and both SNAQ and 24HR.

The differences in energy estimates between the methods were calculated by subtracting the energy estimates of the reference method (either DLW or 24HR) from the values of TDEI estimated with SNAQ or 24HR. The percentage differences of energy estimates between the methods were calculated as percentage change from the values estimated by the DLW. A paired difference test was performed to test the null hypothesis (H₀) of no difference between TDEI estimated either with SNAQ or with 24HR and TDEE estimated with DLW. The H₀ of no difference between TDEI estimated with SNAQ and TDEI estimated with 24HR was also tested. A t-test was used when the differences followed a normal distribution, while the Wilcoxon signed-rank test was used for non-normally distributed differences. The data distribution was assessed for normality using the Shapiro–Wilk test. The alternative hypothesis (H₁) was a difference between TDEI estimated either with SNAQ or with 24HR and TDEE estimated with DLW, or a difference between TDEI estimated with SNAQ and TDEI estimated with 24HR. The significance value was set at 0.05, but the p-values were adjusted with a 3x Bonferroni correction to account for the three comparisons.

The extent of stability of diet consumption over the study week was assessed with an intraclass correlation coefficient (ICC). This provides an indication of the consistency of measurements when the measurement process is repeated under identical conditions. The ICC was calculated with a mixed-effects model as the ratio of the between-group variance to the total variance (sum of the between-group variance and the residual variance). This model considered both the variation within individuals (within-subject variation) and the variation between individuals (between-subject variation). Considering that there were 30 study participants with seven consecutive daily measurements and no intervention, a random-intercept model was fit with each study participant as a random effect. The ICC was derived from the ratio of the between-subject variance to the total variance (the sum of the between-subject variance and the within-subject variance). The ICC ranges between 0 and 1, with higher values indicating greater similarity between the day-to-day intake measurements within individuals.

The residual mean square, the between-group mean square, and the within-subject standard deviation calculated with the mixed-effects model were also reported to provide an order of effects of the model underlying the ICC. The residual mean square represents the variability of daily energy intake measurements within the same study participant. The between-group mean square represents the variability between study participants and, in the context of the ICC, this value should be much larger than the residual mean square. Indeed, a larger variability between study participants than within study participants would contribute to a higher ICC value. The within-subject standard deviation is a measure of how spread out individual scores are from the mean within the same study participants across different measurements. If this value is small relative to the between-subject variability, the ICC will be higher.

3. Results

3.1. Characteristics of the study participants

A total of 41 adult women with normal weight were invited to participate in the study. Thirty candidates were eligible according to the inclusion and exclusion criteria. The baseline characteristics of the study participants are reported in Table 1. The mean age of the participants was 28.5 (SD ± 6.7) years, and the mean BMI was 21.6 (SD ± 1.6) kg/m². Among the 30 study participants, three (10%) were identified as smokers. Approximately 40% of participants had dietary preferences on which their individual diets were based. Most of the participants (87%) engaged in physical activity for at least 1–2 h per week. Body composition estimated with bioimpedance analysis and isotope dilution spaces is reported in Supplementary Table S5. Nationalities of study participants are reported in Supplementary Table S6.

TABLE 1

Table 1. Baseline characteristics of the study participants.

3.2. Descriptive statistics and normality of energy estimates

Mean TDEI estimated with SNAQ and 24HR and mean TDEE estimated with DLW are presented with SDs in Table 2. Mean TDEI estimated with SNAQ was 1,905.5 kcal/day (SD ± 531.1), and with 24HR 1,692.1 kcal/day (SD ± 632.5). TDEE estimated with DLW was 2,235.2 kcal/day (SD ± 456.5). The results of the DLW analysis are reported in Supplementary Table S7.

TABLE 2

Table 2. Energy estimates of the two dietary assessment tools and the DLW technique.

The values of energy were normally distributed for TDEI estimated with SNAQ and 24HR, but not for TDEE estimated with DLW (Supplementary Table S8). A visual analysis of the Quantile-Quantile (QQ) plots as well showed a larger departure of the DLW data points from their assumed normal distribution (Supplementary Figure S1A). Furthermore, an outlier was visible in QQ plots of 24HR and DLW far above the line of expected normal distribution. Nevertheless, a density plot showed that the density areas of SNAQ and DLW overlapped more than the density areas of DLW and 24HR (Supplementary Figure S1B).

3.3. Agreement between TDEI and TDEE

Results from the Bland–Altman analysis for agreement of TDEI estimated with SNAQ and TDEE estimated with DLW (primary outcome) and for agreement of TDEI estimated with 24HR and TDEE estimated with DLW (secondary outcome for direct comparison with an established dietary assessment tool) are shown in Table 3 and Figure 3. Bias of agreement between energy estimates from SNAQ and DLW was −329.6 kcal/day with LoA from −1,503.8 kcal/day (lower) to 844.5 kcal/day (upper) (Figure 3A). The bias of agreement between energy estimates from 24HR and DLW was −543.0 kcal/day with LoA from −1,802.5 kcal/day (lower) to 716.5 kcal/day (upper) (Figure 3B). A visual analysis of the Bland–Altman plots was negative for systemic bias at low or high values of energy.

TABLE 3

Table 3. Results of the Bland–Altman plot for agreement between SNAQ and DLW, and 24HR and DLW.

3.4. Agreement between SNAQ and 24HR

Results from the Bland–Altman analysis for agreement between TDEI and macronutrient intake estimated with SNAQ and with 24HR (secondary outcome) are shown in Table 4 and Figure 4 (Supplementary Figure S2 for sugars and saturated fats). Bias of agreement between TDEI estimated with SNAQ and with 24HR was 213.4 kcal/day with LoA from lower −774.3 kcal/day to upper 1,201.1 kcal/day (Figure 4A). The bias of agreement was also assessed for the macronutrient intake (Figure 4B) and eating occasions (Figure 4C). Carbohydrate intake had a bias of 33.8 g/day (LoA from −101.5 to 163.7 g/day), sugar intake had a bias of 7.0 g/day (LoA from −46.0 to 60.0 g/day), fat intake had a bias of 3.4 g/day (LoA from −58.4 to 65.2 g/day), and protein intake had a bias of 2.6 g/day (LoA from −28.0 to 29.1 g/day). A visual analysis of the Bland–Altman plots was negative for systemic bias at low or high values of energy and macronutrient intake. However, a systemic bias could be identified in the Bland–Altman plot for number of eating occasions.

TABLE 4

Table 4. Results of the Bland–Altman plot for agreement between SNAQ and 24HR. 24HR was selected as reference method in the analysis.

3.5. Further outcomes

3.5.1. Relationship between TDEI and TDEE

A correlation analysis between the TDEE estimated with DLW and the TDEI estimated with SNAQ and 24HR is shown in Figures 3C,D, respectively. There was no significant linear relationship between TDEI estimated with SNAQ and TDEE (R² = 27%, p = 0.5), nor between TDEI estimated with 24HR and TDEE (R² = 34%, p = 0.2).

3.5.2. Differences between energy assessment tools

Absolute and percentage differences between the methods (SNAQ and DLW, and 24HR and DLW) are shown with SDs in Table 5. Individual values of all study participants are shown in Supplementary Tables S9 (for SNAQ) and S10 (for 24HR) to present the variability of the differences within this study population. The high variability in the differences between the methods is also visible in Supplementary Figure S3.

TABLE 5

Table 5. Overall difference, and over- and underestimations of total daily energy intake estimated with SNAQ and 24HR in relation to DLW.

Differences of the energy estimates of SNAQ and 24HR in relation to DLW are represented as violin plots in Figure 5A. Difference in TDEI estimated with SNAQ and the TDEE estimated with DLW was not statistically significant (−330.0 kcal/day, SD ± 559.0, p = 0.067). However, the p-value estimated for this difference was significant before adjustment with the Bonferroni correction (p = 0.022). There was a statistical difference between the TDEI estimated with 24HR and the TDEE estimated with DLW (−543.0 kcal/day, SD ± 664.6, p = 0.00059).

3.5.3. Over- and underestimations of SNAQ and 24HR in relation to DLW

Despite the finding that both the SNAQ and 24HR underestimated energy intake compared to the values of TDEI expected based on TDEE in conditions of energy balance (Figure 5B), a more detailed analysis revealed instances where energy intake was overestimated (Figure 5C). These overestimations were nonetheless less frequent than underestimations and had a smaller percentage difference from TDEE. TDEI estimated with SNAQ was 12.8% lower than TDEE estimated with DLW. However, this was the result of 4 overestimations with an average of 19.8% ± 10.2% and 26 underestimations with an average of −24.5% ± 17.4%. TDEI estimated with 24HR was 22.9% lower than TDEE estimated with DLW. However, this was the result of 10 overestimations with an average of 34.7% ± 18.0% and 20 underestimations with an average of −31.8% ± 18.9%.

For the definition of study-specific Goldberg cut-off points, an S factor of 26.98% was calculated based on a CV_wTDEI of 42.55%, and CV_tP of 20.24%. Lower and upper 95% confidence limits were 1.51 and 1.84, respectively. According to these Goldberg cut-off points, 6 study participants had a plausible TDEI in relation to their BMR, 20 study participants were under-reporters when using SNAQ, and 4 study participants were over-reporters (Supplementary Table S11). When using 24HR to report their food intake, 3 study participants had a plausible TDEI in relation to their BMR, 25 were under-reporters, and 2 were over-reporters (Supplementary Table S12).

When using the CV_wTDEI of 26% suggested by Black (35) to calculate the S factor (20.04%), lower and upper 95% confidence limits were 1.55 and 1.79, respectively. According to these Goldberg cut-off points, 5 study participants had a plausible TDEI in relation to their BMR, 19 study participants were under-reporters when using SNAQ, and 6 study participants were over-reporters (Supplementary Table S11). When using 24HR to report their food intake, 2 study participants had a plausible TDEI in relation to their BMR, 25 were under-reporters, and 3 were over-reporters (Supplementary Table S12).

3.5.4. Relationship between body composition and energy estimates

A correlation analysis between body composition (FFM and FM) estimated with DLW and the TDEI estimated with SNAQ and 24HR is shown in Figure 6. There was a moderate relationship between FFM and TDEI estimated with SNAQ (R² = 50%, p = 0.01) and between FFM and TDEI estimated with 24HR (R² = 55%, p = 0.005). There was no significant relationship between FM and TDEI estimated either with SNAQ (R² = −18%, p = 1.0) or with 24HR (R² = −3%, p = 1.0).

3.5.5. Relationship between changes in body weight and measurement differences

The changes of body weight of the study participants during the study week are shown in Figure 7A. A correlation analysis of the linear relationship between the changes in body weight and the measurement differences of TDEI estimated with SNAQ and 24HR in relation to TDEE estimated with DLW is shown in Figures 7B,D, respectively. There was a moderate linear relationship between the changes in body weight and the measurement differences of 24HR (R² = 49%, p = 0.015), but not with the measurement differences of SNAQ (R² = 38%, p = 0.086).

3.5.6. Relationship between energy expenditure and measurement differences

A correlation analysis between the TDEE estimated with DLW and the measurement error of TDEI estimated with SNAQ and 24HR in relation to TDEE estimated with DLW is shown in Figures 7C,E, respectively. There was no significant linear relationship neither between the measurement error with SNAQ and TDEE (R² = −38%, p = 0.076) nor between the measurement error with 24HR and TDEE (R² = −17%, p = 0.74).

3.5.7. Stability of diet consumption over the study week

The intraclass correlation coefficient (ICC) was calculated among the 30 study participants to assess the reliability of TDEI measurements for each study participant over a period of seven consecutive days (Supplementary Table S13). The ICC was estimated to be 0.274 (95% confidence interval: 0.137 to 0.460, p < 0.001). This indicated a moderate level of agreement in the stability of diet consumption across the study week. The F-statistic was 3.269 with degrees of freedom equal to 29 and 145 for the numerator and denominator, respectively. The residual mean square was 508,507.1 (kcal/day)², the between-group mean square was 217,928.3 (kcal/day)², and the within-subject standard deviation was 759.0 kcal/day.

3.5.8. Relationship and differences between TDEI estimated with SNAQ and 24HR

A correlation analysis between the TDEI estimated with SNAQ and the TDEI estimated with 24HR is shown in Supplementary Figure S4. There was a significant relationship between TDEI estimated with SNAQ and TDEI estimated with 24HR (R² = 62%, p = < 0.001).

TDEI and macronutrient intake estimated with SNAQ and with 24HR are shown in Table 6. Absolute and percentage differences between the two methods are also reported together with a 95% confidence interval of the absolute differences. Even though there was a high variability in the differences between the two methods, as also shown in Supplementary Figure S5 and Supplementary Table S14, there was no statistical difference between TDEI estimated with SNAQ and with 24HR (213.4 kcal/day, SD ±503.9, p = 0.6). TDEI reported with SNAQ and with 24HR were 1,905.5 ± 531.1 kcal/day and 1,692.1 ± 632.5 kcal/day, respectively.

TABLE 6

Table 6. Comparison of energy intake, macronutrient intake, and dietary occasions between SNAQ and 24HR.

Similarly, there was no statistical difference for the mean intake of any of the macronutrients estimated with SNAQ and with 24HR. Nonetheless, there was a statistical difference in the number of eating occasions reported with the two methods (D = 2.9, p < 0.001). Number of eating occasions reported with SNAQ and with 24HR were 7.5 ± 1.9 and 4.6 ± 1.3, respectively. Differences between the estimates of the two methods are represented as boxplots in Supplementary Figure S6.

3.5.9. Over- and underestimations of SNAQ in relation to 24HR

Percentage differences of SNAQ in relation to 24HR for estimates of TDEI and macronutrient intake are reported in Table 7. Individual values of all study participants are shown in Supplementary Figure S7 to present the variability of the differences within this study population. Even though the overall percentage difference between SNAQ and 24HR indicates an overestimation of intake by SNAQ for TDEI and all considered macronutrients, except for saturated fats, a closer analysis also showed underestimations. Underestimations were however less frequent than overestimations and had a smaller percentage difference from 24HR. For instance, TDEI estimated with SNAQ was 23% higher than with 24HR. However, this was the result of 18 overestimations with an average of 49.0 ± 48.1% and 12 underestimations with an average of −15.3 ± 7.9%. Overestimations for macronutrient intake were 38% for carbohydrates, 28% for sugars, 16% for fats, 23% for proteins, and 30% for fibers.

TABLE 7

Table 7. Over- and underestimations of energy intake estimated with SNAQ in relation to 24HR.

4. Discussion

This study aimed to assess the validity of an image-based food-recognition app, SNAQ, for dietary assessment of daily energy intake, in free-living conditions, of adult women with normal body weight. The validity of the app was assessed in terms of agreement with a reference method for energy expenditure, the DLW technique. Further, the agreement of SNAQ with 24HR for dietary assessment of energy and macronutrient intake was also assessed.

In general, energy intake estimated with SNAQ provided a closer representation of the energy expenditure estimated with DLW than the energy intake estimated with 24HR, while there were no significant differences between energy and macronutrient intake estimated with SNAQ and 24HR.

The comparison of our findings with previous studies is difficult since there are significant variations in the way others have previously evaluated the accuracy of dietary assessment tools. Such variations can be attributed, in part, to the statistical methods employed to determine how well a given dietary assessment tool agrees with a reference method. To address this issue, we conducted a comprehensive assessment utilizing four statistical and three descriptive approaches to assess differences and relationships between energy intake estimated with SNAQ and 24HR and the energy expenditure estimated with DLW. Our objective was to measure agreement at both the individual and group levels, with the aim of providing reliable evidence for the validity of SNAQ as a dietary assessment tool.

To begin with, the Bland–Altman analysis showed that energy intake estimated with SNAQ had a bias of −329.6 kcal/day in relation to the energy expenditure estimated by DLW. This bias was 213.4 kcal/day smaller than the one estimated with the established tool, 24HR. Therefore, both dietary assessment tools seemed to have a bias in the same direction and very large LoA for energy estimates. Additionally, a correlation analysis showed no significant linear relationship between the energy estimates of both dietary assessment tools with the energy expenditure estimated with DLW.

Furthermore, the difference between the energy intake estimated with SNAQ and the energy expenditure estimated with DLW was significantly different, but the difference lost significance when the p-value was adjusted with the Bonferroni correction. Therefore, it is not clear in the present study population if the difference was due to chance alone or not. Indeed, there is an ambiguity of equivalence between DLW and SNAQ energy estimates at the population level which cannot be investigated with an equivalence test due the absence of a pre-established acceptable equivalence value for boundary setting. However, there was a strong significant difference between energy intake estimated with 24HR and energy expenditure estimated with DLW. Indeed, it has also previously been reported that the 24HR underestimates energy intake when compared to reference methods, such as the DLW technique (7). Based on our findings, it appears that SNAQ exhibits greater overall validity in assessing energy intake in our study population in comparison to 24HR.

The results of our study were partly consistent with previous research that compared other image-based apps for dietary assessment to DLW (10, 11, 43). Martin et al. (n = 13) and Most et al. (n = 23) reported a bias of −270 kcal/day and − 600 kcal/day, respectively, but both did not find a significant difference between the methods (10, 43). Gemming et al. reported a bias of −180 kcal/day compared to a bias of −340 kcal/day of a traditional tool (11) but also a significant difference for estimations of energy intake between the investigated image-based dietary tool and DLW at the group level. Boushey et al. (n = 30) reported a bias of −563 kcal/day and also found a significant difference between methods, but reported a strong linear relationship between the methods (64).

Based on the combination of the results of the Bland–Altman plot, of the paired difference tests and of the Parsons’s correlation coefficient, these authors provided different conclusions. Martin et al. and Gemming et al. both concluded that the dietary assessment tool under investigation showed good relative validity for the target population, despite Gemming found a significant difference from the reference method. Instead, Most et al., with no significant difference between methods but a larger bias, concluded that the image-based app that they investigated did not accurately measure energy intake when compared to DLW. Nevertheless, in these three studies, all authors all recommended caution when using the app for assessing individual energy intake. The determination of an acceptable agreement, however, remains ultimately a clinical decision, as it involves considering the context and purpose of the measurement. While statistics provide quantitative information, they cannot solely answer the question of what constitutes an acceptable clinical validity (65). In contrast to this assessment approach of validity, Boushey et al. declared a “superior position” of the investigated image-based dietary assessment tool in comparison with other assessment tools even when reporting a significant difference from the reference method (DLW) and the limits of agreement (LoA) were wider than the ones we report (64). They supported their conclusion with the significant strength of a linear relationship assessed with a Pearson’s correlation coefficient (R² = 58%, p < 0.0001) and an average underreporting of their women population of 16%. However, although the two methods may have been linearly related to the same direction, the fact that the two methods were significantly different when tested with a paired difference test suggests that they differed systematically.

The over- and underestimations of energy intake were assessed also in our study population. Interestingly, on the group level, the underestimation of daily energy intake with SNAQ in relation to DLW was −12.8%. However, this observation on the overall study population was led by the compensation of under- and overestimations at the individual level. While the former – the underestimation at the group level – might be relevant in the comparison of different target populations, the latter – the observation of individual differences – is of relevance when testing the validity of a method for clinical use. Indeed, any conclusion on clinical validity must consider measurement differences at the individual level. Accordingly, we performed an analysis of individual misreporting using Goldberg cut-off points specific of this study population. Most of the study participants were classified as under-reporters and some as over-reporters. In our study population, only 6 study participants had a plausible energy intake in relation to their basal metabolic rate.

Energy intake constitutes the fundamental basis of any given habitual diet. The observed underestimation of energy intake, a common occurrence in various dietary assessment tools including the SNAQ, can have implications on the assessment of macro- and micronutrient intake. Therefore, if energy intake is underestimated, it can be assumed that the intake of macro- and micronutrients is also underestimated. This limitation might restrict the broader application of SNAQ in addressing research questions beyond energy intake.

Our study represents the first attempt to assess the validity of SNAQ for assessment of energy intake by comparing the estimated energy intake to a reference method for energy expenditure. It is difficult to define if the limits of agreement that we report with the Bland–Altman plot can be considered acceptable for clinical use as component of diagnostics or in nutritional counselling. Indeed, a consensus in this regard has, to our knowledge, never been published (33). Nevertheless, the lower bias of SNAQ compared to 24HR and DLW, and the absence of a significant difference between energy estimates of SNAQ and DLW suggests that such a dietary assessment tool can have an application in research. However, based on the design of our study, it is not possible to distinguish whether the observed underestimation can be attributed to participants’ self-reports in the real-life setting or to the shortcomings of the SNAQ technology in accurately assessing the energy content of the reported food intake. This reflects a limitation and further investigations are required to validate the use of SNAQ for clinical and research purposes.

The application of SNAQ for the assessment of dietary habits assumes that the period of recording is representative for the habitual diet of the study population. This, however, might have not been the case for our study population. The intraclass correlation coefficient (ICC) of the study population for daily energy intake was moderate (0.274). This is an index of the within-subject variability during the week of recording. However, there is no universally defined acceptable value for the ICC of daily energy intake in adult women with normal body weight, as it can vary depending on the specific context and research question. Furthermore, this coefficient is generally not reported in other validity studies for new dietary assessment tools. Therefore, also a comparison of this coefficient with other similar studies is not possible. Nevertheless, this result together with the individual Goldberg classification of misreporting, suggested that some days of the study period might have been more representative than others for our study population. However, this hypothesis cannot be tested with a two-point protocol of the DLW technique. An eventual application of SNAQ for the assessment of energy intake in a clinical setting should consider the results of this study with caution and a further investigation with a multiple-point protocol of the DLW technique should be performed.

The dilution space ratio of DLW in the TBW of the study participants was in a range of 0.992–1.036 (Supplementary Table S15), with only one value below the significant threshold of 1.000 and no value above the significant threshold of 1.700. Similarly, the elimination rate ratio of DLW was in a range of 1.193–1.366 (Supplementary Table S16), with no value outside the acceptable range of 1.100–1.700. Therefore, estimations of energy expenditure with DLW were considered reliable for this study. Any measurement error between SNAQ and DLW was considered to be led by wrongful estimations of energy intake with SNAQ. However, despite providing a clear protocol to study participants regarding activities that could potentially deplete isotopes, such as limiting sports activities and sauna usage, we cannot guarantee that these activities were completely restricted, as the energy expenditure was assessed in free-living conditions.

The variability of the measurement differences with SNAQ – intended as energy intake estimated with SNAQ minus the energy expenditure estimated with DLW – could not be explained neither with changes in body weight during the study week (Figures 7B,D) nor with the estimated energy expenditure (Figures 7C,E). However, there was a significant linear relationship between changes in body weight during the study week and measurement differences of 24HR. This might suggest that with a 24HR study participants report less dietary intake than what they daily consume in the status of energy balance.

According to the assumption of energy balance, the energy intake during the study week should have equaled the energy expenditure (23). Any positive or negative difference between energy intake and energy expenditure should have resulted in an increase or decrease of the body’s energy stores, respectively. Consequently, if the changes in body weight could not explain the variability of the measurement differences with SNAQ, it might be that there might have been other reasons for these measurement differences. Nevertheless, body composition, in terms of FFM and FM, would have been a better indicator of the energy stores of the body instead of the sole body weight. Indeed, most of the energy of the body is stored in the FM (9.5 kcal/g of FM) (66–69). On one hand, the body weight also accounts for fluctuations of TBW, which contains virtually no energy. On the other hand, if the energy imbalance is the underlying cause of the measurement differences, changes in FM and FFM can be utilized to accurately determine the energy imbalance, resulting in an equal value to the individual measurement difference (68, 70).

This last approach, called the energy expenditure/balance (EB) method, involves integrating measurements of energy expenditure and changes in energy stores to estimate energy intake (22). The changes in body’s energy stores (ΔES) can be calculated by assessing the changes in FFM and FM between measurements (71). This is achieved by multiplying the changes in FFM and FM by their respective energy density coefficients (9.5 kcal/g for FM and 1.1 kcal/g for FFM). The resulting values are then divided by the number of days between the measurements to obtain the daily energy change in kcal/day.

Body composition is an important factor of energy balance. In this study, body composition was assessed with both bioelectrical impedance and isotope dilution. The bias of agreement of the two methods for TBW, FFM, and FM was comparable to what is reported on the literature (Supplementary Table S17) (72, 73). Therefore, TBW assessed using bioimpedance analysis could potentially serve as a cost-effective substitute of isotope dilution for measuring the energy stores of the body, with an acceptable decrease in precision.

Our findings regarding the secondary outcome are consistent with previous research that compared other image-based apps for dietary assessment to 24HR (74, 75). These studies showed no significant difference for estimations of energy intake between the apps and 24HR. The limits of agreement ranged from −549 kcal to 836 kcal and from −807 kcal to 775 kcal, respectively. In both studies, the authors concluded that the image-based app under investigation showed good relative validity when compared to 24HR for assessing energy intake at the population level. However, they recommended caution when using the app for assessing individual energy intake.

Comparable results have also been reported regarding the estimation of macronutrient intake when comparing image-based apps to 24HR (74, 75). Although there were no significant differences in macronutrient intake estimations between the methods, LoA for all macronutrients were found to be wide. These findings were confirmed in 2020 in a meta-analysis (76). It is worth noting that in all studies reviewed in the meta-analysis, the energy and macronutrient intake from images recorded in the apps was estimated by trained dietitians, whereas in our present study, we utilized the artificial intelligence feature of the app.

Further, we observed that the app tended to overestimate energy intake and all macronutrients considered, except for saturated fats, in comparison to a well-established dietary assessment tool, namely the 24HR. This does not necessarily indicate a lower validity of the app compared to the 24HR. Indeed, it has previously been reported that, when compared to reference methods, such as the DLW technique, the 24HR underestimates energy intake (7). Accordingly, it is possible that the overall validity of the app would be higher than that of 24HR when compared to the DLW technique. Indeed, as reported in the discussion, the energy intake estimated with SNAQ was not significantly different from the energy expenditure estimated with DLW, while the energy intake estimated with 24HR was significantly different from the energy expenditure estimated with DLW.

Other reasons for the increase of intrinsic measurement differences might be (1) a low accuracy of the technology in recognizing the food items or in estimating their portions, and (2) the increase of events of non-compliance of study participants (20). An earlier study previously investigated the use of SNAQ for estimation of energy and macronutrient intake. However, the research question of the mentioned study focused solely on accuracy of estimations in a laboratory setting. Our study expands on this previous work by assessing the validity of SNAQ for the estimation of energy and macronutrient intake in a real-life setting. Although highly accurate volume estimation, high segmentation performance, and low processing time were previously reported (20), in a real-life setting it is important to consider the potential influence of a greater variety of food items and eating occasions, as well as the usability of the app by non-trained users. These could affect the validity of the app in real-life scenarios.

Events of non-compliance, defined by other authors also as misreporting (77–79), are here intended as either non-recording of a dietary item or wrongful entry of food items and their portions. However, explanation on possible sources of measurement differences was out of the scope of this study. The semi-customized prompts were set within the app to overcome a potential measurement error of energy intake from non-recording due forgetfulness. This strategy has previously been reported by other authors to decrease the bias of agreement between an image-based app and DLW (43). However, due to the individual variation of snacking times, study participants did not receive notifications to record snacks.

Furthermore, the awareness of participating in a clinical study might have caused for the study participants intentional and unintentional changes of dietary behavior during the study period. Consequently, these changes might have led to an energy imbalance and have been detected as measurement differences of SNAQ. These differences are widely discussed when applying traditional dietary assessment tools such as food records and 24HR (80). However, our results did not support this possibility, because the hypothesis of energy imbalance, as already explained, could not be tested with the body weight alone.

4.1. Strengths

A notable strength of this study was the comparison of the energy intake estimated with SNAQ to the energy expenditure estimated with the DLW technique. This technique is considered a reference method for measures of total energy expenditure in conditions of energy balance without interfering with the routine lives of the study participants. Furthermore, the isotope dilution of the DLW was used in this study to estimate the body composition (TBW, FFM, and FM), and physical activity level (PAL) of the study participants (81, 82). Given the high cost of DLW and the considerable methodological and analytical requirements associated with it, many researchers face limitations in using this method. As a result, our study offers a valuable contribution to the field of dietary assessment by assessing the validity of a new image-based dietary assessment tool in relation to established biomarkers of energy expenditure.

Our study is the first to investigate image-based dietary assessment by artificial intelligence through computer vision. Most published studies using apps with artificial intelligence features are based on image-assisted dietary assessment methods (17, 21). So far, there has been only one published study that evaluated the validity of image-based food recognition through artificial intelligence in free-living conditions (21). Furthermore, one significant advantage of using the SNAQ app was its integration with the Swiss Food Composition Database (57), which might have facilitated study participants in recalling their dietary intake with comprehensive database specific to the local cuisine.

Our study benefits from the recruitment of a homogeneous study population of adult women with normal body weight. By employing this approach, the potential influence of confounding variables that might affect the outcomes is effectively minimized. This enhances the internal validity of the study, ultimately increasing the reliability of the collected data. However, it is possible that this approach limits the external validity of the findings, constraining the applicability of the findings to other source populations. To enhance the overall generalizability of the results, future research works should consider larger sample sizes and broader inclusion criteria, such as obesity, to obtain more representative results.

4.2. Limitations

A noteworthy limitation of the research design employed in these studies is the absence of measures to monitor compliance among participants with regard to the meal recordings. For instance, adequate lighting, type of plate used to place the food items, and distance of the smartphone to the meals was not supervised, as the participants were in a free-living condition. Moreover, the evaluation of the feasibility, user-friendliness and participant satisfaction with SNAQ was not carried out, which could potentially lead to inaccurate monitoring of dietary intake owing to the app’s laborious recording process.

The use of the DLW technique as a reference method for energy expenditure is based on the assumption of energy balance during the study period. However, a seven-day protocol might have been too short to capture significant changes in the metabolism of the study participants due to adaptation to the energy intake and energy expenditure influenced by the participation in the study. A period of 14 days might still be a reasonable solution between feasibility of the study and quality of data (22). Indeed, the accuracy of the DLW technique is influenced by the dosage of labeled water administered and the duration of the metabolic period. In adults, the optimal metabolic period, which corresponds to 1.5 half-lives of the tracers in the body, typically ranges from 10 to 14 days (22). This duration is critical for achieving precise measurements with DLW.

The use of 24HR as a method for comparison also poses several limitations. The self-reported nature makes this method prone to underreporting (1). Furthermore, intake data from 24HR was only collected for 1 day, whereas data collected from the app represented the average intake over 7 days. It is therefore possible, that the 24HR depicted an unusual day, regarding eating behavior (particularly high or low intake). Conducting multiple 24HRs for the same participant and on week-, as well as weekend days could provide more accurate information on the eating behavior of the participants and reduce random error. However, there is confusion in the literature whether one or the mean of several 24HR should be used, as systematic bias of the method might remain (83).

Physical activity during the study period was not measured. However, the general assumption that higher levels of physical activity result in increased TDEE (24, 25), leading to a negative energy balance and subsequent weight loss, has been rejected (26). Indeed, when individuals engage in high levels of physical activity, the TDEE does not continue to increase proportionally. Instead, the body undergoes adaptations to maintain energy balance. These adaptations occur through both behavioral changes, such as reducing non-exercise physical activity, and metabolic adjustments, such as changes in metabolic efficiency. As a result, the increase in TDEE is limited. The body strives to maintain energy balance rather than experiencing a linear increase in energy expenditure with higher physical activity levels. Nevertheless, monitoring of physical activity during the study period might be important to explain differences between energy expenditures and energy intake, and therefore changes in energy stores of the body. This monitoring allows for a more comprehensive assessment of energy balance and provides insights into the factors influencing energy expenditure and energy intake.

To assess the validity of a new dietary assessment tool, various statistical methods are commonly employed. These methods help determine the accuracy and reliability of the tool by comparing it to an established reference method. However, there is currently no consensus on which statistical methods should be used for assessing validity. Consequently, different approaches can be used, making it challenging to compare and interpret the results across studies.

To address this issue, our suggestion is to establish a benchmark of statistical methods that should be included in research investigating the validity of dietary assessment tools. The idea behind this benchmark is to provide a standardized set of statistical methods and descriptive calculations that researchers should report and use consistently, allowing for better comparison and interpretation of results across studies.

In our proposal, we recommend including nine specific statistical and descriptive steps in the benchmark for assessing validity: (1) Bland–Altman plots for assessment of agreement, (2) Pearson’s correlation coefficient of the strength of relationship between methods, (3) absolute and percentage measurement differences between the methods for assessment of relative accuracy, (4) over- and underestimations of the new dietary assessment tool compared to the reference method for assessment of misreporting, (5) calculation of Goldberg cut-off points for the investigated study population for identification of individual over- and under-reporters, (6) Pearson’s correlation coefficient of the strength of relationship between body composition (FM and FFM) and the energy intake estimated with the new dietary assessment tool for assessment of the homeostatic control, (7) Pearson’s correlation coefficient of the strength of relationship between changes of energy stores of the body and the measurement differences between the methods for assessment of energy balance, (8) Pearson’s correlation coefficient of the strength of relationship between energy expenditures and the measurement differences between the methods for assessment of compliance, and (9) intraclass correlation coefficient for assessment of diet stability. These methods likely have been identified as suitable and informative for evaluating the accuracy and reliability of dietary assessment tools. An overview of the benchmark is presented in Figure 8, illustrating the different statistical methods and descriptive calculations, and explaining how to interpret the single values generated by each method.

FIGURE 8

Figure 8. Overview of the proposed benchmark for standardized reporting of nine specific statistical and descriptive steps for assessment of validity of a new dietary assessment tool. The different statistical methods and descriptive calculations are illustrated in the table in the middle of the figure. The interpretation of the single values generated by each method and calculation is explained in the orange-margined text boxes.

Lastly, the assessment of diet is a complex process that needs the utilization of precise and consistent different methods. It is important to notice that clinicians and researchers should not depend exclusively on new technologies to address all the challenges related to dietary assessment when attempting to evaluate diet. The selection of an appropriate assessment method is contingent upon various factors, such as the research question, study design, characteristics of the sample, and sample size, among others. Dietary assessment tools which directly assess food intake should be implemented in order to understand the direct connection between diet and health. Despite the technological advancements, the intrinsic individual bias that is linked to self-reported dietary intake cannot be eliminated by such novel tools employed to evaluate dietary intake. Consequently, it is essential to incorporate new techniques that enable a direct measurement of dietary intake into the field of nutrition research, such as residential facilities (84, 85), the Universal Eating Monitor (86) or the drinkometer device (87).

5. Conclusion

The selected image-based food-recognition app, SNAQ, provided an estimate of energy intake of the study population which was not significantly different from the energy expenditure estimated with the DLW technique. Further, there was no significant difference between SNAQ and 24HR for estimates of energy and macronutrient intake.

Despite SNAQ showing slightly better performance in relation to DLW compared to 24HR, it is not possible to conclude that SNAQ has a higher validity than 24HR for assessing energy intake in adult women with normal weight. Future research work is encouraged to further explore and address the limitations identified in this study. Energy intake is underestimated, the limits of agreement are wide, and, even if there is no significant difference between SNAQ and DLW at the group level, it must be investigated if the measurement difference between the methods is clinically relevant. Furthermore, the agreement of the app with the DLW technique remains to be assessed in other target populations which might profit by the implementation of this dietary assessment tool.

Finally, it is necessary to acknowledge that SNAQ still lacks accuracy in estimating energy intake and further technologies should be explored to assess food intake in humans.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Ethics Commission of Zurich, Canton of Zurich, Switzerland. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

MS: Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. DA: Conceptualization, Investigation, Writing – original draft, Writing – review & editing. FH: Data curation, Writing – original draft. PH: Conceptualization, Methodology, Writing – review & editing. HM: Formal Analysis, Methodology, Writing – review & editing. AT: Investigation, Methodology, Writing – review & editing. RS: Funding acquisition, Supervision, Writing – review & editing. PG: Investigation, Methodology, Resources, Writing – review & editing. AS: Conceptualization, Methodology, Supervision, Writing – review & editing. DG: Conceptualization, Writing – review & editing. MB: Conceptualization, Funding acquisition, Supervision, Writing – review & editing.

Funding

The study was funded by the Swiss National Science Foundation [grant number 32003B_182309/1] and the assistant-professorship research grant awarded by the University of Zurich to MB. The PhD salary of the co-author MS was covered with a research grant awarded by the Uniscientia Foundation to the co-author RS.

Acknowledgments

We would like to express our gratitude to SNAQ AG for their support during our collaboration.

Conflict of interest

SNAQ AG developed the customized version of the app, SNAQ Study, in exchange for financial compensation. RS is employed by DSM Nutritional Products. AS is a member of the scientific advisory board of Gila Therapeutics.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2023.1255499/full#supplementary-material

References

1. Ravelli, MN, and Schoeller, DA. Traditional self-reported dietary instruments are prone to inaccuracies and new approaches are needed. Front Nutr. (2020) 7:90. doi: 10.3389/fnut.2020.00090

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Elmadfa, I, and Meyer, AL. Developing suitable methods of nutritional status assessment: a continuous challenge. Adv Nutr. (2014) 5:590S–8S. doi: 10.3945/an.113.005330

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Cade, JE. Measuring diet in the 21st century: use of new technologies. Proc Nutr Soc. (2017) 76:276–82. doi: 10.1017/S0029665116002883

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Illner, A, Freisling, H, Boeing, H, Huybrechts, I, Crispim, SP, and Slimani, N. Review and evaluation of innovative technologies for measuring diet in nutritional epidemiology. Int J Epidemiol. (2012) 41:1187–203. doi: 10.1093/ije/dys105

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Schoeller, DA, and Westerterp, M. Advances in the assessment of dietary intake. Boca Raton, FL: CRC Press (2017).

Google Scholar

6. Bailey, RL. Overview of dietary assessment methods for measuring intakes of foods, beverages, and dietary supplements in research studies. Curr Opin Biotechnol. (2021) 70:91–6. doi: 10.1016/j.copbio.2021.02.007

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Burrows, TL, Ho, YY, Rollo, ME, and Collins, CE. Validity of dietary assessment methods when compared to the method of doubly labeled water: a systematic review in adults. Front Endocrinol. (2019) 10:850. doi: 10.3389/fendo.2019.00850

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Magarey, A, Watson, J, Golley, RK, Burrows, T, Sutherland, R, McNaughton, SA, et al. Assessing dietary intake in children and adolescents: considerations and recommendations for obesity research. Int J Pediatr Obes. (2011) 6:2–11. doi: 10.3109/17477161003728469

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Ordabayeva, N, and Chandon, P. In the eye of the beholder: visual biases in package and portion size perceptions. Appetite. (2016) 103:450–7. doi: 10.1016/j.appet.2015.10.014

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Most, J, Vallo, PM, Altazan, AD, Gilmore, LA, Sutton, EF, Cain, LE, et al. Food photography is not an accurate measure of energy intake in obese, pregnant women. J Nutr. (2018) 148:658–63. doi: 10.1093/jn/nxy009

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Gemming, L, Rush, E, Maddison, R, Doherty, A, Gant, N, Utter, J, et al. Wearable cameras can reduce dietary under-reporting: doubly labelled water validation of a camera-assisted 24 h recall. Br J Nutr. (2015) 113:284–91. doi: 10.1017/S0007114514003602

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Naska, A, Lagiou, A, and Lagiou, P. Dietary assessment methods in epidemiological research: current state of the art and future prospects. F1000Research. (2017) 6:926. doi: 10.12688/f1000research.10703.1

CrossRef Full Text | Google Scholar

13. König, LM, Attig, C, Franke, T, and Renner, B. Barriers to and facilitators for using nutrition apps: systematic review and conceptual framework. JMIR Mhealth Uhealth. (2021) 9:e20037. doi: 10.2196/20037

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Zhao, J, Li, Z, Gao, Q, Zhao, H, Chen, S, Huang, L, et al. A review of statistical methods for dietary pattern analysis. Nutr J. (2021) 20:1–18. doi: 10.1186/s12937-021-00692-7

CrossRef Full Text | Google Scholar

15. Penn, L, Boeing, H, Boushey, CJ, Dragsted, LO, Kaput, J, Scalbert, A, et al. Assessment of dietary intake: NuGO symposium report. Genes Nutr. (2010) 5:205–13. doi: 10.1007/s12263-010-0175-9

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Gemming, L, Utter, J, and Mhurchu, CN. Image-assisted dietary assessment: a systematic review of the evidence. J Acad Nutr Diet. (2015) 115:64–77. doi: 10.1016/j.jand.2014.09.015

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Boushey, C, Spoden, M, Zhu, FM, Delp, EJ, and Kerr, DA. New mobile methods for dietary assessment: review of image-assisted and image-based dietary assessment methods. Proc Nutr Soc. (2017) 76:283–94. doi: 10.1017/S0029665116002913

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Amugongo, LM, Kriebitz, A, Boch, A, and Lütge, C. Mobile computer vision-based applications for food recognition and volume and calorific estimation: a systematic review. Healthcare. (2023) 11:59. doi: 10.3390/healthcare11010059

CrossRef Full Text | Google Scholar

19. Zhu, F, Bosch, M, Woo, I, Kim, S, Boushey, CJ, Ebert, DS, et al. The use of mobile devices in aiding dietary assessment and evaluation. IEEE J Sel Top Signal Process. (2010) 4:756–66. doi: 10.1109/JSTSP.2010.2051471

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Herzig, D, Nakas, CT, Stalder, J, Kosinski, C, Laesser, C, Dehais, J, et al. Volumetric food quantification using computer vision on a depth-sensing smartphone: preclinical study. JMIR Mhealth Uhealth. (2020) 8:e15294. doi: 10.2196/15294

PubMed Abstract | CrossRef Full Text | Google Scholar

21. König, LM, van Emmenis, M, Nurmi, J, Kassavou, A, and Sutton, S. Characteristics of smartphone-based dietary assessment tools: a systematic review. Health Psychol Rev. (2022) 16:526–50. doi: 10.1080/17437199.2021.2016066

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Ravelli, MN, and Schoeller, DA. An objective measure of energy intake using the principle of energy balance. Int J Obes. (2021) 45:725–32. doi: 10.1038/s41366-021-00738-0

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Hopkins, M, Casanova, N, Finlayson, G, Stubbs, RJ, and Blundell, JE. Fat-free mass and Total daily energy expenditure estimated using doubly labeled water predict energy intake in a large sample of community-dwelling older adults. J Nutr. (2022) 152:971–80. doi: 10.1093/jn/nxab434

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Weise, CM, Hohenadel, MG, Krakoff, J, and Votruba, SB. Body composition and energy expenditure predict ad-libitum food and macronutrient intake in humans. Int J Obes. (2014) 38:243–51. doi: 10.1038/ijo.2013.85

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Blundell, JE, Caudwell, P, Gibbons, C, Hopkins, M, Näslund, E, King, NA, et al. Body composition and appetite: fat-free mass (but not fat mass or BMI) is positively associated with self-determined meal size and daily energy intake in humans. Br J Nutr. (2012) 107:445–9. doi: 10.1017/S0007114511003138

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Caudwell, P, Finlayson, G, Gibbons, C, Hopkins, M, King, N, Näslund, E, et al. Resting metabolic rate is associated with hunger, self-determined meal size, and daily energy intake and may represent a marker for appetite. Am J Clin Nutr. (2013) 97:7–14. doi: 10.3945/ajcn.111.029975

PubMed Abstract | CrossRef Full Text | Google Scholar

27. McNeil, J, Lamothe, G, Cameron, JD, Riou, MÈ, Cadieux, S, Lafrenière, J, et al. Investigating predictors of eating: is resting metabolic rate really the strongest proxy of energy intake? Am J Clin Nutr. (2017) 106:1206–12. doi: 10.3945/ajcn.117.153718

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Grannell, A, al-Najim, W, Mangan, A, Kapoor, N, Martin, WP, Murphy, JC, et al. Fat free mass is positively associated with hunger and energy intake at extremes of obesity. Appetite. (2019) 143:104444. doi: 10.1016/j.appet.2019.104444

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Piaggi, P, Thearle, MS, Krakoff, J, and Votruba, SB. Higher daily energy expenditure and respiratory quotient, rather than fat-free mass, independently determine greater ad libitum overeating. J Clin Endocrinol Metabol. (2015) 100:3011–20. doi: 10.1210/jc.2015-2164

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Blundell, JE, Caudwell, P, Gibbons, C, Hopkins, M, Naslund, E, King, N, et al. Role of resting metabolic rate and energy expenditure in hunger and appetite control: a new formulation. Dis Model Mech. (2012) 5:608–13. doi: 10.1242/dmm.009837

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Blundell, JE, Gibbons, C, Beaulieu, K, Casanova, N, Duarte, C, Finlayson, G, et al. The drive to eat in homo sapiens: energy expenditure drives energy intake. Physiol Behav. (2020) 219:112846. doi: 10.1016/j.physbeh.2020.112846

CrossRef Full Text | Google Scholar

32. Hopkins, M, and Blundell, JE. Energy balance, body composition, sedentariness and appetite regulation: pathways to obesity. Clin Sci. (2016) 130:1615–28. doi: 10.1042/CS20160006

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Lombard, MJ, Steyn, NP, Charlton, KE, and Senekal, M. Application and interpretation of multiple statistical tests to evaluate validity of dietary intake assessment methods. Nutr J. (2015) 14:1–11. doi: 10.1186/s12937-015-0027-y

CrossRef Full Text | Google Scholar

34. Bellach, B. Remarks on the use of Pearson's correlation coefficient and other association measures in assessing validity and reliability of dietary assessment methods. Eur J Clin Nutr. (1993) 47 Suppl 2:S42–5.

PubMed Abstract | Google Scholar

35. Black, AE. Critical evaluation of energy intake using the Goldberg cut-off for energy intake: basal metabolic rate. A practical guide to its calculation, use and limitations. Int J Obes. (2000) 24:1119–30. doi: 10.1038/sj.ijo.0801376

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Lachat, C, Hawwash, D, Ocké, MC, Berg, C, Forsum, E, Hörnell, A, et al. Strengthening the reporting of observational studies in epidemiology–nutritional epidemiology (STROBE-nut): an extension of the STROBE statement. Nutr Bull. (2016) 41:240–51. doi: 10.1111/nbu.12217

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Ostendorf, DM, Caldwell, AE, Creasy, SA, Pan, Z, Lyden, K, Bergouignan, A, et al. Physical activity energy expenditure and total daily energy expenditure in successful weight loss maintainers. Obesity. (2019) 27:496–504. doi: 10.1002/oby.22373

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Dahle, JH, Ostendorf, DM, Zaman, A, Pan, Z, Melanson, EL, and Catenacci, VA. Underreporting of energy intake in weight loss maintainers. Am J Clin Nutr. (2021) 114:257–66. doi: 10.1093/ajcn/nqab012

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Fassini, PG, das, SK, Pfrimer, K, Suen, VMM, Sérgio Marchini, J, and Ferriolli, E. Energy intake in short bowel syndrome: assessment by 24-h dietary recalls compared with the doubly labelled water method. Br J Nutr. (2018) 119:196–201. doi: 10.1017/S0007114517003373

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Medin, AC, Hansen, BH, Astrup, H, Ekelund, U, and Frost Andersen, L. Validation of energy intake from a web-based food recall for children and adolescents. PLoS One. (2017) 12:e0178921. doi: 10.1371/journal.pone.0178921

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Shook, RP, Yeh, HW, Welk, GJ, Davis, AM, and Ries, D. Commercial devices provide estimates of energy balance with varying degrees of validity in free-living adults. J Nutr. (2022) 152:630–8. doi: 10.1093/jn/nxab317

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Ptomey, LT, Willis, EA, Honas, JJ, Mayo, MS, Washburn, RA, Herrmann, SD, et al. Validity of energy intake estimated by digital photography plus recall in overweight and obese young adults. J Acad Nutr Diet. (2015) 115:1392–9. doi: 10.1016/j.jand.2015.05.006

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Martin, CK, Correa, JB, Han, H, Allen, HR, Rood, JC, Champagne, CM, et al. Validity of the remote food photography method (RFPM) for estimating energy and nutrient intake in near real-time. Obesity. (2012) 20:891–9. doi: 10.1038/oby.2011.344

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Kaczkowski, CH, Jones, PJH, Feng, J, and Bayley, HS. Four-day multimedia diet records underestimate energy needs in middle-aged and elderly women as determined by doubly-labeled water. J Nutr. (2000) 130:802–5. doi: 10.1093/jn/130.4.802

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Leech, RM, Worsley, A, Timperio, A, and McNaughton, SA. Characterizing eating patterns: a comparison of eating occasion definitions. Am J Clin Nutr. (2015) 102:1229–37. doi: 10.3945/ajcn.115.114660

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Cole, T, and Coward, W. Precision and accuracy of doubly labeled water energy expenditure by multipoint and two-point methods. Am J Physiol. (1992) 263:E965–73. doi: 10.1152/ajpendo.1992.263.5.E965

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Racette, SB, Schoeller, DA, Luke, AH, Shay, K, Hnilicka, J, and Kushner, RF. Relative dilution spaces of 2H-and 18O-labeled water in humans. Am J Physiol. (1994) 267:E585–90. doi: 10.1152/ajpendo.1994.267.4.E585

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Wang, Z, Deurenberg, P, Wang, W, Pietrobelli, A, Baumgartner, RN, and Heymsfield, SB. Hydration of fat-free body mass: review and critique of a classic body-composition constant. Am J Clin Nutr. (1999) 69:833–41. doi: 10.1093/ajcn/69.5.833

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Mifflin, MD, St Jeor, ST, Hill, LA, Scott, BJ, Daugherty, SA, and Koh, YO. A new predictive equation for resting energy expenditure in healthy individuals. Am J Clin Nutr. (1990) 51:241–7. doi: 10.1093/ajcn/51.2.241

CrossRef Full Text | Google Scholar

50. Skouroliakou, M, Giannopoulou, I, Kostara, C, and Vasilopoulou, M. Comparison of predictive equations for resting metabolic rate in obese psychiatric patients taking olanzapine. Nutrition. (2009) 25:188–93. doi: 10.1016/j.nut.2008.08.006

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Subar, AF, Kipnis, V, Troiano, RP, Midthune, D, Schoeller, DA, Bingham, S, et al. Using intake biomarkers to evaluate the extent of dietary misreporting in a large sample of adults: the OPEN study. Am J Epidemiol. (2003) 158:1–13. doi: 10.1093/aje/kwg092

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Bhutani, S, Racine, N, Shriver, T, and Schoeller, DA. Special considerations for measuring energy expenditure with doubly labeled water under atypical conditions. J Obes Weight Loss Ther. (2015) s5. doi: 10.4172/2165-7904.S5-002

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Wang, X, Kong, D, van Dijk, G, and Meijer, HAJ. First use of triply labelled water analysis for energy expenditure measurements in mice. Sci Rep. (2022) 12:6351. doi: 10.1038/s41598-022-10377-8

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Faghihi, V, Verstappen-Dumoulin, BMAA, Jansen, HG, Dijk, G, Aerts-Bijma, AT, Kerstel, ERT, et al. A new high-quality set of singly (2H) and doubly (2H and 18O) stable isotope labeled reference waters for biomedical and other isotope-labeled research. Rapid Commun Mass Spectrom. (2015) 29:311–21. doi: 10.1002/rcm.7108

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Speakman, JR, Yamada, Y, Sagayama, H, Berman, ESF, Ainslie, PN, Andersen, LF, et al. A standard calculation methodology for human doubly labeled water studies. Cell Rep Med. (2021) 2:100203. doi: 10.1016/j.xcrm.2021.100203

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Weir, JDV. New methods for calculating metabolic rate with special reference to protein metabolism. J Physiol. (1949) 109:1–9. doi: 10.1113/jphysiol.1949.sp004363

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Heise, T, Meiffren, G, Alluis, B, Seroussi, C, Ranson, A, Arrubla, J, et al. BioChaperone Lispro versus faster aspart and insulin aspart in patients with type 1 diabetes using continuous subcutaneous insulin infusion: a randomized euglycemic clamp study. Diabetes Obes Metab. (2019) 21:1066–70. doi: 10.1111/dom.13621

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Goldberg, G, Black, AE, Jebb, SA, Cole, TJ, Murgatroyd, PR, Coward, WA, et al. Critical evaluation of energy intake data using fundamental principles of energy physiology: 1. Derivation of cut-off limits to identify under-recording. Eur J Clin Nutr. (1991) 45:569–81.

PubMed Abstract | Google Scholar

59. Schoeller, DA, and Hnilicka, JM. Reliability of the doubly labeled water method for the measurement of total daily energy expenditure in free-living subjects. J Nutr. (1996) 126:348S–54S.

PubMed Abstract | Google Scholar

60. (Producer) R.C.T., R: A language and environment for statistical computing (2020).

Google Scholar

61. (Producer) R.T., RStudio: Integrated development environment for R (2020).

Google Scholar

62. Martin Bland, J, and Altman, D. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. (1986) 327:307–10. doi: 10.1016/S0140-6736(86)90837-8

CrossRef Full Text | Google Scholar

63. Flegal, KM, Graubard, B, and Ioannidis, JPA. Use and reporting of Bland-Altman analyses in studies of self-reported versus measured weight and height. Int J Obes. (2020) 44:1311–8. doi: 10.1038/s41366-019-0499-5

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Boushey, CJ, Spoden, M, Delp, E, Zhu, F, Bosch, M, Ahmad, Z, et al. Reported energy intake accuracy compared to doubly labeled water and usability of the mobile food record among community dwelling adults. Nutrients. (2017) 9:312. doi: 10.3390/nu9030312

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Bland, JM, and Altman, DG. Measuring agreement in method comparison studies. Stat Methods Med Res. (1999) 8:135–60. doi: 10.1177/096228029900800204

CrossRef Full Text | Google Scholar

66. Hall, KD. Body fat and fat-free mass inter-relationships: Forbes's theory revisited. Br J Nutr. (2007) 97:1059–63. doi: 10.1017/S0007114507691946

PubMed Abstract | CrossRef Full Text | Google Scholar

67. Hall, KD. What is the required energy deficit per unit weight loss? Int J Obes. (2008) 32:573–6. doi: 10.1038/sj.ijo.0803720

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Hall, KD, Sacks, G, Chandramohan, D, Chow, CC, Wang, YC, Gortmaker, SL, et al. Quantification of the effect of energy imbalance on bodyweight. Lancet. (2011) 378:826–37. doi: 10.1016/S0140-6736(11)60812-X

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Hall, KD, Heymsfield, SB, Kemnitz, JW, Klein, S, Schoeller, DA, and Speakman, JR. Energy balance and its components: implications for body weight regulation. Am J Clin Nutr. (2012) 95:989–94. doi: 10.3945/ajcn.112.036350

PubMed Abstract | CrossRef Full Text | Google Scholar

70. Hall, K.D. Research behind the body weight planner. (2011) Available at: https://www.niddk.nih.gov/research-funding/at-niddk/labs-branches/laboratory-biological-modeling/integrative-physiology-section/research/body-weight-planner (Accessed June 12, 2023).

Google Scholar

71. Thomas, DM, Martin, CK, Heymsfield, S, Redman, LM, Schoeller, DA, and Levine, JA. A simple model predicting individual weight change in humans. J Biol Dyn. (2011) 5:579–99. doi: 10.1080/17513758.2010.508541

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Achamrah, N, Colange, G, Delay, J, Rimbert, A, Folope, V, Petit, A, et al. Comparison of body composition assessment by DXA and BIA according to the body mass index: a retrospective study on 3655 measures. PLoS One. (2018) 13:e0200465. doi: 10.1371/journal.pone.0200465

PubMed Abstract | CrossRef Full Text | Google Scholar

73. Lee, SY, and Gallagher, D. Assessment methods in human body composition. Curr Opin Clin Nutr Metab Care. (2008) 11:566–72. doi: 10.1097/MCO.0b013e32830b5f23

PubMed Abstract | CrossRef Full Text | Google Scholar

74. Ashman, AM, Collins, C, Brown, L, Rae, K, and Rollo, M. Validation of a smartphone image-based dietary assessment method for pregnant women. Nutrients. (2017) 9:73. doi: 10.3390/nu9010073

PubMed Abstract | CrossRef Full Text | Google Scholar

75. Carter, MC, Burley, VJ, Nykjaer, C, and Cade, JE. ‘My meal mate’ (MMM): validation of the diet measures captured on a smartphone application to facilitate weight loss. Br J Nutr. (2013) 109:539–46. doi: 10.1017/S0007114512001353

PubMed Abstract | CrossRef Full Text | Google Scholar

76. Ho, DKN, Tseng, SH, Wu, MC, Shih, CK, Atika, AP, Chen, YC, et al. Validity of image-based dietary assessment methods: a systematic review and meta-analysis. Clin Nutr. (2020) 39:2945–59. doi: 10.1016/j.clnu.2020.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

77. Murakami, K, and Livingstone, MBE. Prevalence and characteristics of misreporting of energy intake in US adults: NHANES 2003–2012. Br J Nutr. (2015) 114:1294–303. doi: 10.1017/S0007114515002706

PubMed Abstract | CrossRef Full Text | Google Scholar

78. Leech, R, Worsley, A, Timperio, A, and McNaughton, SA. The role of energy intake and energy misreporting in the associations between eating patterns and adiposity. Eur J Clin Nutr. (2018) 72:142–7. doi: 10.1038/ejcn.2017.90

PubMed Abstract | CrossRef Full Text | Google Scholar

79. Poslusna, K, Ruprich, J, de Vries, JHM, Jakubikova, M, and van't Veer, P. Misreporting of energy and micronutrient intake estimated by food records and 24 hour recalls, control and adjustment methods in practice. Br J Nutr. (2009) 101:S73–85. doi: 10.1017/S0007114509990602

PubMed Abstract | CrossRef Full Text | Google Scholar

80. Maurer, J, Taren, DL, Teixeira, PJ, Thomson, CA, Lohman, TG, Going, SB, et al. The psychosocial and behavioral characteristics related to energy misreporting. Nutr Rev. (2006) 64:53–66. doi: 10.1111/j.1753-4887.2006.tb00188.x

PubMed Abstract | CrossRef Full Text | Google Scholar

81. Schoeller, DA. Measurement of energy expenditure in free-living humans by using doubly labeled water. J Nutr. (1988) 118:1278–89. doi: 10.1093/jn/118.11.1278

CrossRef Full Text | Google Scholar

82. Freedman, LS, Commins, JM, Moler, JE, Arab, L, Baer, DJ, Kipnis, V, et al. Pooled results from 5 validation studies of dietary self-report instruments using recovery biomarkers for energy and protein intake. Am J Epidemiol. (2014) 180:172–88. doi: 10.1093/aje/kwu116

PubMed Abstract | CrossRef Full Text | Google Scholar

83. Freedman, LS, Commins, JM, Willett, W, Tinker, LF, Spiegelman, D, Rhodes, D, et al. Evaluation of the 24-hour recall as a reference instrument for calibrating other self-report instruments in nutritional cohort studies: evidence from the validation studies pooling project. Am J Epidemiol. (2017) 186:73–82. doi: 10.1093/aje/kwx039

PubMed Abstract | CrossRef Full Text | Google Scholar

84. Redpath, T, Naseer, F, Price, RK, Boyd, A, Martin, M, le Roux, CW, et al. Evaluation of the impact of gastric bypass surgery on eating behaviour using objective methodologies under residential conditions: rationale and study protocol. Contemp Clin Trials Commun. (2021) 24:100846. doi: 10.1016/j.conctc.2021.100846

PubMed Abstract | CrossRef Full Text | Google Scholar

85. Livingstone, MBE, Redpath, T, Naseer, F, Boyd, A, Martin, M, Finlayson, G, et al. Food intake following gastric bypass surgery: patients eat less but do not eat differently. J Nutr. (2022) 152:2319–32. doi: 10.1093/jn/nxac164

PubMed Abstract | CrossRef Full Text | Google Scholar

86. Kissileff, HR, Klingsberg, G, and Van Itallie, TB. Universal eating monitor for continuous recording of solid or liquid consumption in man. Am J Phys. (1980) 238:R14–22. doi: 10.1152/ajpregu.1980.238.1.R14

PubMed Abstract | CrossRef Full Text | Google Scholar

87. Gero, D, File, B, Justiz, J, Steinert, RE, Frick, L, Spector, AC, et al. Drinking microstructure in humans: a proof of concept study of a novel drinkometer in healthy adults. Appetite. (2019) 133:47–60. doi: 10.1016/j.appet.2018.08.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Glossary

Keywords: dietary intake, energy intake, dietary assessment, mHealth, image-based food recognition, doubly labelled water, artificial intelligence

Citation: Serra M, Alceste D, Hauser F, Hulshof PJM, Meijer HAJ, Thalheimer A, Steinert RE, Gerber PA, Spector AC, Gero D and Bueter M (2023) Assessing daily energy intake in adult women: validity of a food-recognition mobile application compared to doubly labelled water. Front. Nutr. 10:1255499. doi: 10.3389/fnut.2023.1255499

Received: 08 July 2023; Accepted: 04 September 2023;
Published: 22 September 2023.

Edited by:

Christoph Höchsmann, Technical University of Munich, Germany

Reviewed by:

Sanjoy Saha, Texas A&M University System, United States
Hanim Ecem Diktas, Pennington Biomedical Research Center, United States
Dachuan Zhang, Pennington Biomedical Research Center, United States

Copyright © 2023 Serra, Alceste, Hauser, Hulshof, Meijer, Thalheimer, Steinert, Gerber, Spector, Gero and Bueter. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Marco Bueter, bWFyY28uYnVldGVyQHVzei5jaA==

^†These authors have contributed equally to this work and share first authorship

^‡These authors have contributed equally to this work and share last authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.