Hostname: page-component-54dcc4c588-gwv8j Total loading time: 0 Render date: 2025-10-10T07:30:13.578Z Has data issue: false hasContentIssue false

Accuracy of menu calorie labelling in the England out-of-home food sector during 2024: assessment of a national food policy

Published online by Cambridge University Press:  23 September 2025

Amy Finlay*
Affiliation:
Department of Psychology, University of Liverpool, Liverpool, UK
Andrew Jones
Affiliation:
School of Psychology, Liverpool John Moores University, Liverpool, UK
Paula Thorp
Affiliation:
Department of Psychology, University of Liverpool, Liverpool, UK
I Gusti Ngurah Edi Putra
Affiliation:
Department of Public Health, Policy and Systems, University of Liverpool, Liverpool, UK
Megan Polden
Affiliation:
Department of Primary Care and Mental Health, University of Liverpool, Liverpool, UK NIHR Applied Research Collaboration Northwest Coast, Liverpool, UK
Jean Adams
Affiliation:
MRC Epidemiology Unit, University of Cambridge, School of Clinical Medicine, Cambridge, UK
Jane Brealey
Affiliation:
Department of Psychology, University of Liverpool, Liverpool, UK
Eric Robinson
Affiliation:
Department of Psychology, University of Liverpool, Liverpool, UK
*
Corresponding author: Amy Finlay; Email: amy.finlay@liverpool.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Mandatory calorie labelling was introduced in out-of-home (OOH) food sector outlets during 2022 in England. Previous research in North America has found that labelled energy content can be underestimated for packaged and quick-serve foods, but no study has evaluated the accuracy of OOH food sector menu calorie labelling in response to the mandatory policy introduced in England. N 295 menu items from a range of outlet types (e.g. cafes, pubs, restaurants) and menu categories (e.g. starters and sides, main, dessert) were sampled. Bomb calorimetry was used to quantify energy content, and the reported energy content on menus was recorded. Consistency of measured energy was assessed by sampling the same items across outlets of the same business (n 50 menu items). Differences between reported and measured energy content were tested through Wilcoxon signed rank tests, and a linear model examined correlates of the difference. Mean measured kilocalories (kcal) were significantly lower than reported kcal (–16·70 kcal (±149·19), V = 16 920, P < 0·01 and r = 0·182). However, both over- (23 % of menu items) and under-estimation (11 %) by > 20 % of measured energy content were common, and the averaged absolute percentage difference between reported and measured values was 21 % (±29 %). Discrepancy between measured and reported energy content was more common in some outlet types (pubs), and reported energy content was substantially different (> 20 %) to measured energy content for 35 % of sampled menu items. There may be significant inaccuracies in reported energy content of calorie labelled menu items in English food outlets subject to mandatory calorie labelling.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of the Nutrition Society

Consumption of food prepared out-of-home (OOH), for example from cafés, restaurants and fast-food outlets, is associated with increased energy consumption(Reference Goffe, Rushton and White1) and poorer dietary quality(Reference Taher, Evans and Evans2). It has been estimated that in the UK, 60 % of the population purchase OOH food weekly, with the OOH food sector contributing on average 11 % of weekly energy intake(Reference Garbutt, Townsend and Johnson3). In April 2022, calorie labelling on food menus was made mandatory in England for all businesses with over 250 employees. This policy was specific to the OOH food sector and applied to all non-prepacked foods and drinks made for immediate consumption(4). The aim of the mandatory calorie labelling policy was to promote informed food choices and healthier eating behaviour in the OOH food sector(5). Calorie labelling policies have been implemented outside of the UK, specifically in parts of Australia(Reference Wellard-Cole, Chapman and Dunford6), Canada(Reference Goodman, Vanderlee and White7) and in the USA nationally(Reference Greenthal, Sorscher and Pomeranz8). Evidence is mixed on the impact that calorie labelling has on consumer behaviour(Reference Bleich, Economos and Spiker9,Reference Robinson, Marty and Jones10) , but studies tend to suggest that calorie labelling results in reductions in energy content of menu items(Reference Zlatevska, Neumann and Dubelaar11,Reference Essman, Burgoine and Huang12) .

Implementation guidance for the 2022 mandatory calorie labelling policy in England(5) states that businesses can calculate the energy content of menu items by various means. Although laboratory analysis is considered the scientific gold standard, this is expensive and may not be feasible for many businesses. Energy content can be estimated using established and accepted food databases (for example, the McCance and Widdowson’s Composition of Foods Dataset), where nutritional composition is based on standard recipes(13). A limitation of this method is that values may not be accurate for recipes that deviate from this standard. Energy values can be averaged based on manufacturer’s analysis or calculated using known or average values of ingredients used and imputing them into nutritional calculators such as MenuCal(5). However, often these approaches will not account for cooking method used, which has a meaningful impact on the final served energy content.

A 20 % discretion between the reported and calculated energy content is permitted and necessary to provide businesses flexibility to account for differences in ingredient composition, portion size or calculation methods. Guidance acknowledges that accurate testing of energy content may not be viable for local enforcement officers(5). Regarding enforcement, this guidance states that ‘Local authorities have discretion in how they enforce the Regulations’(5). Specific roles of the enforcement officer include checking that calorie labels are present and that the methods used to estimate energy content are appropriate(5). However, there is little assistance for enforcement officers in assessing the accuracy of calorie labelling, and no explicit instruction or suggestion that accuracy of labelling of menu items should be tested.

In the USA, businesses appeared initially to have largely complied with the mandatory provision of calorie labelling, with 79 % of businesses fully or partially implementing labels in advance of policy enforcement,(Reference Cleveland, Simon and Block14) and similarly in the UK, 80 % of businesses provided calorie labelling at any point of choice 6 months post-implementation, compared with only 21 % who provided this information prior to implementation(Reference Polden, Jones and Essman15). The food industry has a history of inadequate compliance and exploitation of loopholes in government policy(Reference Boyland and Harris16Reference van der Westhuizen, Frank and Abdool Karim18). This is likely due to the lack of accountability by the food industry surrounding implemented policy and the lack of resources for compliance monitoring(Reference Swinburn, Kraak and Rutter19).

There is evidence that nutritional labels can be prone to some degree of inaccuracy for packaged food products. One study found that the energy content of popular USA snacks was on average higher than reported on labels(Reference Jumpertz, Venti and Le20). However, for most items (96 %), reported energy was within 20 % of measured energy content. In Canada, over 1000 items from supermarkets, bakeries and restaurants were tested for their nutritional content(Reference Fitzpatrick, Arcand and Abbe21). For the tested foods, sodium and energy were consistently under-reported. For 14 % of foods, the measured energy content exceeded label values by more than 20 %. A study conducted in the USA assessed the accuracy of nutrition labels for low-energy (< 500 kcal) restaurant and frozen meals(Reference Urban, Dallal and Robinson22). For frozen meals, measured energy content was on average 8 % higher than reported, and for restaurant meals, energy content was on average 18 % higher than reported. Several restaurant foods contained up to twice the reported energy content. Similarly, a USA study of restaurant menu items in 2011 found that although average measured v. reported energy content across menu items was similar, 19 % of individual menu items had a measured energy content of ≥ 100 kcal per portion more than reported(Reference Urban, McCrory and Dallal23).

No research has to date assessed the accuracy of calorie labels in OOH outlets following the implementation of the mandatory calorie labelling policy in England. As customers are encouraged to pay attention to the nutritional labelling of foods(4), it is expected that the provided information will impact behaviour, and inaccurate nutritional labelling may hinder individual efforts to eat healthier. For example, if consumers use calorie labels to factor foods into their daily energy allowance, then underestimation of calories on food menus would lead to consumers unknowingly consuming excess energy. Additionally, if consumers identify potential inaccuracies in labelling themselves, then they may lose trust and stop using calorie information in the OOH food sector. Furthermore, some research suggests that there is little or no enforcement of the policy requirement of providing accurate calorie information(Reference Essman, Bishop and Burgoine24). Therefore, in the present study, we examined the accuracy of menu calorie labelling in OOH sector outlets subject to the 2022 mandatory calorie labelling law in England. As ingredient and cooking method information from businesses is not publicly available, laboratory analysis of menu items’ energy content was used to assess accuracy of menu calorie labelling in the present research.

Materials and methods

This observational study was pre-registered on the Open Science Framework https://osf.io/8tfu4/.

Outlet selection

Outlets in two local authorities (LA) in England were sampled to ensure findings were not area specific. Liverpool (North of England) and Milton Keynes (South of England) were selected to ensure mixed geographical coverage and representation of different quintiles of deprivation (index of multiple deprivation (IMD) used at the LA level). LA IMD quintiles (1–5) were used as a local measure of deprivation with IMD1 reflecting the most deprived areas and IMD5 reflecting the least deprived. LA selected represent quintile 1 (most deprived – Liverpool) and quintile 3 (medium deprivation – Milton Keynes). However, outlets in both LA were located across quintiles 1–5 as measured at the Lower Super Output Area.

Previous research that evaluated the impact of the calorie labelling legislation on consumer and business behaviour(Reference Polden, Jones and Adams25) used the Inter-Department Business Register to identify businesses likely to be subject to the mandatory calorie labelling policy within LA of interest (data sampled June 2021, list produced Autumn 2020). The Inter-Department Business Register is a list of all UK businesses, their core characteristics, number of employees and principal activities defined using the Standard Industrial Classification. Standard Industrial Classification codes likely to include businesses serving food were identified (see the full list of Standard Industrial Classification codes in online Supplementary Material 1). Large businesses (defined as 250+ employees) within the relevant classification codes were then located, and their individual outlets were identified using Ordnance Survey Points of Interest data from September 2020. We used the final list of eligible businesses to guide outlet selection in the two LA.

Through power calculations, we deemed that a minimum of n 239 samples (menu items) (n 240 rounded to the nearest multiple of 2) would be sufficient to detect small effect sizes through paired samples tests in our primary analysis (see online Supplementary Material 2 for further detail). To obtain 240 samples with four samples per outlet (not including items deemed potentially inaccurate, which were analysed separately) required sixty outlets in total or thirty outlets in each LA. For a sub-sample of outlets (n 5 in each LA), we planned to match outlets across LA and conduct paired sample comparisons to assess consistency within chains. Figure 1 depicts the outlet selection process. To obtain a sample with variation in outlet type and item type, we randomly selected (using the RAND function in excel) N 6 unique outlets from each of four main outlet categories: café, restaurant, fast food and pubs. These outlet categories were taken from previous research evaluating the national calorie labelling policy(Reference Polden, Jones and Essman26). If for any outlet category there were not six unique outlets (i.e. there were only multiple outlets from the same chain), additional outlets classed as ‘sport & entertainment’, for example, cinemas, were sampled. We randomly selected additional outlets from the full database (irrespective of outlet type) until n 25 unique outlets were selected for each LA. Duplicates of businesses across the two LA were permitted.

Figure 1. Outlet selection process.

To assess consistency in calorie labelling between the same businesses across the two local authorities, n 5 (20 %) selected outlets in each LA were matched to an additional corresponding outlet from the same business in the other LA where the same items would be sampled. This resulted in an additional n 10 outlets sampled and a total of n 50 matched pairs for our planned secondary analyses. Additional outlets could only be sampled if they were not already selected. For example, if the randomly selected outlet in LA1 was a Starbucks, an additional matched Starbucks would be identified in LA2 for sampling. In total, we sampled n 300 menu items from sixty outlets (thirty in each LA). For full sample size and power analysis information, see online Supplementary Material 2.

Menu item selection

Prior to data collection, online menus for each outlet were used to identify the items on offer in outlets. For each outlet, one menu item was selected for sampling in each of the following categories:

  • Starter/side or prepared drink (randomly selected)

  • Main meal (randomly selected)

  • Dessert (randomly selected)

  • Menu item that that was potentially inaccurate (determined by a research staff member and trained nutritionist assessing the full menu)

  • Most popular menu item (determined by asking serving staff on the day of data collection)

In instances where menu categories were not distinct (e.g. starter – main – dessert – side dish), two researchers individually considered the categories provided by outlets and re-categorised items into the above groups. Any differences were resolved through discussion. For some outlets, specifically cafés, the categories outlined above did not encompass the majority of items available, and so prepared drinks were sampled in place of starters/sides. This was the case for n 7 outlets from n 3 unique businesses.

Menu items categorised as starter/side or prepared drink, main meal and dessert were numbered, and a random number generator was used to select items for sampling prior to data collection. To determine items sampled under the category of potentially inaccurate, two researchers considered the available description of menu items and whether the reported energy content looked inaccurate using menu information alone (either too high or too low). We adopted this approach to reflect how enforcers of the policy may be expected to assess the accuracy of calorie labels on food menus (i.e. relatively subjectively). If there were no items that appeared to be inaccurate at face value (e.g. very high or low energy content based on type of menu item), researchers considered instances where potential variability in serving size (e.g. inconsistency in the serving size of a dish of pasta) could result in differences in energy content from reported. Once each researcher had determined a potentially inaccurate item for each outlet, a final item was decided upon through discussion. Due to the absence of sales data to assess popularity, in each outlet, the researcher asked the serving staff what the most popular item on the menu was, and this item was sampled. If the most popular menu item or potentially inaccurate item from an outlet was a drink, this was eligible for sampling.

For instances where menu items required customisation (e.g. choosing a side to go with a dish), researchers randomly selected from all available customisation options.

Energy content of sampled items and procedure

The researcher responsible for data collection recorded the energy content reported on menus for each sampled menu item on the day of sampling. Samples were collected (Monday–Thursday) between 11.00 and 19.00 during April-May 2024. For each outlet, the researcher responsible for data collection ordered food to dine in. All sampled items (n 300 from n 60 outlets) were individually weighed and packaged in the restaurant. Calibration weights were used to ensure scales were accurate. All items were sent via a courier to an external laboratory accredited by the UK Accreditation Service (SGS Cambridge) for nutritional analysis through bomb calorimetry, and energy (kcal) data were analysed.

Analysis

Primary analyses

For all analyses, the potentially inaccurate items were considered separately from all other items tested as we anticipated their inclusion may overestimate average difference between measured and reported energy content. For the main analyses, one item was missing due to lab error (spoiled food due to storage error), and three items were excluded from analysis due to implausibility of energy values. This resulted in n 236 samples. Similarly, for potentially inaccurate items, one menu item was missing due to lab error, resulting in n 59 samples.

Throughout the results, relative differences are calculated as measured kcal – reported kcal for both mean and percentage difference. Absolute differences are calculated as the mean or percentage difference from measured kcal regardless of the direction of the difference, calculated using the ‘abs()’ function in R Studio.

Absolute percentage differences were used to explore whether differences between measured and reported energy content differed from 0 % (perfect accuracy) and 20 % (permitted inaccuracy). As data were not normally distributed, one-sample Wilcoxon signed rank tests were conducted against 0 and 20 % to examine if the observed absolute difference values were significantly different from 0 % and 20 %.

We also conducted Wilcoxon signed rank tests to assess whether there was a significant difference between reported energy and lab-measured energy content (relative difference).

Secondary analyses

For the subsample of restaurants where a corresponding outlet from each LA was tested, resulting in n 50 matched pairs, we examined whether the measured energy content differed between the two locations using a paired Wilcoxon signed rank test. We created a new variable (measured kcal – reported kcal) and fitted a linear regression model to examine potential predictors (e.g. outlet characteristics) of the relative difference between measured and reported energy. We explored outlet type, menu category, item energy content, IMD score for outlet and LA. If any predictor variables were significant, we conducted subgroup analyses to explore differences. We also used this method to explore predictors of absolute mean percentage difference.

Results for primary analyses were considered significant at P < 0·05. To account for multiple comparisons, results for secondary and any exploratory analyses were considered significant at P < 0·01. For explanation of deviation to the pre-registered analysis plan, see online Supplementary Material 3.

Results

A total of n 295 menu items were sampled from n 60 outlets across two LA in England. Figure 2 displays the mean reported and measured energy content for each category of item.

Figure 2. Reported v. Measured energy (kcal) split by outlet type and food category. *Error bars represent standard deviation.

Menu items were categorised by outlet type (pubs, cafes, fast food, restaurants and entertainment) and menu category (starter/side, main, dessert, popular item, potentially inaccurate item and drinks). Mean energy and percentage differences between reported and measured energy for each category are reported in Table 1. For the most popular menu items and items that were potentially inaccurate, approximately 2/3 of sampled items were main meal items. Overall, the mean relative difference between reported and measured energy (measured-reported) was –16·70 kcal (±149·19) and −8·75 % (±35·27 %) expressed as a relative percentage of the menu item energy content. This was 98·76 kcal ±112·88 and 21·30 % (±29·45 %) when the absolute energy and percentage difference were examined (i.e. mean difference regardless of direction of difference).

Table 1. Difference between reported and measured energy (kcal) for the different categories

* Negative values indicate that reported kcal were greater than measured.

Percentage difference for reported – measured kcal.

Mean difference between reported and measured kcal regardless of direction (– or +) of difference.

§ Numbers in brackets are sd.

|| Potentially inaccurate items not included in mean and standard deviation values for ‘overall’, ‘item category’ and ‘outlet category’.

In the main analyses (all menu items except for those that were potentially inaccurate), 56 % of items had a lower measured energy content than reported on menus. For over one-third (35 %) of items, the energy content reported on menus was outside of the 20 % leeway permitted; for 66 % of these (23 % of total), this was due to reported energy being substantially higher than measured and for 31 % of these (11 % of total) this was due to reported energy being substantially lower than measured. Findings were similar in the potentially inaccurate items (see online Supplementary Material 4).

Two Wilcoxon signed rank tests found that the absolute percentage difference between reported and measured energy (n 236) was significantly greater than 0 % (V = 27 261, P < 0·001) but not greater than 20 % (V = 0, P > 0·99). However, reported and measured energy content were significantly different from each other, whereby on average energy content reported on menus was significantly greater than measured energy content (V = 16 920, P < 0·01, r = 0·182). A boxplot of mean differences is shown in Figure 3. Reported and measured energy were not significantly different for potentially inaccurate items.

Figure 3. Reported and measured energy content of n 236 items. *white circle indicates the mean, ** outliers are shown twice (see grey points).

For the subsample of menu items from outlets matched across the two LA, a Wilcoxon signed rank test found no significant differences in measured energy between matched items (V = 583, P = 0·775), see Figure 4. The mean energy content of these items in Liverpool was 547·39 kcal ± 327·96 and for Milton Keynes this was 549·45 kcal ±312·02. The mean difference (Liverpool – Milton Keynes) was −14·15 kcal, and the relative percentage difference was −7·99 %. 73 % of items sampled in Milton Keynes were within 20 % of the value in Liverpool.

Figure 4. Matched pairs of menu items across the two local authorities (n 50 matched pairs). *white circle indicates the mean.

A linear model explored predictors of the mean difference between reported and measured energy content (Table 2). Variance inflation factor for all predictors were below 2, indicating minimal multicollinearity. Items with a higher measured energy content were more likely to have a measured energy higher than reported energy (B = 0·22, 99 % CI 0·12, 0·32, P < 0·001), whereby every measured 1 kcal increase was associated with an increased difference of 0·22 kcal. Main effects were also observed for outlet type and item category, whereby pubs and restaurants (v. cafes) and popular dishes (v. starters/sides) had a greater difference between reported and measured energy. There were no significant predictors of difference between measured and reported energy for potentially inaccurate items (full results in online Supplementary Material 4). We also explored predictors of absolute mean percentage difference between reported and measured energy content, and results were similar to main analyses (online Supplementary Material 5).

Table 2. Linear model exploring predictors of mean relative difference between reported and measured energy content (calculated as measured – reported)

IMD, index of multiple deprivation.

Reference categories are *Cafes, Starters/sides, IMD quintile 1, §Liverpool.

Negative values indicate a decreased relative difference.

Bold text indicates significant P value.

To explore identified significant predictors, paired samples tests (Wilcoxon signed ranks) were conducted that examined differences between measured v. reported energy for cafes, restaurants and pubs and the starter and popular item categories. Pubs were the only outlet type where reported and measured energy were significantly different from each other (V = 943, P < 0·01). For pubs, mean reported kcal were 725·00 ± 326·89 and mean measured kcal were 674·10 ± 338·36. There was no significant difference between measured and reported energy for the starter or popular menu categories. Mean differences, percentage differences and the proportion of items outside of the 20 % leeway for each outlet type and item category are shown in Table 1.

Discussion

This study of large business owned OOH food outlets subject to the mandatory calorie labelling law in England during 2024 found that averaged across all sampled menu items, the energy content (kcal) reported on OOH outlet menus was significantly greater than measured energy. However, it was common for reported energy content of menu items to be greater and less than measured energy content. For 35 % of menu items, reported energy was over 20 % greater or less than measured energy. Pubs had a particularly pronounced difference between reported and measured energy content compared with other outlet types. The measured energy content of the same menu items sampled from different outlets of the same chain were largely similar. Collectively, these findings suggest that reported energy content for significant numbers of menu items in the OOH sector in England differ substantially to measured energy content.

Expressed as a relative calorie value and a percentage of each item’s measured energy content, average reported energy of items was 17 kcal and 9 % greater than measured energy content. However, expressed as an absolute value (size of deviation from measured energy content irrespective of direction), the mean difference between measured and reported energy was 99 kcal and 21 %, due to both over- and under-estimation of measured energy content being common (although over-estimation of energy content was observed more frequently). This suggests that on average, calories reported for menu items in the English OOH food sector may differ substantially to their measured energy content. The overall pattern of results differs somewhat to research conducted in the USA(Reference Urban, Dallal and Robinson22) which identified that measured energy content tended to be greater than reported on labelling and on restaurant menus. This difference in our findings may be a result of differences in types of outlets or menu items examined. Previous research has largely sampled packaged foods, which may be less prone to variability during the preparation and cooking process compared with items made in the OOH food sector. There may also be differences in the sampling methods used that could impact the measured energy content of items (e.g. sampling and packaging hot food v. sampling cold, packaged foods). One USA study considered OOH foods, but limited the investigation to menu items under 500 kcal(Reference Urban, Dallal and Robinson22), which may help to explain differences as our findings would suggest there is less scope for inaccuracy at lower levels of energy content. However, our findings are comparable to those of a study conducted in Canada sampling a range of outlet types(Reference Fitzpatrick, Arcand and Abbe21), whereby items had an overall greater mean labelled energy content than mean measured energy content, with evidence of both over and under-estimation of reported energy content.

In the previous North American studies discussed, reported energy content was largely within the 20 % leeway of measured content (70 %(Reference Jumpertz, Venti and Le20), 86 %(Reference Fitzpatrick, Arcand and Abbe21), 59 %(Reference Urban, Dallal and Robinson22)), and this is comparable to the present study (65 %). However, across all of these and the present study, a proportion of menu item labels appears to be significantly different from measured values (> 20 %). There are multiple possible explanations for the observed error in reported energy content in the present and prior research, for example, errors in ingredient information or human error in calculation. It would now be informative to study what the causes of inaccuracy are. In England, the calorie labelling legislation outlines several accepted methods to calculate energy content, and for this research we chose to use the gold standard of measurement (i.e. bomb calorimetry)(Reference Basolo, Parrington and Ando27). It is plausible that the observed error comes from businesses utilising alternative methods (e.g. calculation using standardised ingredient information) to estimate energy content(5).

While five different outlet types were explored, the largest difference between mean reported and measured energy content was observed for pubs. This outlet type had 46 % of items outside of the 20 % legislation leeway, while all other outlet types had between 30 and 36 % of items outside of the leeway. Observed differences between measured and reported energy may have been greater in pubs due to differences in the use of ingredients that are more prone to fluctuations in water and energy content through cooking. This, along with potentially greater autonomy for chefs in how foods are prepped and cooked (e.g. compared with fast food outlets or the café chains sampled, where many food ingredients are pre-packaged, and serving sizes more standardised), may explain why discrepancies were greater in pubs. Alternatively, differences between outlet types and individual outlets may exist if there are differences in how the energy content of menu items is calculated.

When items deemed potentially inaccurate were examined, results were similar to the main analysis. Items in this category were selected based on two researcher’s assessments of all items on a menu with consideration of the description and composition of the dish and the potential for inconsistencies in portion sizes made by servers in outlets. As these findings were not largely different from the items in the main analyses, this suggests that reliably identifying likely inaccurately labelled menu items based on menu information alone may be difficult.

Our findings highlight the difficulty that enforcers of the policy may have if attempting to assess accuracy of calorie labelling in OOH outlets without expending substantial resources on laboratory measurement. At present there appears to be minimal enforcement training supplied, although how enforcement is monitored is at the discretion of the LA(5). To aid improved accuracy of a menu’s calorie information and enforcement of the policy, laboratory analysis of menu items would preferably determine reported energy content; however, this would result in significant financial cost to businesses. Despite this, a one-off or annual cost may be minimal for larger food chains such as those explored in this study. Greater support for businesses in calculating energy content, as well as greater training and assistance for the individuals responsible for assessing compliance, would likely help to reduce the discrepancy between measured and reported energy content. Additionally, clearer pathways for compliance monitoring and sanctions for observed inaccuracy may be necessary to improve accuracy of labelling by businesses. Our results showed that the same menu items from different outlet locations of the same business tended to have similar measured calorie content, so analysis at the chain level is likely to be somewhat representative of all outlets in a chain. However, this should be explored in greater depth, and particularly within pubs, where the greatest inconsistencies in reported v. measured energy content were observed.

Strengths and limitations

There are a number of limitations of this work that should be considered alongside findings. This study explored a large sample of menu items in the OOH food sector; however, this sample is not representative of all menu items in outlets in England and instead provides a snapshot of the accuracy of calorie labels in the OOH food sector among large businesses. The analyses relating to matched samples and exploration of potentially inaccurate items can be considered as exploratory only, as the study was not powered for these smaller samples. We used a gold standard measurement of energy content, and this is a strength of the present work. However, we collected one sample per menu item from outlets for our main analyses, and future research would benefit from taking multiple samples of the same item. This would improve measurement accuracy for estimated energy content and also allow for examination of consistency of menu item energy content within the same outlet.

Conclusion

There were significant inaccuracies in reported energy content of calorie labelled menu items in English food outlets subject to mandatory calorie labelling, and this appears to be caused by both over- and under-estimation of reported energy content.

Supplementary material

For supplementary material/s referred to in this article, please visit https://doi.org/10.1017/S0007114525105217

Acknowledgements

This piece of work was primarily funded by the University of Liverpool Policy Support Fund. A. F. and E. R.’s salary is supported by an ESRC grant (ES/W007932/1). E. R. was funded by the National Institute for Health and Care Research (NIHR) Oxford Health Biomedical Research Centre (BRC). J. A. was supported by the Medical Research Council [grant number MC_UU_00006/7]. M.P. received support from the NIHR Applied Research Collaboration ARC NWC and Alzheimer’s Society and was funded through a Post-Doctoral Fellowship. I.G.N.E.P. received support from the NIHR Development and Skill Enhancement Award (DSE) [grant number NIHR305076].

A. F., E. R., M. P., I. G. N. E. P., A. J. and J. A. conceptualised the study. A. F. and P. T. were responsible for data curation. I. G. N.E. P. and A. F. conducted formal analyses. E. R. and A. F. acquired funding. A. F., E. R., M. P. and P. T. were responsible for investigation and methodology. A. F. was responsible for project admin. J. B. conducted data validation. A. F. and E. R. wrote the original draft of the manuscript, and all authors reviewed and edited the manuscript.

E. R. has previously received research funding from Unilever and the American Beverage Association for unrelated research projects. Other authors have no competing interests.

References

Goffe, L, Rushton, S, White, M, et al. (2017) Relationship between mean daily energy intake and frequency of consumption of out-of-home meals in the UK National Diet and Nutrition Survey. Int J Behav Nutr Physical Activity 14, 131.Google Scholar
Taher, AK, Evans, N & Evans, CEL (2019) The cross-sectional relationships between consumption of takeaway food, eating meals outside the home and diet quality in British adolescents. Public Health Nutr 22, 6373.Google Scholar
Garbutt, J, Townsend, N, Johnson, L, et al. (2025) The contribution of the out-of-home food (OOHF) sector to the national diet: a cross-sectional survey with repeated 24-hour recalls of adults in England (2023–2024). medRxiv https://doi.org/10.1101/2025.06.30.25330369.Google Scholar
Department of Health and Social Care (2022) New Calorie Labelling Rules Come into Force to Improve Nation’s Health. UK: GOV.UK. https://www.gov.uk/government/news/new-calorie-labelling-rules-come-into-force-to-improve-nations-health (accessed June 2024).Google Scholar
Department of Health and Social Care (2021) Calorie Labelling in the Out of Home Sector: Implementation Guidance. UK: GOV.UK. https://www.gov.uk/government/publications/calorie-labelling-in-the-out-of-home-sector/calorie-labelling-in-the-out-of-home-sector-implementation-guidance (accessed June 2024).Google Scholar
Wellard-Cole, L, Chapman, K, Dunford, EK, et al. (2018) Monitoring the changes to the nutrient composition of fast foods following the introduction of menu labelling in New South Wales, Australia: an observational study. Public Health Nutr 21, 11941199.Google Scholar
Goodman, S, Vanderlee, L, White, CM, et al. (2018) A quasi-experimental study of a mandatory calorie-labelling policy in restaurants: impact on use of nutrition information among youth and young adults in Canada. Prev Med 116, 166172.Google Scholar
Greenthal, E, Sorscher, S, Pomeranz, JL, et al. (2023) Availability of calorie information on online menus from chain restaurants in the USA: current prevalence and legal landscape. Public Health Nutr 26, 32393246.Google Scholar
Bleich, SN, Economos, CD, Spiker, ML, et al. (2017) A systematic review of calorie labeling and modified calorie labeling interventions: impact on consumer and restaurant behavior. Obesity 25, 20182044.Google Scholar
Robinson, E, Marty, L, Jones, A, et al. (2021) Will calorie labels for food and drink served outside the home improve public health? BMJ 372, n40.Google Scholar
Zlatevska, N, Neumann, N & Dubelaar, C (2018) Mandatory calorie disclosure: a comprehensive analysis of its effect on consumers and retailers. J Retailing 94, 89101.Google Scholar
Essman, M, Burgoine, T, Huang, Y, et al. (2024) Changes in energy content of menu items at out-of-home food outlets in England after calorie labelling policy implementation: a pre-post analysis (2021–2022). medRxiv 2024.08.09.24311741.Google Scholar
Food Standards Agency (2021) Guidance on Voluntary Energy Labelling for Out of Home Businesses in Northern Ireland. UK. https://www.food.gov.uk/sites/default/files/media/document/calorie-wise-technical-guidance_july2021.pdf (accessed June 2024).Google Scholar
Cleveland, LP, Simon, D & Block, JP (2018) Compliance in 2017 with Federal Calorie Labeling in 90 chain restaurants and 10 retail food outlets prior to required implementation. Am J Public Health 108, 10991102.Google Scholar
Polden, M, Jones, A, Essman, M, et al. (2024) Point-of-choice kilocalorie labelling practices in large, out-of-home food businesses: a preobservational v. post observational study of labelling practices following implementation of The Calorie Labelling (Out of Home Sector) (England) Regulations 2021. BMJ Open 14, e080405.Google Scholar
Boyland, EJ & Harris, JL (2017) Regulation of food marketing to children: are statutory or industry self-governed systems effective? Public Health Nutr 20, 761764.Google Scholar
Roberts, M, Pettigrew, S, Chapman, K, et al. (2012) Compliance with children’s television food advertising regulations in Australia. BMC Public Health 12, 846.Google Scholar
van der Westhuizen, B, Frank, T, Abdool Karim, S, et al. (2023) Determining food industry compliance to mandatory sodium limits: successes and challenges from the South African experience. Public Health Nutr 26, 25512558.Google Scholar
Swinburn, B, Kraak, V, Rutter, H, et al. (2015) Strengthening of accountability systems to create healthy food environments and reduce global obesity. Lancet 385, 25342545.Google Scholar
Jumpertz, R, Venti, CA, Le, DS, et al. (2013) Food label accuracy of common snack foods. Obesity 21, 164169.Google Scholar
Fitzpatrick, L, Arcand, J, Abbe, M, et al. (2014) Accuracy of Canadian food labels for sodium content of food. Nutrients 6, 33263335.Google Scholar
Urban, LE, Dallal, GE, Robinson, LM, et al. (2010) The accuracy of stated energy contents of reduced-energy, commercially prepared foods. J Am Dietetic Assoc 110, 116123.Google Scholar
Urban, LE, McCrory, MA, Dallal, GE, et al. (2011) Accuracy of stated energy contents of restaurant foods. JAMA 306, 287293.Google Scholar
Essman, M, Bishop, T, Burgoine, T, et al. (2024) Implementation and enforcement of mandatory calorie labelling regulations for the out-of-home sector in England: qualitative study of the experiences of business implementers and regulatory enforcers. medRxiv https://doi.org/10.1101/2024.02.18.24302990.Google Scholar
Polden, M, Jones, A, Adams, J, et al. (2023) Kilocalorie labelling in the out-of-home sector: an observational study of business practices and consumer behaviour prior to implementation of the mandatory calorie labelling policy in England, 2022. BMC Public Health 23, 1088.Google Scholar
Polden, M, Jones, A, Essman, M, et al. (2025) Evaluating the association between the introduction of mandatory calorie labelling and energy consumed using observational data from the out-of-home food sector in England. Nat Hum Behav 9, 277286.Google Scholar
Basolo, A, Parrington, S, Ando, T, et al. (2020) Procedures for measuring excreted and ingested calories to assess nutrient absorption using bomb calorimetry. Obesity 28, 23152322.Google Scholar
Figure 0

Figure 1. Outlet selection process.

Figure 1

Figure 2. Reported v. Measured energy (kcal) split by outlet type and food category. *Error bars represent standard deviation.

Figure 2

Table 1. Difference between reported and measured energy (kcal) for the different categories

Figure 3

Figure 3. Reported and measured energy content of n 236 items. *white circle indicates the mean, ** outliers are shown twice (see grey points).

Figure 4

Figure 4. Matched pairs of menu items across the two local authorities (n 50 matched pairs). *white circle indicates the mean.

Figure 5

Table 2. Linear model exploring predictors of mean relative difference between reported and measured energy content (calculated as measured – reported)

Supplementary material: File

Finlay et al. supplementary material

Finlay et al. supplementary material
Download Finlay et al. supplementary material(File)
File 36.3 KB