Abstract
Background
Accurate measurement is required to assess current and changing physical activity levels, and to evaluate the effectiveness of interventions designed to increase activity levels. This study systematically reviewed the literature to determine the extent of agreement between subjectively (self-report; e.g. questionnaire, diary) and objectively (directly measured; e.g. accelerometry, doubly labeled water) assessed physical activity in adults.
Methods
Eight electronic databases were searched to identify observational and experimental studies of adult populations. Searching identified 4,463 potential articles. Initial screening found that 293 examined the relationship between self-reported and directly measured physical activity and met the eligibility criteria. Data abstraction was completed for 187 articles, which described comparable data and/or comparisons, while 76 articles lacked comparable data or comparisons, and a further 30 did not meet the review's eligibility requirements. A risk of bias assessment was conducted for all articles from which data was abstracted.
Results
Correlations between self-report and direct measures were generally low-to-moderate and ranged from -0.71 to 0.96. No clear pattern emerged for the mean differences between self-report and direct measures of physical activity. Trends differed by measure of physical activity employed, level of physical activity measured, and the gender of participants. Results of the risk of bias assessment indicated that 38% of the studies had lower quality scores.
Conclusion
The findings suggest that the measurement method may have a significant impact on the observed levels of physical activity. Self-report measures of physical activity were both higher and lower than directly measured levels of physical activity, which poses a problem for both reliance on self-report measures and for attempts to correct for self-report – direct measure differences. This review reveals the need for valid, accurate and reliable measures of physical activity in evaluating current and changing physical activity levels, physical activity interventions, and the relationships between physical activity and health outcomes.
Background
Physical activity is defined as "any bodily movement produced by the skeletal muscle that results in energy expenditure (EE)" [1]. Inactivity is known to be associated with an increased risk for many chronic diseases including: coronary artery disease, stroke, hypertension, colon cancer, breast cancer, Type 2 diabetes, and osteoporosis [2], as well as premature death. The economic burden of physical inactivity in Canada has been estimated to be $2.1 billion [2]. Physical activity levels are often monitored to assess the health behaviours of the population and their association with health status including mortality and morbidity rates. Accurate assessment of physical activity is required to identify current levels and changes within the population, and to assess the effectiveness of interventions designed to increase activity levels.
Data collection at the population level often involves self-report (subjective) measures of physical activity through the use of questionnaires, diaries/logs, surveys, and interviews. These measures are frequently used due to their practicality, low cost, low participant burden, and general acceptance [3]. Although self-reports are useful for gaining insight into the physical activity levels of populations, they have the capacity to over- or underestimate true physical activity energy expenditure and rates of inactivity. The self-report methods are often fraught with issues of recall and response bias (e.g. social desirability, inaccurate memory) and the inability to capture the absolute level of physical activity.
As self-report methods possess several limitations in terms of their reliability and validity [4], objective or direct measures of physical activity are commonly used to increase precision and accuracy and to validate the self-report measures. Direct measures are believed to offer more precise estimates of energy expenditure and remove many of the issues of recall and response bias. Direct measures consist of calorimetry (i.e., doubly labeled water, indirect, direct), physiologic markers (i.e., cardiorespiratory fitness, biomarkers), motion sensors and monitors (i.e., accelerometers, pedometers, heart rate monitors), and direct observation. Despite the advantages of using direct methods, these types of measures are often time- and cost-intensive and intrusive, rendering them difficult to apply in large epidemiologic settings. These measures also require specialized training and the physical proximity of the participant for data collection. In addition, direct measures each possess their own limitations and no single "gold standard" exists for measuring physical activity or assessing validity [3].
The appropriate method for measuring physical activity at various levels depends on factors such as the number of individuals to be monitored, the time period of measurements and available finances [5]. Many previous studies have examined the reliability and validity of various self-report and direct methods for assessing physical activity. Results from these studies have been conflicting. To our knowledge no attempt has been made to synthesize the literature to determine the validity of physical activity measures in adult populations.
The primary objective of this study was to perform a systematic review to compare self-report versus direct measures for assessing physical activity in observational and experimental studies of adult populations. The results from this systematic review provide a comprehensive summary of past research and a comparison between physical activity levels based on direct versus self-report measures in adult populations.
Methods
Study criteria
The review sought to identify all studies (observational or experimental) that presented a comparison of self-report and direct measurement results to reveal differences in physical activity levels based on measurement in adult populations (18 years and over). Studies which examined only a self-report or direct measure, but not both were not included in the review. All study designs were eligible (e.g. retrospective, prospective, case control, randomized controlled trial, etc.) and both published (peer-reviewed) and unpublished literature were examined.
Only studies involving adult populations with a mean age of 18 years and older were considered. Abstracts and titles were examined for their mention of adult populations (using adult$.tw.), but the search relied mostly on the subject headings for adult age groups (exp adult/). This systematic review was conducted simultaneously with a systematic review of the same focus in child populations (mean age < 19 years). A separate pediatric review was carried out as a result of differences in measurement methodologies and hypothesized cognitive and recall abilities between adults and children [6].
The eligible self-report measures of physical activity included: diaries or logs; questionnaires; surveys; and recall interviews. Proxy-reports were excluded because they present issues of reliability due to the potential heterogeneity of reporters (e.g., spouse, trainer, coach, parent, caregiver). The eligible direct measures of physical activity included: doubly-labeled water (DLW), indirect or direct calorimetry, accelerometry, pedometry, heart rate monitoring (HRM), global positioning systems, and direct observation. Although no language restrictions were imposed in the search, only English language articles were included in the review. Abstracts were included if they provided sufficient details to meet inclusion criteria.
Search strategy
The following electronic bibliographic databases were searched using a comprehensive search strategy to identify relevant studies reporting the use of both self-report and direct measures for assessing individual physical activity levels: Ovid MEDLINE(R) (1950 to April Week 4 2007); Ovid EMBASE (1980 to 2007 Week 18); Ovid CINAHL (1982 to April Week 4 2007); Ovid PsycINFO (1806 to April Week 1 2007); SPORTDiscus (1830 to April 2007); Physical Education Index (1970 to April 2007); Dissertations and Theses (1861 to April 2007); and Ovid MEDLINE (R) Daily Update (May 4, 2007). The search strategy is illustrated using the MEDLINE search as an example (Table 1) and was modified according to the indexing systems of the other databases. The OVID interface was used to search MEDLINE, EMBASE, CINAHL, and PsycINFO; Ebscohost was used to search SPORTDiscus; Scholar's Portal was used to search Physical Education Index; and ProQuest for Dissertations and Theses. Grey literature (non-peer reviewed works) included published abstracts and conference proceedings, published lists of theses and dissertations, and government reports. Knowledgeable researchers in the field were solicited for key studies of interest. The bibliographies of key studies selected for the review were examined to identify further studies.
Two independent reviewers screened the titles and abstracts of all studies to identify potentially-relevant articles. Duplicates were manually removed. The full texts of all studies that met the inclusion criteria were then obtained and reviewed. When disagreements between reviewers occurred, consensus was achieved through discussion and/or with a third reviewer.
Standardized data abstraction forms were completed by one reviewer and verified by two others. Information was extracted on the type of study design, participant characteristics, sample size, and methods of physical activity measurement (self-report and direct measures employed, units of measurement, duration of direct measure, length of recall, and length of time between the self-report and directly measured estimates). Reviewers were not blinded to the authors or journals when extracting data.
Risk of bias assessment
The Downs and Black [7] checklist was used to assess the risk of bias. The Downs and Black instrument was recommended for assessing risk of bias in observational studies in a recent systematic review [8] and other assessments [9] and was employed in this review to assess study quality including reporting, external validity, and internal validity (bias). The Downs and Black checklist consists of 27 items with a maximum count of 32 points. A modified version of the checklist was employed with items that were not relevant to the objectives of this review removed. The adapted checklist consisted of 15 items, including items 1–4, 6, 7, 9–13, 16–18, and 20 from the original list, with a maximum possible count of 15 points (higher scores indicate superior quality). The risk of bias assessment was carried out by two independent assessors and when disagreements between assessors occurred, consensus was achieved through discussion.
Data synthesis
Percent mean difference was used as the main outcome of this analysis; it was calculated using the formula: [(self-report mean – direct mean)/direct mean] × 100. Only studies with units of measurement that were the same for both the self-report and direct measures were used to calculate percent mean differences. Units were converted where possible. These studies were included in the direct comparison analyses. Forest plots (graphical displays of the percent mean differences across the individual studies) were constructed to present overall trends in agreement of physical activity by direct measure and gender. As most studies did not employ the same units of measurement (e.g. kcal/week, MET/day, MET-min/day) and did not report a measure of variance (e.g. standard deviations or standard errors), pooled estimates and confidence intervals were not calculated.
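The calculation described above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' analysis code: the function name and the example values are hypothetical, and the result is scaled by 100 on the assumption that differences are expressed as percentages, as they are in the Results.

```python
def percent_mean_difference(self_report_mean: float, direct_mean: float) -> float:
    """Return (self-report mean - direct mean) / direct mean, as a percentage.

    Both means must be expressed in the same units (e.g. kcal/day).
    """
    if direct_mean == 0:
        raise ValueError("direct mean must be non-zero")
    return 100.0 * (self_report_mean - direct_mean) / direct_mean


# Hypothetical means in kcal/day; these values are not taken from any included study.
higher = percent_mean_difference(450.0, 300.0)  # self-report above the direct measure
lower = percent_mean_difference(240.0, 300.0)   # self-report below the direct measure
print(f"{higher:+.1f}% {lower:+.1f}%")
```

A positive result means the self-report estimate exceeded the direct measure and a negative result means it fell below it, which matches the sign convention used for the forest plots.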
Results
Description of studies
The preliminary search of electronic bibliographic databases, reference lists and grey literature identified 4,463 citations (see Figure 1). Of these, 1,638 were identified in MEDLINE, 1,306 in EMBASE, 732 in CINAHL, 218 in PsycINFO, 133 in SportDISCUS, 34 in Physical Education Index, 3 in MEDLINE Daily Update, and 399 from Dissertations and Theses. After a preliminary title and abstract review, 296 full text articles were retrieved for a detailed assessment. Of these, 173 met the criteria for study inclusion. One hundred and forty-eight of these studies reported correlation statistics [10–157]. Seventy-four studies contained comparable data meaning the self-report and direct measurements were reported using the same units [11, 15, 17, 19, 20, 23, 32, 33, 44, 48, 53, 56–59, 65, 73–77, 80, 88, 90, 92, 94, 100, 102, 105, 111, 114, 116, 119–121, 128, 131, 134, 135, 138–140, 143, 148, 151, 153, 154, 158–183]. These studies were included in the direct comparison analyses and their characteristics are described in Table 2. Common reasons for excluding studies included: populations with mean ages less than 18 years, the absence of directly measured and self-report data on the same population, non-English language, duplicate reporting of data, and the absence of comparable units between measures or the absence of a direct comparison.
Data abstraction identified three articles and two dissertations that analyzed and reported duplicate data in multiple papers [184–188]. Authors of suspected duplications were contacted and in cases where several publications reported the same analyses from the same data source, only one study per data source/analysis was retained in order to avoid double counting. Studies were retained based on the most pertinent and most recent data, as well as the largest sample size. Studies included were published over a 24-year period from 1983 to 2007. All studies were written in English. Nineteen of the studies used randomized controlled trial designs [22, 24, 26, 28, 30, 50, 53, 61, 84, 91, 124, 148, 149, 163, 165, 171, 181] and all others used observational designs (e.g. case control, cross-sectional, longitudinal). All included studies were published as journal articles except for 19 dissertations [16, 24, 30, 34, 38, 45, 49, 61, 64, 69, 71, 73, 74, 78, 99, 107, 117, 163, 171].
Participants in the studies ranged from 10 to 101 years of age. Although the focus of the review was on those aged 18 and over, studies that had a range of ages less than 18 years were not excluded as long as the mean age of the sample was over 18 years. Sample sizes ranged from a low of six [21] to a high of 2,721 in Craig et al.'s work that assessed the validity of the International Physical Activity Questionnaire (IPAQ) [35]. There were a greater number of studies reporting on female-only data than studies reporting on male-only data.
A total of five direct measures were used in the assessment of physical activity and included: accelerometers, DLW, indirect calorimetry, HRM, and pedometers. Of the studies included in the synthesis of directly comparable data (Table 2), accelerometers were the most frequently used direct measure and indirect calorimetry was the least used. A variety of self-report measures were employed, but the seven-day physical activity recall (7-day PAR) [189] was the most cited. Over half of the studies reported that the self-report and directly assessed physical activity levels were measured over the same length of time (e.g. seven days) and over the same period of time (i.e. no time lag between measurements). A considerable number of studies also took both measurements over the same period of time, but for different lengths of time (e.g. self-report over seven days, directly measured over three days). Eleven of the studies in Table 2 lacked any mention of time [59, 131, 135, 138, 143, 159, 160, 164, 177, 178, 183].
Risk of bias assessment
Risk of bias was assessed for all included studies (n = 173) including those reporting only correlation data. The range of items met on the modified Downs and Black tool was 8 to 15 (maximum possible count was 15) with a mean of 11.7 ± 1.2. Results of the risk of bias assessment indicated that 38% (65/173) of the studies had lower quality (based on a median split count of < 12/15). All studies were given maximum points for describing study objectives. All but one study scored maximum points for describing the main outcomes to be measured and the interventions used (including comparison methods between measures). Although most studies carried out some sort of significance testing on results, most did not report the actual probability values associated with the estimates or their associated measures of random variability (e.g. standard error or confidence intervals). Most studies obtained a high number of items on the reporting section (maximum count of 8) with a mean of 6.9 ± 0.9.
The external validity section of the risk of bias assessment had a maximum count of three and consisted of reporting on the representativeness of the subjects and the testing conditions. Almost all of the studies (166/173) reported that the staff, places and facilities where the participants were tested were representative of the testing conditions that would be expected by most individuals (e.g. real-life and free-living situations). However, 87% (151/173) of the studies did not report on the representativeness of the subjects asked to participate in the study and 95% (165/173) of the studies failed to report on the representativeness of those who were prepared to participate (enrolled) compared to the entire population from which they were recruited (received a score of 0). As a result, the external validity ratings of most studies were poor with a mean of 1.1 ± 0.5.
In order to obtain the maximum number of items (four) in the internal validity section, studies must have reported whether any of the results of the study were based on "data dredging", whether the analyses adjusted for any time lag between the two measurements or different lengths of follow-up, whether the statistical tests used to assess the main outcomes were appropriate, and whether the main outcome measures were accurate (valid and reliable). Internal validity item counts were generally high with the majority of studies having obtained a four.
A qualitative analysis was conducted on the top seven (scores of 14 and 15 out of 15) and lowest seven studies (8 and 9 out of 15) based on scores from the risk of bias assessment. No conclusive patterns were identified from this analysis. The results from the accelerometer studies were further examined, as this was the only group of studies with a good distribution of low and high quality studies based on the accelerometer median split of bias scores. Findings from this analysis did not identify any clear patterns in the differences in agreement between physical activity measured by self-report compared to accelerometer when grouped by low and high quality.
Data synthesis
One hundred and forty-eight studies [10, 11, 13–157, 190] reported correlation statistics between self-report and direct measurements of physical activity. Figure 2 is a plot of all extracted correlations and shows that overall, there is no clear trend in the degree of correlation between self-reported and directly measured physical activity, regardless of the direct method employed. Overall, correlations were low-to-moderate with a mean of 0.37 (SD = 0.25) and a range of -0.71 to 0.98. Mean correlations were higher in studies reporting results for males-only (r = 0.47) versus studies reporting results for females-only (r = 0.36), but with very similar ranges (males: -0.17 to 0.93 vs. females: -0.17 to 0.95).
Seventy-four studies contained comparable data on the measurement of physical activity based on self-report and directly measured values. Table 2 describes these studies and their subcomponents. Percent mean differences were calculated for all of these studies and are presented as forest plots in Figures 3 to 8. Negative values indicate that self-report estimates were lower than the amount of physical activity assessed by direct methods while positive values indicate values that are higher. Sixty percent of the percent mean differences indicated that self-reported physical activity estimates were higher than those measured by direct methods.
Studies with extreme percent mean differences (≥ 400%) were removed from the forest plots for clarity purposes [11, 139, 151, 181]. All outlying data were from studies where physical activity was categorized by level of exertion (e.g. easy, moderate, vigorous) and outliers represent physical activity data categorized as vigorous or of high energy expenditure. While not all data categorized as vigorous had percent mean differences ≥ 400%, a pattern emerged whereby percent mean differences between the self-report and direct measures were larger for vigorous levels of physical activity than for light or moderate activities [11, 44, 56, 134, 139, 151, 175, 181, 182].
Percent mean differences were examined separately for the five different direct measures. Accelerometers were the most used direct measure. Self-report measures of physical activity were generally higher than those directly measured by accelerometers (Figures 3 to 5). Studies reporting data for males and females combined (n = 58) had a mean percent difference of 44% (range: -78% to 500%), with similar findings for the male-only data (n = 32) (mean: 44%, range: -100% to 425%). However, female-only data (n = 60) identified that, on average, females self-reported higher levels of physical activity compared to accelerometers with a mean percent difference of 138% (range: -100% to 4024%).
The second-most common direct measure employed was DLW and comparable data with self-report measures are presented in Figures 6 to 8. Studies reporting on combined male and female data (n = 6) indicated that self-report measures of physical activity were lower when compared to DLW measures with a mean percent difference of -9% and a range of -1% to -26%. Results for male-only (n = 16) and female-only (n = 23) data were less distinct with percent mean differences and ranges of -4.5% (-78% to 37%) and 7% (-58% to 113%), respectively.
A greater number of HRM and self-report comparisons were observed for studies with both male and female participants (n = 11) or female-only populations (n = 13) versus male-only populations (n = 3). Female-only results showed a general trend toward higher levels of self-reported physical activity (mean 11%, range: -5% to 45%), while the male-only (mean -9%, range: -24% to 5%) and combined (mean -2%, range: -21% to 67%) data had a greater number of studies with lower self-reported physical activity levels when compared to results of HRM.
Pedometers and indirect calorimetry were the least commonly used direct measures for studies with comparable data. There were a total of eight comparisons from four studies for pedometers and 15 from two studies for indirect calorimetry (Figures 6 to 8) making it difficult to draw conclusions with regard to patterns of agreement between the self-report and direct measures. However, seven [19, 75, 76, 167] of the eight pedometer comparisons reported higher levels of physical activity by self-report when compared to the pedometer results. The eighth comparison [19] which involved female-only data saw no difference between the two measures. The indirect calorimetry results were less straightforward and presented no obvious patterns in agreement.
Subgroups were qualitatively examined to assess whether any differences existed in the degree of agreement between self-reported and directly measured physical activity. No clear patterns emerged within studies reporting on elderly (range or mean ≥ 65 years) populations [23, 73, 77, 92, 105, 116, 174] or within studies reporting on different time lags and periods of measurement. Few studies with comparable data reported exclusively on overweight/obese populations, but amongst those captured, the majority of studies reported higher levels of physical activity by self-report compared to the direct measures [139, 143, 148, 163–165, 172]. However, it was not possible to compare the overweight/obese percent mean differences to those reported in general populations.
Meta-analyses were not possible due to the substantial heterogeneity in units of reporting for physical activity measured by the various self-report and direct methods across the studies, and the significant lack of data with comparable units across measures. As a result, we were unable to determine the sensitivity of the values and the associated measures of error for the studies. Overall effect sizes to summarize the magnitude of discrepancy across the various measures of physical activity could therefore not be calculated.
Discussion
To the authors' knowledge this review represents the most comprehensive attempt to examine the relationship between self-report and directly measured estimates of adult physical activity in the international literature. Risk of bias was assessed and identified that just over one third of the studies had lower quality based on their description of the methods and external and internal validity. Overall, no clear trends emerged in the over- or underreporting of physical activity by self-report compared to direct methods. However, some results suggest that patterns in the agreement between self-report and direct measures of physical activity may exist, but they are likely to differ depending on the direct methods used for comparison and the sex of the population sampled. Interestingly, among studies that categorized physical activity by level of exertion (e.g. light, moderate, vigorous), the mean percent differences between the self-report and direct measures increased with intensity and were largest for vigorous physical activity. These larger differences may reflect a problem with self-report measures attempting to capture higher levels of physical activity, or problems with participant interpretation and recall.
Many of the studies tested the relationship between self-report and direct measures by using a correlation coefficient, but this approach is limited: correlation measures only the strength of the relationship between two variables, so it cannot assess the level of agreement between them and ignores any bias in the data [191]. A more useful approach, the Bland-Altman method, provides a means for assessing the level of agreement between self-report and direct measures by deriving the mean difference between the two measures and the limits of agreement. If the two measures possess good agreement and measure the same parameter of physical activity, then the cheaper and less invasive self-report methods may be valid substitutes for direct methods.
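As an illustration of the Bland-Altman approach, the sketch below computes the mean difference (bias) and limits of agreement for paired measurements. It assumes the conventional 95% limits of mean difference ± 1.96 standard deviations of the differences. The function name and the sample values are hypothetical and are not drawn from any study in this review.

```python
from statistics import mean, stdev


def bland_altman(self_report, direct):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the paired differences (self-report minus direct);
    the limits are bias +/- 1.96 sample standard deviations of the differences.
    """
    if len(self_report) != len(direct):
        raise ValueError("paired measurements must have equal length")
    if len(self_report) < 2:
        raise ValueError("at least two pairs are needed")
    diffs = [s - d for s, d in zip(self_report, direct)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical paired values (minutes/day) for five participants.
self_report = [40.0, 55.0, 30.0, 70.0, 45.0]
direct = [30.0, 50.0, 35.0, 50.0, 40.0]
bias, lower, upper = bland_altman(self_report, direct)
print(f"bias={bias:.1f}, limits of agreement=({lower:.1f}, {upper:.1f})")
```

Unlike a correlation coefficient, the bias term exposes systematic over- or under-reporting, and the width of the limits shows how far an individual's self-report may plausibly deviate from the direct measure.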
A meta-analysis would have allowed us to estimate the overall effect sizes for each of the direct measures and undertake a sensitivity analysis to further understand the degree of bias in the studies. Unfortunately, inconsistent methods and reporting among the studies included made such an analysis methodologically inappropriate. Further research in this area would benefit from greater consistency in the units of reporting and the methods used to facilitate comparisons. For instance, many studies did not report results using the same units, so estimates of agreement between the self-report and direct measures could not be computed. There was also an inconsistency in the number of days measured and the time lag between the self-report and direct measures. It is recommended that authors present their results using the same units for both measures (e.g. minutes/day, kcal/day), that the two measurements assess physical activity for and over the same time period, and that all relevant data including a mean and measurement of variance (i.e. standard deviation, standard error) be included in all reports.
Adhering to consistent reporting criteria would increase the comparability of results across studies and enable the calculation of overall effect sizes. At the population level, over- or underestimation of physical activity prevalence has important implications as these data are used to monitor physical activity trends, determine spending for research and physical activity interventions and programming, and to estimate physical inactivity-related risks of disease. Future studies may wish to refer to the updated Compendium of Physical Activities [192] which provides a coding scheme to classify physical activity by rate of energy expenditure. The Compendium offers a means to increase the comparability of results between self-report and direct measures, as well as across studies.
The lack of a clear trend amongst the differences between the self-report methods for assessing physical activity and the more robust direct methods is of concern, especially when trying to establish whether the measures could be used interchangeably. There are several possible explanations for the lack of a clear trend in the data. Many self-report instruments (such as the 7-day PAR) may not have the ability to account for activities of less than 10 minutes in duration or those with a level of exertion lower than brisk walking [193], whereas some of the direct methods (such as DLW) may capture all forms of physical movement. However, it is important to recognize that other direct measures such as accelerometers are unable to capture certain types of activities such as swimming and activities involving the use of upper extremities. Our findings demonstrate the inherent difficulty self-report measures possess when trying to accurately capture data at various levels of exertion. Relative to direct measures, self-report methods appear to overestimate physical activity more at higher (i.e. vigorous) intensities than at low-to-moderate intensities.
Just as some self-report measures cannot capture all forms of activity, some direct measures may capture non-physical activity. For instance, the DLW technique is an accurate assessment of total energy expenditure, but it captures not only physical activity but all forms of energy expenditure, including resting energy expenditure and the thermogenic effect of food. DLW is therefore expected to overestimate physical activity unless corrections are made. These and other measurement errors may inflate the between-individual variability in the energy expended in physical activity [194]. Finally, direct methods may be too sensitive to small errors derived from the various calibration methods employed and the equations used to define and categorize physical activity.
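One way such a correction might look is sketched below: activity energy expenditure is estimated by subtracting resting energy expenditure and the thermogenic effect of food from DLW-derived total energy expenditure. The 10% share attributed to the thermogenic effect of food is a commonly used approximation and an assumption here, not a value reported in this review. The function name and example figures are hypothetical.

```python
def activity_energy_expenditure(tee_kcal: float, ree_kcal: float,
                                tef_fraction: float = 0.10) -> float:
    """Estimate physical activity energy expenditure in kcal/day.

    tee_kcal: total energy expenditure (e.g. from doubly labeled water).
    ree_kcal: resting energy expenditure.
    tef_fraction: assumed share of total expenditure due to the
        thermogenic effect of food (0.10 is an assumption, not a measured value).
    """
    if not 0.0 <= tef_fraction < 1.0:
        raise ValueError("tef_fraction must be in [0, 1)")
    return tee_kcal - ree_kcal - tef_fraction * tee_kcal


# Hypothetical participant: 2500 kcal/day total, 1500 kcal/day resting.
paee = activity_energy_expenditure(2500.0, 1500.0)
print(f"estimated activity energy expenditure: {paee:.0f} kcal/day")
```

Comparing a self-report estimate with the uncorrected total would overstate the apparent under-reporting, which is why the choice of correction should be stated alongside any DLW comparison.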
It is important to take into account all of these factors when comparing self-report and direct measures of physical activity. In specific circumstances (e.g. at different levels of activity) these two methods may not be comparable as they are not able to capture the same parameters of physical activity. Self-report measures may not be able to accurately capture all levels of activity, but they may be able to capture how difficult an individual perceives an activity to be and the type of activity that is undertaken (e.g. leisure, work, transportation). Direct measures, on the other hand, may be more able to capture some of the information not captured in self-report methods (e.g. incidental daily movement and lower intensity activities), but also possess their own limitations such as the inability to capture arm movements and various types of physical activity (e.g. swimming).
Concerns regarding the discrepancy between self-reported and directly measured physical activity were recently reported by Troiano and colleagues, who examined data from the 2003–2004 National Health and Nutrition Examination Survey (NHANES), which contained the first direct measurements of physical activity in a nationally representative U.S. sample [195]. They compared self-reported adherence estimates of physical activity recommendations with those directly measured by accelerometer. Their findings identified that self-reported adherence estimates were much higher than those measured by accelerometer. The authors hypothesize that the overestimation may be a result of respondents misclassifying sedentary or light activity as moderate or from underestimations of activity duration by the accelerometers.
Other factors, such as those related to the population under study, may influence the ability of self-report and direct methods to capture the same measurement. For example, our findings show that in studies with a focus on overweight/obese individuals, self-reported physical activity was overestimated in all cases except for DLW studies involving combined male/female and male-only data. Our results differed from those reported by Irwin, Ainsworth and Conway (2001) [58]. Their study included 24 males and used DLW to compare energy expenditure estimates with those obtained by physical activity record and the 7-day PAR. The investigators observed an overestimation of energy expenditure in participants with higher body fat using the physical activity record, but not the 7-day PAR. A comparison of the same sample by body mass index (BMI) showed that those with a BMI ≥ 25 kg/m² overestimated energy expenditure on both the physical activity record and the 7-day PAR. Consistent with the trends within our accelerometer data, a recent study (published after our search) of 154 subjects compared a physical activity questionnaire to accelerometry data and found that the accuracy of the questionnaire was higher for males than for females and for those with a lower BMI [196]. It is likely that a response bias exists due to social desirability and that it influences the degree of over-reporting of physical activity by overweight/obese individuals. Future research and synthesis are needed to identify whether such a bias does in fact exist and, if so, whether and to what extent it differs by gender.
This review had limitations that should be considered when examining the results. First, the sample was limited to studies that included directly comparable data between self-report and direct measures (same units for both measures) or a comparison by way of correlation. Access to primary data from each study was not feasible; therefore, we relied upon reported comparisons and the means of measured physical activity. This reduced the number of studies with reported measures of physical activity by both self-report and direct methods and limited our ability to accurately assess the degree of agreement between the two measures. However, when possible we converted non-comparable units to increase the number of studies used. Second, the review did not assess the agreement between proxy-reported physical activity and direct measures. Proxy-report data are less prevalent but are an important means of assessing physical activity in sub-populations, such as those who are chronically ill, disabled, or elderly, who are unable to self-report their own physical activity levels. Further research is required to assess the validity of proxy-report measures of physical activity when compared to direct methods. Finally, this review did not distinguish among differences in study protocols related to calibration, cut-points, or collection of the measurements, or among other population-specific characteristics.
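When only reported means are available, one simple way to express the gap between the two methods is a percent mean difference relative to the direct measure. The sketch below is an illustrative assumption rather than the procedure documented in this review; the function name and example values are hypothetical, and both means must already be in the same units.

```python
def percent_mean_difference(self_report_mean, direct_mean):
    """Percent difference of a self-report mean relative to a direct mean.

    Positive values indicate that self-report is higher than the direct
    measure; negative values indicate that it is lower. Both inputs must
    be expressed in the same units (e.g. min/day or kcal/day).
    """
    if direct_mean == 0:
        raise ValueError("direct_mean must be non-zero")
    return (self_report_mean - direct_mean) / direct_mean * 100.0


# Hypothetical study means in minutes per day of moderate activity.
examples = [
    ("study A", 45.0, 30.0),  # self-report higher than accelerometer
    ("study B", 20.0, 25.0),  # self-report lower than accelerometer
]
for label, self_report, direct in examples:
    diff = percent_mean_difference(self_report, direct)
    direction = "higher" if diff > 0 else "lower"
    print(f"{label}: self-report is {abs(diff):.1f}% {direction}")
```

Because the sign is kept, a summary of this kind can show differences in both directions rather than averaging them away.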
Conclusion
In conclusion, this review provides an objective summary of the difference between physical activity levels assessed via self-report methods and directly measured physical activity. The results may assist researchers considering the use of self-report or direct measurement methods and serve as a note of caution that self-reported and directly measured physical activity can differ greatly. Overall, there were no clear trends in the degree to which physical activity measured by self-report and by direct measures differed. The strength of trends differed by the direct method employed and by the gender of the population sampled. One-third of the studies were of poor quality, with most failing to report actual probabilities, measures of variability for estimates, or the representativeness of their samples. The costs and benefits of direct measurement need to be considered in any study in order to determine whether the added resources required for personnel training and laboratory analyses justify the possible increase in the precision of results. At this time, it is not possible to draw any definitive conclusions concerning the validity of self-report measurements compared to various direct methods, but caution should be exercised when comparing studies across methods.
References
Caspersen CJ, Powell KE, Christenson GM: Physical activity, exercise, and physical fitness: definitions and distinctions for health-related research. Public Health Rep. 1985, 100: 126-131.
Katzmarzyk PT, Gledhill N, Shephard RJ: The economic burden of physical inactivity in Canada. CMAJ. 2000, 163: 1435-1440.
Dishman RK, Washburn RA, Schoeller DA: Measurement of physical activity. QUEST. 2001, 53: 295-309.
Shephard RJ: Limits to the measurement of habitual physical activity by questionnaires. Br J Sports Med. 2003, 37: 197-206. 10.1136/bjsm.37.3.197.
Ainslie PN, Reilly T, Westerterp KR: Estimating human energy expenditure: a review of techniques with particular references to doubly labelled water. Sports Med. 2003, 33: 683-698. 10.2165/00007256-200333090-00004.
Adamo KB, Prince SA, Tricco AC, Connor Gorber S, Tremblay M: A comparison of direct versus self-report measures for assessing physical activity in the pediatric population: a systematic review. Int J Pediatr Obes.
Downs SH, Black N: The feasibility of creating a checklist for the assessment of the methodological quality of both randomised and non-randomised studies of health care intervention. J Epidemiol Community Health. 1998, 52: 377-384.
Saunders DL, Soomro GM, Buckingham J, Jamtvedt G, Raina P: Assessing the methodological quality of nonrandomized intervention studies. West J Nurs Res. 2003, 25: 223-237. 10.1177/0193945902250039.
Deeks JJ, Dinnes JD, D'Amico R, Sowden AJ, Sakarovitch C, Song F, Petticrew M, Altman DG: Evaluating non-randomised intervention studies. Health Technol Assess. 2003, 7 (27): 1-173.
Aadahl M, Jorgensen T: Validation of a new self-report instrument for measuring physical activity. Med Sci Sports Exerc. 2003, 35: 1196-1202. 10.1249/01.MSS.0000074446.02192.14.
Adams SA: Methodological and substantive issues: The relationship between physical activity and breast cancer. PhD Thesis. 2003, University of South Carolina
Adams SA, Matthews CE, Ebbeling CB, Moore CG, Cunningham JE, Fulton J, Hebert JR: The effect of social desirability and social approval on self-reports of physical activity. Am J Epidemiol. 2005, 161: 389-398. 10.1093/aje/kwi054.
Ainsworth BE, Richardson MT, Jacobs DR, Leon AS, Sternfeld B: Accuracy of recall of occupational physical activity by questionnaire. J Clin Epidemiol. 1999, 52: 219-227. 10.1016/S0895-4356(98)00158-9.
Ainsworth BE, Bassett DR, Strath SJ, Swartz AM, O'Brien WL, Thompson RW, Jones DA, Macera CA, Kimsey CD: Comparison of three methods for measuring the time spent in physical activity. Med Sci Sports Exerc. 2000, 32: S457-S464. 10.1097/00005768-200009001-00004.
Ainsworth BE, Sternfeld B, Richardson MT, Jackson K: Evaluation of the Kaiser Physical Activity Survey in women. Med Sci Sports Exerc. 2000, 32: 1327-1338. 10.1097/00005768-200007000-00022.
Armstrong CA: The stages of change in exercise adoption and adherence: Evaluation of measures with self-report and objective data. PhD Thesis. 1998, University of California, San Diego and San Diego State University
Atienza AA, King AC: Comparing self-reported versus objectively measured physical activity behavior: a preliminary investigation of older Filipino American women. Res Q Exerc Sport. 2005, 76 (3): 358-362.
Bassett DR, Schneider PL, Huntington GE: Physical activity in an Old Order Amish community. Med Sci Sports Exerc. 2004, 36: 79-85. 10.1249/01.MSS.0000106184.71258.32.
Bassett DR, Cureton AL, Ainsworth BE: Measurement of daily walking distance-questionnaire versus pedometer. Med Sci Sports Exerc. 2000, 32: 1018-1023. 10.1097/00005768-200005000-00021.
Bernstein M, Sloutskis D, Kumanyika S, Sparti A, Schutz Y, Morabia A: Data-based approach for developing a physical activity frequency questionnaire. Am J Epidemiol. 1998, 147: 147-154.
Bisgaard T, Kjaersgaard M, Bernhard A, Kehlet H, Rosenberg J: Computerized monitoring of physical activity and sleep in postoperative abdominal surgery patients. J Clin Monitor Comp. 1999, 15: 1-8. 10.1023/A:1009930026753.
Bjorgaas M, Vik JT, Saeterhaug A, Langlo L, Sakshaug T, Mohus RM, Grill V: Relationship between pedometer-registered activity, aerobic capacity and self-reported activity and fitness in patients with type 2 diabetes. Diabetes Obes Metab. 2005, 7: 737-744. 10.1111/j.1463-1326.2004.00464.x.
Bonnefoy M, Normand S, Pachiaudi C, Lacour JR, Laville M, Kostka T: Simultaneous validation of ten physical activity questionnaires in older men: a doubly labeled water study. J Am Geriatr Soc. 2001, 49: 28-35. 10.1046/j.1532-5415.2001.49006.x.
Brach JS: The relation of physical activity to functional status over a seventeen-year time period in community-dwelling older women. PhD Thesis. 2000, University of Pittsburgh
Brown WJ, Ringuet C, Trost SG: How active are young adult women?. Health Promot J Aust. 2002, 23-28.
Bulley C, Donaghy M, Payne A, Mutrie N: Validation and modification of the Scottish Physical Activity Questionnaire for use in a female student population. Int J Health Promot Edu. 2005, 43.
Carter-Nolan PL, Adams-Campbell LL, Makambi K, Lewis S, Palmer JR, Rosenberg L: Validation of Physical Activity Instruments: Black Women's Health Study. Ethn Dis. 2006, 16 (4): 943-947.
Cauley JA, Kriska AM, LaPorte RE, Sandler RB, Pambianco G: A two year randomized exercise trial in older women: effects on HDL-cholesterol. Atherosclerosis. 1987, 66: 247-258. 10.1016/0021-9150(87)90068-2.
Chasan-Taber L, Schmidt MD, Roberts DE, Hosmer D, Markenson G, Freedson PS: Development and validation of a Pregnancy Physical Activity Questionnaire. Med Sci Sports Exerc. 2004, 36: 1750-1760. 10.1249/01.MSS.0000142303.49306.0D.
Chen AH-W: The effectiveness of a home-based intervention to promote walking in ethnic minority women. PhD Thesis. 1995, University of California, San Diego and San Diego State University
Conn VS, Minor MA, Mehr DR, Burks KJ: Recording activity in older women with TriTrac. Am J Health Behav. 2000, 24: 370-378.
Conway JM, Irwin ML, Ainsworth BE: Estimating energy expenditure from the Minnesota Leisure Time Physical Activity and Tecumseh Occupational Activity questionnaires – a doubly labeled water validation. J Clin Epidemiol. 2002, 55: 392-399. 10.1016/S0895-4356(01)00497-8.
Conway JM, Seale JL, Jacobs DR, Irwin ML, Ainsworth BE: Comparison of energy expenditure estimates from doubly labeled water, a physical activity questionnaire, and physical activity records. Am J Clin Nutr. 2002, 75: 519-525.
Cook TC: Epidemiology of fitness in the elderly: a twin approach. PhD Thesis. 1986, University of Pittsburgh
Craig CL, Marshall AL, Sjostrom M, Bauman AE, Booth ML, Ainsworth BE, Pratt M, Ekelund U, Yngve A, Sallis JF, et al: International physical activity questionnaire: 12-country reliability and validity. Med Sci Sports Exerc. 2003, 35: 1381-1395. 10.1249/01.MSS.0000078924.61453.FB.
Davies SW, Jordan SL, Lipkin DP: Use of limb movement sensors as indicators of the level of everyday physical activity in chronic congestive heart failure. Am J Cardiol. 1992, 69 (19): 1581-1586. 10.1016/0002-9149(92)90707-6.
De Abajo S, Larriba R, Marquez S: Validity and reliability of the Yale Physical Activity Survey in Spanish elderly. J Sports Med Phys Fitness. 2001, 41 (4): 479-485.
del Aguila MA: Assessment of physical activity in patients with diabetes. PhD Thesis. 1998, University of Washington
Dinger MK: Reliability and convergent validity of the National College Health Risk Behavior Survey Physical Activity items. Am J Health Educ. 2003, 34: 162-166.
Dinger MK, Oman RF, Taylor EL, Vesely SK, Able J: Stability and convergent validity of the Physical Activity Scale for the Elderly (PASE). J Sports Med Phys Fitness. 2004, 44 (2): 186-192.
Dishman RK, Darracott CR, Lambert LT: Failure to generalize determinants of self-reported physical activity to a motion sensor. Med Sci Sports Exerc. 1992, 24 (8): 904-910.
Dubbert PM, Weg Vander MW, Kirchner KA, Shaw B: Evaluation of the 7-day physical activity recall in urban and rural men. Med Sci Sports Exerc. 2004, 36 (9): 1646-1654. 10.1249/01.MSS.0000139893.65189.F2.
Dubbert PM, White JD, Grothe KB, O'Jile J, Kirchner KA: Physical activity in patients who are severely mentally ill: feasibility of assessment for clinical and research applications. Arch Psychiatr Nurs. 2006, 20: 205-209. 10.1016/j.apnu.2006.04.002.
Ekelund U, Sepp H, Brage S, Becker W, Jakes R, Hennings M, Wareham NJ: Criterion-related validity of the last 7-day, short form of the International Physical Activity Questionnaire in Swedish adults. Public Health Nutr. 2006, 9: 258-265. 10.1079/PHN2005840.
Elmore BG: An evaluation of five physical activity assessment methods in a group of women. PhD Thesis. 1989, University of Illinois at Urbana-Champaign
Evangelista LS, Dracup K, Doering L, Moser DK, Kobashigawa J: Physical activity patterns in heart transplant women. J Cardiovasc Nurs. 2005, 20: 334-339.
Faulkner G, Cohn T, Remington G: Validation of a physical activity assessment tool for individuals with schizophrenia. Schizophr Res. 2006, 82: 225-231. 10.1016/j.schres.2005.10.020.
Friedenreich CM, Courneya KS, Neilson HK, Matthews CE, Willis G, Irwin M, Troiano R, Ballard-Barbash R: Reliability and validity of the Past Year Total Physical Activity Questionnaire. Am J Epidemiol. 2006, 163: 959-970. 10.1093/aje/kwj112.
Fu LL: Health status, functional state and physical activity level in community dwelling elderly women. PhD Thesis. 1995, University of Pittsburgh
Gardner AW, Montgomery PS: The Baltimore activity scale for intermittent claudication: a validation study. Vascular & Endovascular Surgery. 2006, 40: 383-391. 10.1177/1538574406288575.
Ginis KA, Latimer AE, Hicks AL, Craven BC: Development and evaluation of an activity measure for people with spinal cord injury. Med Sci Sports Exerc. 2005, 37: 1099-1111. 10.1249/01.mss.0000170127.54394.eb.
Gretler DD, Carlson GF, Montano AV, Murphy MB: Diurnal blood pressure variability and physical activity measured electronically and by diary. Am J Hypertens. 1993, 6: 127-133.
Hagfors L, Westerterp K, Skoldstam L, Johansson G: Validity of reported energy expenditure and reported intake of energy, protein, sodium and potassium in rheumatoid arthritis patients in a dietary intervention study. Eur J Clin Nutr. 2005, 59: 238-245. 10.1038/sj.ejcn.1602064.
Hagstromer M, Oja P, Sjostrom M: The International Physical Activity Questionnaire (IPAQ): a study of concurrent and construct validity. Public Health Nutr. 2006, 9: 755-762. 10.1079/PHN2005898.
Harada ND, Chiu V, King AC, Stewart AL: An evaluation of three self-report physical activity instruments for older adults. Med Sci Sports Exerc. 2001, 33: 962-970. 10.1097/00005768-200105001-01412.
Hayden-Wade HA, Coleman KJ, Sallis JF, Armstrong C: Validation of the telephone and in-person interview versions of the 7-day PAR. Med Sci Sports Exerc. 2003, 35: 801-809. 10.1249/01.MSS.0000064941.43869.4E.
Iqbal R, Rafique G, Badruddin S, Qureshi R, Gray-Donald K: Validating MOSPA questionnaire for measuring physical activity in Pakistani women. Nutr J. 2006, 5: 18. 10.1186/1475-2891-5-18.
Irwin ML, Ainsworth BE, Conway JM: Estimation of energy expenditure from physical activity measures: determinants of accuracy. Obes Res. 2001, 9: 517-525. 10.1038/oby.2001.68.
Jacobs DR, Ainsworth BE, Hartman TJ, Leon AS: A simultaneous evaluation of 10 commonly used physical activity questionnaires. Med Sci Sports Exerc. 1993, 25: 81-91. 10.1249/00005768-199301000-00012.
Jakes RW, Day NE, Luben R, Welch A, Bingham S, Mitchell J, Hennings S, Rennie K, Wareham NJ: Adjusting for energy intake – what measure to use in nutritional epidemiological studies?. Int J Epidemiol. 2004, 33: 1382-1386. 10.1093/ije/dyh181.
Jansen L: The effect of exercise on explanatory style in HIV-infected men. PhD Thesis. 2002, United States International University
Johansen KL, Chertow GM, Ng AV, Mulligan K, Carey S, Schoenfeld PY, Kent-Braun JA: Physical activity levels in patients on hemodialysis and healthy sedentary controls. Kidney International. 2000, 57: 2564-2570. 10.1046/j.1523-1755.2000.00116.x.
Johansen KL, Painter P, Kent-Braun JA, Ng AV, Carey S, Da SM, Chertow GM: Validation of questionnaires to estimate physical activity and functioning in end-stage renal disease. Kidney International. 2001, 59.
Johnson-Kozlow MF: Validity and measurement bias in three self-report measures of physical activity among women diagnosed with breast cancer. PhD Thesis. 2003, University of California, San Diego
Johnson-Kozlow M, Rock CL, Gilpin EA, Hollenbach KA, Pierce JP: Validation of the WHI brief physical activity questionnaire among women diagnosed with breast cancer. Am J Health Behav. 2007, 31: 193-202.
King WC, Brach JS, Belle S, Killingsworth R, Fenton M, Kriska AM: The relationship between convenience of destinations and walking levels in older women. Am J Health Promot. 2003, 18: 74-82.
Kolbe-Alexander TL, Lambert EV, Harkins JB, Ekelund U: Comparison of two methods of measuring physical activity in South African older adults. J Aging Phys Act. 2006, 14 (1): 98-114.
Krishnamoorthy JS: The transmission of physical activity and related cognitions among African-American adolescent females and their primary female caregiver. Dissertation Abstracts International: Section B: The Sciences and Engineering. 2002, 63.
Kriska AM: Assessment of current and historical physical activity in the Pima Indians. PhD Thesis. 1988, University of Pittsburgh
Kriska AM, Knowler WC, LaPorte RE, Drash AL, Wing RR, Blair SN, Bennett PH, Kuller LH: Development of questionnaire to examine relationship of physical activity and diabetes in Pima Indians. Diabetes Care. 1990, 13: 401-411. 10.2337/diacare.13.4.401.
Lambert P: Physical activity and the oldest-old: A comparison of self-report and accelerometer readings. PhD Thesis. 2006, University of Manitoba (Canada)
LaPorte RE, Black-Sandler R, Cauley JA, Link M, Bayles C, Marks B: The assessment of physical activity in older women: analysis of the interrelationship and reliability of activity monitoring, activity surveys, and caloric intake. J Gerontol. 1983, 38: 394-397.
Lasuzzo J: Ability of the California State University, Fullerton Physical Activity Questionnaire to assess physical activity in older individuals. PhD Thesis. 2004, The University of Alabama
Leenders NYJM: Evaluation of methods to assess physical activity. PhD Thesis. 1998, The Ohio State University
Leenders NYJM, Sherman WM, Nagaraja HN: Comparisons of four methods of estimating physical activity in adult women. Med Sci Sports Exerc. 2000, 32: 1320-1326. 10.1097/00005768-200007000-00021.
Leenders NY, Sherman WM, Nagaraja HN, Kien CL: Evaluation of methods to assess physical activity in free-living conditions. Med Sci Sports Exerc. 2001, 33 (7): 1233-1240.
Lemmer JT, Ivey FM, Ryan AS, Martel GF, Hurlbut DE, Metter JE, Fozard JL, Fleg JL, Hurley BF: Effect of strength training on resting metabolic rate and physical activity: age and gender comparisons. Med Sci Sports Exerc. 2001, 33: 532-541.
Lewis JM: A phenomenological analysis of the physical activity experiences of youth and their parents. PhD Thesis. 2005, Dalhousie University (Canada)
Lindseth G, Vari P: Measuring physical activity during pregnancy. West J Nurs Res. 2005, 27: 722-734. 10.1177/0193945905276523.
Lof M, Hannestad U, Forsum E: Comparison of commonly used procedures, including the doubly-labelled water technique, in the estimation of total energy expenditure of women with special reference to the significance of body fatness. Br J Nutr. 2003, 90: 961-968. 10.1079/BJN2003975.
Lowther M, Mutrie N, Loughlan C, McFarlane C: Development of a Scottish physical activity questionnaire: a tool for use in physical activity interventions. Br J Sports Med. 1999, 33: 244-249.
Macfarlane DJ, Lee CC, Ho EY, Chan KL, Chan D: Convergent validity of six methods to assess physical activity in daily life. J Appl Physiol. 2006, 101: 1328-1334. 10.1152/japplphysiol.00336.2006.
Mader U, Martin BW, Schutz Y, Marti B: Validity of four short physical activity questionnaires in middle-aged persons. Med Sci Sports Exerc. 2006, 38: 1255-1266. 10.1249/01.mss.0000227310.18902.28.
Mahabir S, Baer DJ, Giffen C, Clevidence BA, Campbell WS, Taylor PR, Hartman TJ: Comparison of energy expenditure estimates from 4 physical activity questionnaires with doubly labeled water estimates in postmenopausal women. Am J Clin Nutr. 2006, 84: 230-236.
Martinez-Gonzalez MA, Lopez-Fontana C, Varo JJ, Sanchez-Villegas A, Martinez JA: Validation of the Spanish version of the physical activity questionnaire used in the Nurses' Health Study and the Health Professionals' Follow-up Study. Public Health Nutr. 2005, 8: 920-927. 10.1079/PHN2005745.
Masse LC, Fulton JE, Watson KL, Mahar MT, Meyers MC, Wong WW: Influence of body composition on physical activity validation studies using doubly labeled water. J Appl Physiol. 2004, 96: 1357-1364. 10.1152/japplphysiol.00901.2003.
Masse LC, Eason KE, Tortolero SR, Kelder SH: Comparing participants' rating and compendium coding to estimate physical activity intensities. Meas Phys Edu Exer Sci. 2005, 9.
Matthews CE, Freedson PS: Field trial of a three-dimensional activity monitor: comparison with self report. Med Sci Sports Exerc. 1995, 27: 1071-1078. 10.1249/00005768-199507000-00017.
Matthews CE, Freedson PS, Hebert JR, Stanek EJ, Merriam PA, Ockene IS: Comparing physical activity assessment methods in the Seasonal Variation of Blood Cholesterol Study. Med Sci Sports Exerc. 2000, 32: 976-984. 10.1097/00005768-200005000-00015.
Matthews CE, Ainsworth BE, Hanby C, Pate RR, Addy C, Freedson PS, Jones DA, Macera CA: Development and testing of a short physical activity recall questionnaire. Med Sci Sports Exerc. 2005, 37: 986-994.
Matthews CE, Wilcox S, Hanby CL, Der AC, Heiney SP, Gebretsadik T, Shintani A: Evaluation of a 12-week home-based walking intervention for breast cancer survivors. Support Care Cancer. 2007, 15: 203-211. 10.1007/s00520-006-0122-x.
McDermott MM, Liu K, O'Brien E, Guralnik JM, Criqui MH, Martin GJ, Greenland P: Measuring physical activity in peripheral arterial disease: a comparison of two physical activity questionnaires with an accelerometer. Angiology. 2000, 51: 91-100. 10.1177/000331970005100201.
Mckeen NA: The meaning of motor activity: Emotion, temperament, mood, and laterality. Dissertation Abstracts International: Section B: The Sciences and Engineering. 2001, 61.
Meriwether RA, McMahon PM, Islam N, Steinmann WC: Physical activity assessment: validation of a clinical assessment tool. Am J Prev Med. 2006, 31: 484-491. 10.1016/j.amepre.2006.08.021.
Miller DJ, Freedson PS, Kline GM: Comparison of activity levels using the Caltrac accelerometer and five questionnaires. Med Sci Sports Exerc. 1994, 26: 376-382.
Motl RW, McAuley E, Snook EM, Scott JA: Validity of physical activity measures in ambulatory individuals with multiple sclerosis. Disability & Rehabilitation. 2006, 28: 1151-1156. 10.1080/09638280600551476.
Ng AV, Kent-Braun JA: Quantitation of lower physical activity in persons with multiple sclerosis. Med Sci Sports Exerc. 1997, 29: 517-523.
Otis RB, Brown AS, Womack CJ, Fonong T, Gardner AW: Relationship between physical activity recall and free-living daily physical activity in older claudicants. Angiology. 2000, 51: 181-188. 10.1177/000331970005100301.
Owens JF: Physical activity and cardiovascular risk factors: A cross-sectional study of premenopausal women. PhD Thesis. 1989, University of Pittsburgh
Paton NI, Elia M, Jebb SA, Jennings G, Macallan DC, Griffin GE: Total energy expenditure and physical activity measured with the bicarbonate-urea method in patients with human immunodeficiency virus infection. Clin Sci. 1996, 91: 241-245.
Patterson SM, Krantz DS, Montgomery LC, Deuster PA, Hedges SM, Nebel LE: Automated physical activity monitoring: validation and comparison with physiological and self-report measures. Psychophysiology. 1993, 30: 296-305. 10.1111/j.1469-8986.1993.tb03356.x.
Paul DR, Rhodes DG, Kramer M, Baer DJ, Rumpler WV: Validation of a food frequency questionnaire by direct measurement of habitual ad libitum food intake. Am J Epidemiol. 2005, 162: 806-814. 10.1093/aje/kwi279.
Philippaerts RM, Westerterp KR, Lefevre J: Doubly labelled water validation of three physical activity questionnaires. Int J Sports Med. 1999, 20: 284-289. 10.1055/s-2007-971132.
Philippaerts RM, Westerterp KR, Lefevre J: Comparison of two questionnaires with a tri-axial accelerometer to assess physical activity patterns. Int J Sports Med. 2001, 22: 34-39. 10.1055/s-2001-11359.
Pitta F, Troosters T, Spruit MA, Decramer M, Gosselink R: Activity monitoring for assessment of physical activities in daily life in patients with chronic obstructive pulmonary disease. Arch Phys Med Rehabil. 2005, 86: 1979-1985. 10.1016/j.apmr.2005.04.016.
Pols MA, Peeters PH, Kemper HC, Collette HJ: Repeatability and relative validity of two physical activity questionnaires in elderly women. Med Sci Sports Exerc. 1996, 28: 1020-1025.
Pongurgsorn C: A questionnaire for assessment of physical activity in Thailand. PhD Thesis. 2002, University of Illinois at Urbana-Champaign
Poudevigne MS, O'Connor PJ: Physical activity and mood during pregnancy. Med Sci Sports Exerc. 2005, 37.
Rauh MJ, Hovell MF, Hofstetter CR, Sallis JF, Gleghorn A: Reliability and validity of self-reported physical activity in Latinos. Int J Epidemiol. 1992, 21: 966-971. 10.1093/ije/21.5.966.
Reis JP, Dubose KD, Ainsworth BE, Macera CA, Yore MM: Reliability and validity of the occupational physical activity questionnaire. Med Sci Sports Exerc. 2005, 37: 2075-2083. 10.1249/01.mss.0000179103.20821.00.
Richardson MT, Leon AS, Jacobs DR, Ainsworth BE, Serfass R: Comprehensive evaluation of the Minnesota Leisure Time Physical Activity Questionnaire. J Clin Epidemiol. 1994, 47: 271-281. 10.1016/0895-4356(94)90008-6.
Richardson MT, Ainsworth BE, Wu HC, Jacobs DR, Leon AS: Ability of the Atherosclerosis Risk in Communities (ARIC)/Baecke Questionnaire to assess leisure-time physical activity. Int J Epidemiol. 1995, 24: 685-693. 10.1093/ije/24.4.685.
Richardson MT, Ainsworth BE, Jacobs DR, Leon AS: Validation of the Stanford 7-day recall to assess habitual physical activity. Ann Epidemiol. 2001, 11: 145-153. 10.1016/S1047-2797(00)00190-3.
Rothenberg E, Bosaeus I, Lernfelt B, Landahl S, Steen B: Energy intake and expenditure: validation of a diet history by heart rate monitoring, activity diary and doubly labeled water. Eur J Clin Nutr. 1998, 52: 832-838. 10.1038/sj.ejcn.1600655.
Rousham EK, Clarke PE, Gross H: Significant changes in physical activity among pregnant women in the UK as assessed by accelerometry and self-reported activity. Eur J Clin Nutr. 2006, 60: 393-400. 10.1038/sj.ejcn.1602329.
Rutgers CJ, Klijn MJ, Deurenberg P: The assessment of 24-hour energy expenditure in elderly women by minute-by-minute heart rate monitoring. Ann Nutr Metab. 1997, 41: 83-88.
Rutter S: Estimates of energy expenditure in women and a biofeedback device for weight loss. PhD Thesis. 1990, University of New Hampshire
Saleh KJ, Mulhall KJ, Bershadsky B, Ghomrawi HM, White LE, Buyea CM, Krackow KA: Development and validation of a lower-extremity activity scale. Use for patients treated with revision total knee arthroplasty. J Bone Joint Surg Am. 2005, 87: 1985-1994. 10.2106/JBJS.D.02564.
Schmidt MD, Freedson PS, Chasan-Taber L: Estimating physical activity using the CSA accelerometer and a physical activity log. Med Sci Sports Exerc. 2003, 35: 1605-1611. 10.1249/01.MSS.0000084421.97661.17.
Schmidt MD, Freedson PS, Pekow P, Roberts D, Sternfeld B, Chasan-Taber L: Validation of the Kaiser Physical Activity Survey in pregnant women. Med Sci Sports Exerc. 2006, 38: 42-50. 10.1249/01.mss.0000181301.07516.d6.
Schulz S, Westerterp KR, Bruck K: Comparison of energy expenditure by the doubly labeled water technique with energy intake, heart rate, and activity recording in man. Am J Clin Nutr. 1989, 49: 1146-1154.
Schulz LO, Harper IT, Smith CJ, Kriska AM, Ravussin E: Energy intake and physical activity in Pima Indians: comparison with energy expenditure measured by doubly-labeled water. Obes Res. 1994, 2: 541-548.
Sieminski DJ, Cowell LL, Montgomery PS, Pillai SB, Gardner AW: Physical activity monitoring in patients with peripheral arterial occlusive disease. J Cardiopulm Rehabil. 1997, 17: 43-47. 10.1097/00008483-199701000-00006.
Sims J, Smith F, Duffy A, Hilton S: The vagaries of self-reports of physical activity: a problem revisited and addressed in a study of exercise promotion in the over 65s in general practice. Family Practice. 1999, 16: 152-157. 10.1093/fampra/16.2.152.
Singh PN, Fraser GE, Knutsen SF, Lindsted KD, Bennett HW: Validity of a physical activity questionnaire among African-American Seventh-day Adventists. Med Sci Sports Exerc. 2001, 33: 468-475. 10.1097/00005768-200103000-00021.
Sirard JR, Melanson EL, Li L, Freedson PS: Field evaluation of the Computer Science and Applications, Inc. physical activity monitor. Med Sci Sports Exerc. 2000, 32: 695-700. 10.1097/00005768-200003000-00022.
Smith BJ, Marshall AL, Huang N: Screening for physical activity in family practice: evaluation of two brief assessment tools. Am J Prev Med. 2005, 29: 256-264. 10.1016/j.amepre.2005.07.005.
Sobngwi E, Mbanya JC, Unwin NC, Aspray TJ, Alberti KG: Development and validation of a questionnaire for the assessment of physical activity in epidemiological studies in Sub-Saharan Africa. Int J Epidemiol. 2001, 30: 1361-1368. 10.1093/ije/30.6.1361.
Speck BJ, Looney SW: Self-reported physical activity validated by pedometer: a pilot study. Public Health Nurs. 2006, 23: 88-94. 10.1111/j.0737-1209.2006.230112.x.
Stanish HI, Draheim CC: Assessment of walking activity using a pedometer and survey in adults with mental retardation. Adapt Phys Activity Q. 2005, 22: 136-145.
Starling RD, Toth MJ, Matthews DE, Poehlman ET: Energy requirements and physical activity of older free-living African-Americans: a doubly labeled water study. J Clin Endocr Metab. 1998, 83: 1529-1534. 10.1210/jc.83.5.1529.
Steele R, Mummery K: Occupational physical activity across occupational categories. J Sci Med Sport. 2003, 6: 398-407. 10.1016/S1440-2440(03)80266-9.
Stel VS, Smit JH, Pluijm SM, Visser M, Deeg DJ, Lips P: Comparison of the LASA Physical Activity Questionnaire with a 7-day diary and pedometer. J Clin Epidemiol. 2004, 57: 252-258. 10.1016/j.jclinepi.2003.07.008.
Strath SJ, Bassett DR, Ham SA, Swartz AM: Assessment of physical activity by telephone interview versus objective monitoring. Med Sci Sports Exerc. 2003, 35: 2112-2118. 10.1249/01.MSS.0000099091.38917.76.
Strath SJ, Bassett DR, Swartz AM: Comparison of the college alumnus questionnaire physical activity index with objective monitoring. Ann Epidemiol. 2004, 14: 409-415. 10.1016/j.annepidem.2003.07.001.
Strycker LA, Duncan SC, Chaumeton NR, Duncan TE, Toobert DJ: Reliability of pedometer data in samples of youth and older women. Int J Behav Nutr Phys Act. 2007, 4: 4. 10.1186/1479-5868-4-4.
Sugimoto A, Hara Y, Findley TW, Yoncmoto K: A useful method for measuring daily physical activity by a three-direction monitor. Scand J Rehabil Med. 1997, 29: 37-42.
Taylor CB, Coffey T, Berra K, Iaffaldano R, Casey K, Haskell WL: Seven-day activity and self-report compared to a direct measure of physical activity. Am J Epidemiol. 1984, 120: 818-824.
Timperio A, Salmon J, Crawford D: Validity and reliability of a physical activity recall instrument among overweight and non-overweight men and women. J Sci Med Sport. 2003, 6: 477-491. 10.1016/S1440-2440(03)80273-6.
Timperio A, Salmon J, Rosenberg M, Bull FC: Do logbooks influence recall of physical activity in validation studies?. Med Sci Sports Exerc. 2004, 36: 1181-1186. 10.1249/01.MSS.0000132268.74992.D8.
Verheul A-C, Prins AN, Kemper HCG, Kardinaal AFM, Van Erp-Baart M-A: Validation of a weight-bearing physical activity questionnaire in a study of bone density in girls and women. Pediatr Exerc Sci. 1998, 10.
Voorrips LE, Ravelli AC, Dongelmans PC, Deurenberg P, van Staveren WA: A physical activity questionnaire for the elderly. Med Sci Sports Exerc. 1991, 23: 974-979.
Walsh MC, Hunter GR, Sirikul B, Gower BA: Comparison of self-reported with objectively assessed energy expenditure in black and white women before and after weight loss. Am J Clin Nutr. 2004, 79: 1013-1019.
Wareham NJ, Jakes RW, Rennie KL, Mitchell J, Hennings S, Day NE: Validity and repeatability of the EPIC-Norfolk Physical Activity Questionnaire. Int J Epidemiol. 2002, 31: 168-174. 10.1093/ije/31.1.168.
Warms CA, Belza BL: Actigraphy as a measure of physical activity for wheelchair users with spinal cord injury. Nurs Res. 2004, 53: 136-143. 10.1097/00006199-200403000-00010.
Washburn RA, Smith KW, Jette AM, Janney CAW: The Physical Activity Scale for the Elderly (PASE): development and evaluation. J Clin Epidemiol. 1993, 46: 153-162. 10.1016/0895-4356(93)90053-4.
Washburn RA, Ficker JL: Physical Activity Scale for the Elderly (PASE): the relationship with activity measured by a portable accelerometer. J Sports Med Phys Fitness. 1999, 39 (4): 336-340.
Washburn RA, Jacobsen DJ, Sonko BJ, Hill JO, Donnelly JE: The validity of the Stanford Seven-Day Physical Activity Recall in young adults. Med Sci Sports Exerc. 2003, 35: 1374-1380. 10.1249/01.MSS.0000079081.08476.EA.
Welk GJ, Differding JA, Thompson RW, Blair SN, Dziura J, Hart P: The utility of the Digi-walker step counter to assess daily physical activity patterns. Med Sci Sports Exerc. 2000, 32: S481-S488. 10.1097/00005768-200009001-00007.
Welk GJ, Thompson RW, Galper DI: A temporal validation of scoring algorithms for the 7-day physical activity recall. Meas Phys Edu Exer Sci. 2001, 5.
Wendel-Vos GC, Schuit AJ, Saris WH, Kromhout D: Reproducibility and relative validity of the short questionnaire to assess health-enhancing physical activity. J Clin Epidemiol. 2003, 56: 1163-1169. 10.1016/S0895-4356(03)00220-8.
Whitt MC, Dubose KD, Ainsworth BE, Tudor-Locke C: Walking patterns in a sample of African American, Native American, and Caucasian women: the Cross-Cultural Activity Participation Study. Health Educ Behav. 2004, 31 (4 Suppl): 45S-56S. 10.1177/1090198104266034.
Wickel EE, Welk GJ, Eisenmann JC: Concurrent validation of the Bouchard Diary with an accelerometry-based monitor. Med Sci Sports Exerc. 2006, 38: 373-379. 10.1249/01.mss.0000183344.46077.bb.
Wilbur J, Chandler P, Miller AM: Measuring adherence to a women's walking program. West J Nurs Res. 2001, 23: 8-24. 10.1177/01939450122044934.
Wilkinson S, Huang CM, Walker LO, Sterling BS, Kim M: Physical activity in low-income postpartum women. J Nurs Scholarsh. 2004, 36: 109-114. 10.1111/j.1547-5069.2004.04022.x.
Williams E, Klesges RC, Hanson CL, Eck LH: A prospective study of the reliability and convergent validity of three physical activity measures in a field research trial. J Clin Epidemiol. 1989, 42: 1161-1170. 10.1016/0895-4356(89)90114-5.
Yamamura C, Tanaka S, Futami J, Oka J, Ishikawa-Takata K, Kashiwazaki H: Activity diary method for predicting energy expenditure as evaluated by a whole-body indirect human calorimeter. J Nutr Sci Vitaminol. 2003, 49: 262-269.
Adams SA, Matthews CE, Ebbeling CB, Moore CG, Cunningham JE, Fulton J, Hebert JR: The effect of social desirability and social approval on self-reports of physical activity. Am J Epidemiol. 2005, 161: 389-398. 10.1093/aje/kwi054.
Ainsworth BE, Jacobs DR, Leon AS, Richardson MT, Montoye HJ: Assessment of the accuracy of physical activity questionnaire occupational data. J Occup Med. 1993, 35: 1017-1027.
Barnard JA, Tapsell LC, Davies PS, Brenninger VL, Storlien LH: Relationship of high energy expenditure and variation in dietary intake with reporting accuracy on 7 day food records and diet histories in a group of healthy adult volunteers. Eur J Clin Nutr. 2002, 56: 358-367. 10.1038/sj.ejcn.1601341.
Boulay MR, Serresse O, Almeras N, Tremblay A: Energy expenditure measurement in male cross-country skiers: comparison of two field methods. Med Sci Sports Exerc. 1994, 26: 248-253. 10.1249/00005768-199402000-00017.
Clark D, Tomas F, Withers RT, Chandler C, Brinkman M, Phillips J, Berry M, Ballard FJ, Nestel P: Energy metabolism in free-living, 'large-eating' and 'small-eating' women: studies using 2H2(18)O. Br J Nutr. 1994, 72: 21-31. 10.1079/BJN19940006.
Davis JN: Comparisons of physical activity and dietary components in an overweight/obese population and their normal weight controls matched for gender, age and height. PhD Thesis. 2004, The University of Texas at Austin
Fogelholm M, Hiilloskorpi H, Laukkanen R, Oja P, Van Marken Lichtenbelt W, Westerterp K: Assessment of energy expenditure in overweight women. Med Sci Sports Exerc. 1998, 30: 1191-1197. 10.1097/00005768-199808000-00002.
Jakicic JM, Polley BA, Wing RR: Accuracy of self-reported exercise and the relationship with weight loss in overweight women. Med Sci Sports Exerc. 1998, 30: 634-638.
Johnson-Kozlow M, Sallis JF, Gilpin EA, Rock CL, Pierce JP: Comparative validation of the IPAQ and the 7-day PAR among women diagnosed with breast cancer. Int J Behav Nutr Phys Act. 2006, 3: 7. 10.1186/1479-5868-3-7.
Koulouri AA, Tigbe WW, Lean ME: The effect of advice to walk 2000 extra steps daily on food intake. J Human Nutr Diet. 2006, 19: 263-266. 10.1111/j.1365-277X.2006.00703.x.
Levin S, Jacobs DR, Ainsworth BE, Richardson MT, Leon AS: Intra-individual variation and estimates of usual physical activity. Ann Epidemiol. 1999, 9: 481-488. 10.1016/S1047-2797(99)00022-8.
Lovejoy JC, Champagne CM, Smith SR, de Jonge L, Xie H: Ethnic differences in dietary intakes, physical activity, and energy expenditure in middle-aged, premenopausal women: the Healthy Transitions Study. Am J Clin Nutr. 2001, 74: 90-95.
Masse LC, Fulton JE, Watson KL, Heesch KC, Kohl HW, Blair SN, Tortolero SR: Detecting bouts of physical activity in a field setting. Res Q Exerc Sport. 1999, 70 (3): 212-219.
Miller CL: A symptom management intervention in diabetic coronary artery bypass graft patients. PhD Thesis. 2005, University of Nebraska Medical Center
Racette SB, Schoeller DA, Kushner RF: Comparison of heart rate and physical activity recall with doubly labeled water in obese women. Med Sci Sports Exerc. 1995, 27: 126-133.
Richardson MT, Leon AS, Jacobs DR, Ainsworth BE, Serfass R: Ability of the Caltrac accelerometer to assess daily physical activity levels. J Cardiopulm Rehabil. 1995, 15: 107-113. 10.1097/00008483-199503000-00003.
Seale JL, Klein G, Friedmann J, Jensen GL, Mitchell DC, Smiciklas-Wright H: Energy expenditure measured by doubly labeled water, activity recall, and diet records in the rural elderly. Nutrition. 2002, 18: 568-573. 10.1016/S0899-9007(02)00804-3.
Sjostrom M, Yngve A, Ekelund U, Poortvliet E, Hurtig-Wennlof A, Nilsson A, Hagstromer M, Nylund K, Faskunger J: Physical activity in groups of Swedish adults: Are the recommendations feasible?. Scand J Nutr. 2002, 46.
Soundy A, Taylor A: Comparison of self-reported physical activity and accelerometer-generated data in individuals with severe mental illness. J Sports Sci. 2005, 23: 1223-1224.
Starling RD, Matthews DE, Ades PA, Poehlman ET: Assessment of physical activity in older individuals: a doubly labeled water study. J Appl Physiol. 1999, 86: 2090-2096.
Staten LK, Taren DL, Howell WH, Tobar M, Poehlman ET, Hill A, Reid PM, Ritenbaugh C: Validation of the Arizona Activity Frequency Questionnaire using doubly labeled water. Med Sci Sports Exerc. 2001, 33: 1959-1967. 10.1097/00005768-200111000-00024.
Stein AD, Rivera JM, Pivarnik JM: Measuring energy expenditure in habitually active and sedentary pregnant women. Med Sci Sports Exerc. 2003, 35: 1441-1446. 10.1249/01.MSS.0000079107.04349.9A.
Tzetzis G, Avgerinos A, Vernadakis N, Kioumourtzoglou E: Differences in self-reported perceived and objective measures of duration and intensity of physical activity for adults in skiing. Eur J Epidemiol. 2001, 17: 217-222. 10.1023/A:1017925731003.
Wadsworth D: Evaluation of a social cognitive theory based e-mail intervention to increase physical activity of college females. Dissertation Abstracts International: Section B: The Sciences and Engineering. 2006, 66: 4766.
Buchowski MS, Townsend KM, Chen KY, Acra SA, Sun M: Energy expenditure determined by self-reported physical activity is related to body fatness. Obes Res. 1999, 7: 23-33.
Liu B, Woo J, Tang N, Ng K, Ip R, Yu A: Assessment of total energy expenditure in a Chinese population by a physical activity questionnaire: examination of validity. Int J Food Sci Nutr. 2001, 52: 269-282. 10.1080/09637480120044138.
Lof M, Hannestad U, Forsum E: Assessing physical activity of women of childbearing age. Ongoing work to develop and evaluate simple methods. Food Nutr Bull. 2002, 23: 30-33.
Lof M, Forsum E: Activity pattern and energy expenditure due to physical activity before and during pregnancy in healthy Swedish women. Br J Nutr. 2006, 95: 296-302. 10.1079/BJN20051497.
Macfarlane DJ, Lee CC, Ho EY, Chan KL, Chan DT: Reliability and validity of the Chinese version of IPAQ (short, last 7 days). J Sci Med Sport. 2007, 10: 45-51. 10.1016/j.jsams.2006.05.003.
Richardson MT: Evaluation of the Minnesota Leisure Time Physical Activity Questionnaire. PhD Thesis. 1991, University of Minnesota
Warms CA: Acceptability and feasibility of a lifestyle physical activity program for people with spinal cord injury (SCI): A pilot study. PhD Thesis. 2002, University of Washington
Blair SN, Haskell WL, Ho P, Paffenbarger RS Jr, Vranizan KM, Farquhar JW, Wood PD: Assessment of habitual physical activity by a seven day recall in a community survey and controlled experiments. Am J Epidemiol. 1985, 122: 794-804.
Adams SA, Matthews CE, Ebbeling CB, Moore CG, Cunningham JE, Fulton J, Hebert JR: The effect of social desirability and social approval on self-reports of physical activity. [erratum appears in Am J Epidemiol. 2005 May 1;161(9):899]. Am J Epidemiol. 2005, 161: 389-398. 10.1093/aje/kwi054.
Bland JM, Altman DG: Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986, 1 (8476): 307-310.
Ainsworth BE, Haskell WL, Whitt MC, Irwin ML, Swartz AM, Strath SJ, O'Brien WL, Bassett DR, Schmitz KH, Emplaincourt PO, et al: Compendium of Physical Activities: an update of activity codes and MET intensities. Med Sci Sports Exerc. 2000, 32: S498-S516. 10.1097/00005768-200009001-00009.
Tudor-Locke CE, Myers AM: Challenges and opportunities for measuring physical activity in sedentary adults. Sports Med. 2001, 31: 91-100. 10.2165/00007256-200131020-00002.
Schoeller DA: Recent Advances from Application of Doubly Labeled Water to Measurement of Human Energy Expenditure. J Nutr. 1999, 129: 1765-1768.
Troiano RP, Berrigan D, Dodd K, Masse LC, Tilert T, McDowell M: Physical activity in the United States measured by accelerometer. Med Sci Sports Exerc. 2008, 40: 181-188.
Ferrari P, Friedenreich C, Matthews CE: The role of measurement error in estimating levels of physical activity. Am J Epidemiol. 2007, 166: 832-840. 10.1093/aje/kwm148.
Acknowledgements
The authors would like to thank David Moher for insightful guidance in the development of the research methods and Margaret Sampson for her technical assistance with the search strategy. The first author acknowledges financial support from Statistics Canada for the preparation of the paper.
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
SAP carried out the design, bibliographic search, article screening, data abstraction and synthesis, and drafted and edited the manuscript. KBA participated in the study's design and coordination and helped edit the manuscript. MH participated in article screening, data abstraction and editing of the manuscript. JH participated in data abstraction and data synthesis and helped edit the manuscript. SCG participated in the design of the study, provided methodological input, and assisted in the editing of the manuscript. MT conceived the study, participated in its design and coordination, and helped to draft the manuscript. All authors read and approved the final manuscript.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Cite this article
Prince, S.A., Adamo, K.B., Hamel, M.E. et al. A comparison of direct versus self-report measures for assessing physical activity in adults: a systematic review. Int J Behav Nutr Phys Act 5, 56 (2008). https://doi.org/10.1186/1479-5868-5-56