1 Introduction
Wearables are smart electronic devices that bring the ability to continuously record, monitor, and analyze key health indicators of individuals as they go about their lives. The global market share for wearables is forecasted to be $33 billion by 2025, with an average growth rate of 15% each year [
1]. Wearable devices have been instrumental in transforming personalized medicine and individual health monitoring practices. The number of wearables-based research has been steadily increasing each year—a trend that is expected to continue in the future [
2]. Wearables are primarily wrist-worn sensors that continuously track elementary wellness indicators such as sleep, heart rate, and activity; however, newer types of wearables, including implantables, chest-worn sensors, head-mounted displays, smart jewelry, and smart clothing, could be capable of capturing a lot more health indicators such as physiological stress, respiration rate, blood alcohol, blood sugar, and other vitals in a noninvasive way.
Many studies employ wearables data for predictive modeling such as fall detection or predicting 30-day readmissions [
3–
5], yet there is growing interest in conducting natural and quasi-natural experiments to study human health and its relationship with externalities [
2,
6,
7]. Health indicators from wearables can be combined with other data sources such as electronic health records and social media to better understand dynamics between human health and society [
8–
10]. Wearables are capable of continuously monitoring and recording health indicators at fine granularity for multiple participants simultaneously. However, the resulting repeated measures data poses data quality and analytical challenges such as missing values, high dimensionality, lagged effects, and clustered errors [
11,
12]. Existing glass-box models such as mixed-effects regression are often insufficient for accurately representing higher-order associations between variables in wearables data [
13]. For example, including a second-order term in the mixed-effects regression model can show that a second-order effect of an input is statistically significant, but it is difficult to explain the functional nature of the second-order association from the quadratic expression alone. In other words, it would be interesting to learn the extrema (i.e., maxima/minima) of the second-order effect and how the outcome varies for unit change in the input around such a point. Furthermore, establishing such a functional relationship using visualizations alone can be challenging for wearables data with repeated measures per user and heterogeneity across users. Therefore, there is a need for novel interpretable methods for explaining higher-order associative patterns in wearables data.
Segmented regression, also known as broken-stick or piecewise regression, is an explanatory modeling approach where input variable(s) of interest are partitioned or segmented into intervals followed by fitting straight lines for each interval in the regression model. It is commonly used in design science research applications with a hypothesized curvilinear relationship in simple regression models [
14,
15]. The most critical challenge in fitting a segmented model is the determination of input value(s) at which segments need to be separated, a problem also known as breakpoint or change point determination. Studies that use segmented regression with mixed effects [
16] determine change points using either ad hoc or black-box procedures [
13,
17]. These methods either tend to have low external validity or overfit and are therefore unsuitable for making inferences. To overcome these shortcomings, we propose a new method for determining change points for input segments in a mixed-effects regression. Our method is based on the
human-in-the-loop (HITL) paradigm as it uses human inputs during the model training process. Our method repurposes the smooth functions generated by a
generalized additive mixed model (GAMM) to allow the analyst to set initial estimates of the change points through visual inspection. Following which, a fast robust root-finding algorithm called
Brent's method is used to precisely locate each change point iteratively by maximizing the mixed-effects model's Akaike information criteria [
18]. In this way, our method takes advantage of human inputs to fit segmented models that can accurately represent higher-order associations between variables in wearables data. We evaluate our method on three real-world datasets. Our method consistently outperforms existing approaches for all three datasets. The models developed using our method enable clear interpretations of four higher-order associations across the datasets. Inferences from our analysis have multiple managerial implications.
3 Our Method
We propose a new method for capturing higher-order associative patterns in wearables data using segmented mixed-effects modeling. Our method uses an HITL approach to determine the change points in the segmented model by combining an algorithmic search process with human inputs for fine-tuning. In other words, our method uses the smooth function estimated by a GAMM to allow the analyst to annotate the change point estimates followed by fine-tuning of the estimates using a constrained optimization procedure. Our method is novel since few studies have used an HITL approach to tune change point parameters for segmented modeling. It also provides a robust mechanism to capture and explain higher-order relationships hypothesized in wearables data. In the rest of this section, we explain our method in detail.
Consider a mixed-effects model commonly used for explaining repeated measures in data from wearables as follows:
Equation (
1) is a representation of generalized linear mixed models with a linear link function, but any link function is applicable to our method.
\({y}_{ij}\) is value of a given health outcome for the
\({i}\hbox{th}\) observation and
\({j}\hbox{th}\) individual,
\({\beta }_0\) is the fixed intercept,
\({\rm{{\rm B}}} = \{ {{\beta }_1, \ldots ,{\rm{\ }}{\beta }_K} \}\) are coefficients for
K fixed effects
\(\{ {{x}_1,{\rm{\ }}{x}_2, \ldots ,{\rm{\ }}{x}_K} \}\),
\({{\rm{\Gamma }}}_0 = \{ {{\gamma }_{01},{\gamma }_{02}, \ldots ,{\gamma }_{0j}, \ldots ,{\gamma }_{0J}} \}\) are J random intercepts for each individual,
\({\rm{\Gamma \ }} = \{ {{\gamma }_{11}, \ldots ,{\gamma }_{1j}, \ldots ,{\gamma }_{MJ}} \}{\rm{\ }}\) are coefficients for
\(M{\rm{\ }}x{\rm{\ }}J\) random effects
\(\{ {{z}_1,{\rm{\ }}{z}_2, \ldots ,{\rm{\ }}{z}_M} \}\), and
\({\epsilon }_{ij}\) is the residual error.
Suppose there exists an input
\({x}_r\) such that its second-order (or higher-order) effects are significant, then Equation (
1) can be represented as follows:
In Equation (
2),
\({x}_{rij}\) denotes the value of input variable
\({x}_r\) for the
\({i}\hbox{th}\) observation and
\({j}\hbox{th}\) individual.
As described earlier, segmented representations of higher-order effects are more interpretable than polynomials. The input variable
\({x}_r\) can be represented as the sum of segments as follows:
In Equation (
3),
\({\rm{{\rm H}}} = \{ {{\eta }_1,{\eta }_2, \ldots ,{\rm{\ }}{\eta }_P} \}{\rm{\ }}\) is a set of
P change points defined for the input variable
\({\rm{\ }}{x}_r\), which is broken into P segments
\(\{ x_r^{( 1 )} = \ {x}_r\cdot I( {{x}_r < {\eta }_1} ), x_r^{( 2 )} = {x}_r\cdot I( {\eta }_1 \le {\rm{\ }}{x}_r < {\eta }_2 ), \ldots ,{\rm{\ }}x_r^{( P )} = {x}_r\cdot I( {{\eta }_P \le {\rm{\ }}{x}_r} ) \}\).
\(I( \varphi )\) is an indicator function equal to 1 if condition
\(\varphi\) is true; otherwise, it is 0. Therefore, the scalar product
\({x}_r\cdot I( \varphi )\) has value equal to
\({x}_r\) when
\(\varphi\) is true and is 0 otherwise. The next logical step is to estimate number of change points and their positions.
We propose an HITL method to estimate the change points
\({\eta }_{p{\rm{\ }}} \in H,\ P = | H |\) as follows. As the first step, we fit a GAMM [
56] with a given input
\({x}_r{\rm{\ }}\) as a nonparametric spline as shown next:
Although a nonparametric spline can be included for the corresponding random effect of input
\({x}_r\), it is computationally more expensive for fitting the corresponding semiparametric model. We empirically tested on multiple datasets and observed the shape of the component smooth function
\({f}_r( {{x}_{rij}} )\) to not be sensitive to random effects as smooth functions. Therefore, we consider the smooth function only in the fixed effects. In the next step, we visualize the plot of the smooth function in Equation (
4) approximated as a B-spline [
57]. Here, a human input is required to identify the order of the curve by inspecting the number of extrema (i.e., minima and maxima) to set the value of
P. The setting of
P can be based upon visual inspection as well as prior domain knowledge. For example, in Figure
1, the
P values for the different scenarios in (a) through (d) are chosen as
1,
2,
1, and
\(5,\) respectively. This step also determines whether to opt for a segmented model over a linear model, by inspecting the curvilinear nature of the component smooth function. For instance, although we set
P = 1 for scenario illustrated in Figure
1(c), an analyst may also approximate the monotonically increasing curve as a linear function in this case, thus favoring simplicity over slightly better model fit.
The value of the maxima and minima are used as starting points in a linear search algorithm in the third step. The third step involves iteratively performing search for change points using Brent's method [
58], a linear optimization with box constraints. Brent's method is a hybrid root-finding algorithm combining the bisection method, the secant method, and the inverse quadratic interpolation that make it robust and highly efficient while incorporating box constraints (i.e., range of permitted values) [
59]. For each iteration of the optimizer, a mixed model shown with segmented inputs for
\({x}_r\) is fit for a particular change point parameter. The algorithm returns the change point parameter corresponding to the model with minimum Akaike information criteria. Since Brent's method is a single parameter estimator, we identify the change points sequentially by repeating the search algorithm for each subsequent change point after fixing values of previously determined change points. Finally, the segmented mixed-effects model is fit as shown next:
In Equation (
5),
\(S{\rm{\ }}\) is a set of segments constructed using change points
\({\rm{{\rm H}}}\) identified for input
\({x}_r\). Significance of the effect of input variable,
\({x}_r\), at each segment,
s, can be determined by inspecting the corresponding fixed-effects coefficient,
\({\rm{\ }}\beta _r^{( s )}\), under regular conditions. The algorithm for our method is shown in Table
1.
5 Discussion and Conclusion
In this study, we presented the problem of developing an interpretable model that captures piecewise pairwise associations between different modalities captured by wearables. Since existing methods for segmented modeling for mixed-effect regression are insufficient to determine robust and verifiable change point, our method is timely with increasing research applications utilizing wearables in a natural experimental setup. Our method involves the inspection of smooth functions of pairwise associations captured using GAMM, followed by using Brent's method to sequentially position change points optimizing model fit. Our method not only uses analytical tools to determine change points but also utilizes user discretion to control the number of change points and its localization. For example, it is often desirable to avoid change points at extremities as data corresponding to these segments may be very sparse, rendering inference unreliable. We apply our method to three different wearables datasets and show that not only is it effective in terms of improving model fit and prediction performance but also significantly enhances model interpretability and ability to derive meaningful inferences.
5.1 Managerial Implications
Our method and analysis have several managerial implications. Our study provides a novel tool to analyze wearables data, thus boosting the value for storing and processing of large amounts of big data generated by wearables. Our HITL-based segmented modeling method can be used in a wide range of wearables applications such as patient monitoring systems, military fitness management programs, smart diet applications, and COVID-19 contact tracing. Our analysis over the three wearables datasets presents interesting pairwise associations. The positive relationship between sound level and physical wellbeing measure below the range of 51 dBA informs workplace design practices on the need for further examination of sound level effects on employee health for different sound level ranges. A higher gradient of an activity-wellbeing relationship in the lower range of activity provides additional empirical evidence on the value of low-intensity/intermittent activities on elevating instantaneous stress and improving wellbeing. The significant association between cardiovascular wellness measure and respiration rate after a certain threshold of the ST segment index solicits clinical researchers to further examine inter-relationships between pulmonary and cardiovascular wellness indices to improve on existing hospital monitoring and early warning systems. Finally, the association between a dimension of raw accelerometer data and extrinsic phenomena such as alcohol consumption stresses the value of looking at raw data in addition to expert-engineered features such as gait variability and the number of continuous steps.
5.2 Contribution to IS Research
Predictive modeling and statistical modeling in analytics go side-by-side as one predicts the future using existing data, focusing on informing us on the question “What will be,” whereas the other explicates hidden patterns and tells us about “What is” with respect to a phenomenon. Both of them are important and require attention to optimize the utility of the generated data. As the number of wearable technology-based applications increases in the future, the quantum of available data to analyze will exponentially increase and warrant more and more advancements in explainable modeling for meaningful interpretations of patterns. In this study, we introduce a new method to address the design challenge of representing nonlinear associative patterns in wearables data. Our contribution is timely in IS research, as the discipline is widening its scope in design science as well as explanatory modeling applications by using novel data sources such as wearables [
70]. WDA is a promising area in IS [
5,
19], opening a wide range of research applications owing to the following two reasons: the ubiquitous nature of wearables in today's lifestyle, and the promise of wearables to generate rich, personalized, temporal, and highly grained information content. We therefore posit that our contributions through a novel interpretable modeling method for addressing challenges in WDA lays the foundation for promising research in IS using data generated from wearables.
5.3 Limitations
There are some caveats and limitations to our study. We have focused on the design science problem of developing an interpretable modeling method but do not delve into the subject of determining the significance of input variables themselves. In addition, our method by itself does not imply causation, although it can be applied to any explanatory modeling scenario including causal or quasi-causal experimental settings. If curvilinear effects are absent, the segmented modeling approach should be avoided to prevent overfitting. The modeling approach described in this study is useful when higher-order association is predetermined between pairs of repeated measures and there is a need to better explain these associations for making inferences. For high-dimensional large datasets, GAMM can take longer time to fit, and the change point optimization can be tedious for the analyst. A few ways to avoid this problem are to apply feature selection, variable transformation, and outlier detection procedures before examining pairwise associations using our method. Next, our HITL approach involves human inputs and therefore may still be susceptible to human errors and biases, despite the fine-tuning step using the optimization procedure. One way of reducing such potential errors is to consult domain experts post determination of change point from the optimization procedure. Finally, it is worth noting that our method caters to the problem of improving interpretability of glass-box models, at the cost of increased bias and limited predictive power when compared to black-box data mining models [
28].
5.4 Conclusion
With the increasing availability of wearables, we can measure and understand different health phenomena at a highly granular level. We propose an HITL method for accurate estimation of change points in segmented mixed-effects regression facilitating the interpretations of pairwise associations of variables in wearables data. Our method is robust and efficient, and the resultant segmented models provide better prediction accuracy than state-of-the-art alternatives for a given problem. Our proposed method is empirically validated, more reliable due to human verification, and provides better interpretable results. Our approach can be generalized to other areas of IS where nonlinear pairwise associations are anticipated.