Introduction Individuals with type 1 diabetes (T1D) present with diverse body weight status and degrees of glycemic control, which may warrant different treatment approaches. We sought to identify subgroups sharing phenotypes based on both weight and glycemia and compare characteristics across subgroups.
Research design and methods Participants with T1D in the SEARCH study cohort (n=1817, 6.0–30.4 years) were seen at a follow-up visit >5 years after diagnosis. Hierarchical agglomerative clustering was used to group participants based on five measures summarizing the joint distribution of body mass index z-score (BMIz) and hemoglobin A1c (HbA1c) which were estimated by reinforcement learning tree predictions from 28 covariates. Interpretation of cluster weight status and glycemic control was based on mean BMIz and HbA1c, respectively.
Results The sample was 49.5% female and 55.5% non-Hispanic white (NHW); mean±SD age=17.6±4.5 years, T1D duration=7.8±1.9 years, BMIz=0.61±0.94, and HbA1c=76±21 mmol/mol (9.1±1.9)%. Six weight-glycemia clusters were identified, including four normal weight, one overweight, and one subgroup with obesity. No cluster had a mean HbA1c <58 mmol/mol (7.5%). Cluster 1 (34.0%) was normal weight with the lowest HbA1c and comprised 85% NHW participants with the highest socioeconomic position, insulin pump use, dietary quality, and physical activity. Subgroups with very poor glycemic control (ie, ≥108 mmol/mol (≥12.0%); cluster 4, 4.4%, and cluster 5, 7.5%) and obesity (cluster 6, 15.4%) had a lower proportion of NHW youth, lower socioeconomic position, and reported decreased pump use and poorer health behaviors (overall p<0.01). The overweight subgroup with very poor glycemic control (cluster 5) showed the highest lipids and blood pressure (p<0.01).
Conclusions There are distinct subgroups of youth and young adults with T1D that share weight-glycemia phenotypes. Subgroups may benefit from tailored interventions addressing differences in clinical care, health behaviors, and underlying health inequity.
- pediatric Type 1 diabetes
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
Significance of this study
What is already known about this subject?
Obesity is an increasing issue among youth and young adults with type 1 diabetes.
However, there is considerable heterogeneity in both weight status and glycemic control across the population, which challenges the use of universal or ‘one-size-fits-all’ recommendations to optimize weight alongside glycemic control.
What are the new findings?
We used data-driven methods and focused on cohort of youth and young adults with etiologically defined type 1 diabetes to characterize heterogeneity in the clinical phenotypes based on both weight status and glycemia.
We found six distinct subgroups of youth and young adults with type 1 diabetes sharing weight-glycemia clinical phenotypes based on measures of body mass index z-score (BMIz) and hemoglobin A1c (HbA1c).
We found differences in sociodemographic characteristics, clinical factors, and behavioral correlates across subgroups, suggesting that the subgroups with the poorest glycemic control and highest weight are more likely to be non-white, have lower measures of socioeconomic status, and report poor health behaviors.
How might these results change the focus of research or clinical practice?
We show that the weight-glycemia phenotypes of diabetes present with varying combinations of BMIz and HbA1c with differences in other individual characteristics, including sociodemographic, clinical, and behavioral features. These subgroups of individuals may benefit from similar therapeutic strategies and can be targeted more efficiently as groups for clinical recommendations and interventions to co-optimize the outcomes of weight and glycemia.
The results implicate a precision medicine approach to address the complicated physiologic relationships between weight and glycemic control across phenotypic subgroups.
As the prevalence of obesity increases worldwide, recent data have shown that the prevalence of overweight and obesity in youth and young adults with type 1 diabetes (T1D) is even higher than in the general population.1 2 Excess adiposity increases the risk of cardiovascular disease later in life which is already elevated up to 10-fold in persons with T1D. Therefore, there are early efforts to integrate weight management with the complexities of routine T1D care including both weight loss and prevention of overweight and obesity.3
However, the rising numbers of youth and young adults with T1D who are overweight or obese have also contributed to the heterogeneity in the T1D patient population. Given that appropriate treatment algorithms may vary markedly across the broad spectrum of body weight and glycemia,4 the T1D patient population is a good candidate for precision medicine, which matches interventions to different subgroups of patients expected to show a positive response.5 6 Epidemiological evidence suggests population-level associations between body mass index z-score (BMIz) and hemoglobin A1c (HbA1c)2 7 8; however, surprisingly little is known about how weight status and glycemic control are codistributed across the population and interact to form more nuanced clinical phenotypes of T1D. The weight-glycemia phenotype may confer information about goals for treatment and effectiveness of specific therapeutic strategies for optimizing outcomes simultaneously, especially given that weight gain may be an unintended consequence of intensive insulin therapy in some individuals.9
Previous work used data-driven approaches to stratify adults with T1D and type 2 diabetes (T2D) into subgroups based on six ‘raw’ clinical and physiologic features.10 The resulting subgroups showed differences in progression of T2D and risk for complications.10 However, few studies have characterized heterogeneity in weight and glycemia within the etiologic diagnosis of T1D despite the rising prevalence of overweight and obesity. Therefore, our objective was to use data from a large, diverse cohort of youth and young adults with T1D to identify and characterize subgroups sharing clinical phenotypes of T1D based on weight status, measured by BMIz, and glycemic control, measured by HbA1c.
The SEARCH for Diabetes in Youth study began in 2000 with an overarching objective to describe the incidence and prevalence of youth-onset diabetes in the USA by age, sex, and race/ethnicity. Youth and young adults with diabetes diagnosed <20 years of age (‘youth’) were identified from a population-based incidence registry network at five US sites (South Carolina; Cincinnati, Ohio and surrounding counties; Colorado with southwestern Native American sites; Seattle, Washington and surrounding counties; and Kaiser Permanente Health Plan, Southern California).11 A subset of participants with newly diagnosed diabetes between 2002 and 2006 and in 2008 were recruited for a follow-up ‘cohort’ visit between 2012-2015 if they had attended a baseline visit and had greater than 5 years of diabetes duration. The subset of youth who were included in the SEARCH cohort visit were not significantly different from all other SEARCH youth diagnosed between the years 2002 and 2008 in terms of average diabetes-onset age, demographics, or clinical measures.12
Inclusion criteria for this report consisted of incident cases of T1D between 2002 and 2006 and 2008 who attended the SEARCH cohort visit (n=2004). Diabetes type for these analyses was based on an etiological classification using diabetes autoantibodies and estimated insulin sensitivity score (euglycemic clamp-validated equation including waist circumference, HbA1c and triglyceride levels) from the baseline visit.13 Participants who reported inconsistent or implausible levels of severe hypoglycemia (n=4) and who were missing BMIz or HbA1c measures at the cohort visit (n=183) were excluded.
Trained personnel administered questionnaires; measured height, weight, and blood pressure; and obtained fasting blood samples. BMI was defined as weight (kilograms) divided by height (meters2) and converted to a z-score based on US growth reference data.14 To facilitate study across youth and young adults, BMIz for individuals >20 years was estimated assuming an age of 20 years (the maximum age represented in the growth reference); this approach has been operationalized in previous SEARCH studies12 15 and elsewhere.16 A blood draw occurred after an 8-hour overnight fast, and medications, including short-acting insulin, were withheld the morning of the visit.
Blood samples were obtained under conditions of metabolic stability, defined as no episodes of diabetic ketoacidosis in the preceding month and the absence of fever and acute infections. They were processed locally and shipped within 24 hours to the central laboratory (Northwest Lipid Metabolism and Diabetes Research Laboratories, Seattle, WA). HbA1c was measured by a dedicated ion exchange high-performance liquid chromatography instrument (TOSOH Bioscience, San Francisco, CA).
Demographic measures included sex and self-reported race and ethnicity, categorized as non-Hispanic white, non-Hispanic black, Hispanic, Asian, Native American, Pacific Islander, and other. Highest education by either parent was classified as less than high school degree, high school graduate, some college through associate degree, and bachelor’s degree or more. Annual household income was classified as >$75 000, $50 000–75 000, $25 000–49 999 and <$25 000. Healthcare access was measured by health insurance type, classified as none, private, Medicaid, or other.
Insulin regimen was classified as pumps, long-acting with short/rapid-acting insulin injections with ≥3 injections per day, and any other form of multiple or singular daily injections. Self-reported frequency of self-monitoring of blood glucose (SMBG) was categorized as <1, 1–3, and >3 times per day. Diabetes care provider was classified as pediatric endocrinologist, adult endocrinologist, and all other healthcare providers (pediatrician, family practice doctor, nurse practitioner, and so on).
Depressive symptoms were measured using the Center for Epidemiologic Studies Depression Scale (CESD).17 Quality of life was measured using the age-appropriate Center for Pediatric Quality of Life Inventory (PedsQL).18 The CESD and PedsQL were modeled as continuous variables. Physical activity and screen time were assessed using questionnaires. High physical activity was classified as vigorous activity 3–7 days weekly. High screen time was classified as 2 or more hours of screen time per day. Data from a self-administered, validated food frequency questionnaire were available for 1643 participants who were aged >10 years.19 Dietary quality was assessed by adherence to the Dietary Approaches to Stop Hypertension diet using an index score ranging from 0 to 80.20
We used cluster analysis to identify and characterize subgroups sharing clinical phenotypes of T1D based on weight status and glycemic control. Considerations of the clustering approach are described in detail in the online supplementary appendix. As opposed to unsupervised cluster analysis, where there is no outcome measure or data labels, we wished to perform a semisupervised cluster analysis guided by the two outcomes of interest. A challenge in identifying supervised clusters is that noise in a given outcome may obscure true subgroups of clinical interest.21 Therefore, rather than cluster individuals based on the observed values of BMIz and HbA1c at the cohort visit, we employed a novel, semisupervised clustering technique to group individuals in SEARCH by five measures of the joint distribution of BMIz and HbA1c at the cohort visit: the means and variances of BMIz and HbA1c and their covariance. The five values summarizing the joint distribution of BMIz and HbA1c were predicted for each individual using reinforcement learning trees (RLT), a type of tree-based machine learning technique,22 and 28 other characterizing variables that were available for each patient (X-variables). The 28 X-variables were chosen to capture a breadth of individual characteristics available at the cohort visit including sociodemographic, clinical, anthropometric, laboratory, psychosocial and behavioral measures (see online supplementary table S1). Any given X-variable was missing from at most 12% of individuals and imputed by a multiple imputation method, missForest.23 The resulting RLT-estimated outcomes represent smoothed outcome measures, denoised by the X-variables, which maintain the individual-level signal with reduced noise or measurement error.22 Of note, the 28 X-variables were only used to predict measures of the joint distribution of BMIz and HbA1c for each individual and were not used directly in the cluster analysis.
The five clustering variables (RLT-predicted means and variances of BMIz and HbA1c and their covariance) were standardized and a hierarchical clustering algorithm with Ward’s D2 method and Euclidean distance was applied. The number of clusters was chosen using the NbClust package in R24 and restricted to considering between four and nine clusters. The smallest cluster was restricted to greater than 50 people for adequate statistical power (>85%) to detect small to medium effects in overall cluster comparisons.25 Clustering stability was assessed by sequentially omitting individual clusters, one at a time, and evaluating the agreement of the remaining clusters using the adjusted Rand index.26 For more information on imputation methods, RLT parameters, clustering methods, stability assessments, and additional analyses, see online supplementary appendix.
Clusters were ordered by increasing weight status and then by increasing mean HbA1c. Clusters were named based on mean BMIz and HbA1c using traditional clinical cut-point for ease of interpretation. Cluster weight status was classified as underweight (mean BMIz <−1.64), normal weight (mean BMIz −1.64 to <1.04), overweight (mean BMIz 1.04 to <1.64), and obesity (mean BMIz ≥1.64), corresponding to <5th, 5th to <85th, 85th to <95th, and ≥95th percentiles for age and sex, respectively.27 Cluster glycemic control was defined as good (mean HbA1c <58 mmol/mol (<7.5%)), moderate (mean HbA1c 58 to <75 mmol/mol (7.5% to <9.0%)), poor (mean HbA1c 75 to <108 mmol/mol (9.0% to <12.0%)), and very poor (mean HbA1c ≥108 mmol/mol (≥12.0%)).28 Unfortunately, this method of cluster nomenclature cannot represent the weight status and glycemic control of each individual within cluster and instead was selected to facilitate cluster-level phenotypic interpretation and comparisons thereof.
The cross-sectional correlates of each cluster from the follow-up visit were summarized using descriptive statistics. The chosen approach and statistical criteria were part of an a priori specified analysis plan. Data are presented as mean±SD. Overall tests of difference as well as pairwise comparisons were carried out using analysis of variance, t-tests, and χ2 tests, or Fisher’s exact tests, where appropriate. We accounted for multiple comparisons in (1) overall tests of difference, and (2) post hoc pairwise comparisons between individual clusters. Overall tests were corrected via a Bonferroni adjustment using the total number of tests as the denominator. For pairwise comparisons, we report q values, which control for the positive false discovery rate29 (see online supplementary appendix). P values and q values were evaluated at the 0.05 significance level.
Additional analyses explored clusters based on the observed or ‘raw’ measures of BMIz and HbA1c for comparison to the clusters based on the joint distribution of BMIz and HbA1c, denoted Y-clusters. These analyses are described in detail in the online supplementary appendix, see Additional Analyses. All data analyses were performed using the statistical analysis software package R V.3.4.1 and SAS V.9.4 (SAS Institute).
The study included 1817 individuals with T1D, with a mean age of 17.6 (6.0–30.4 years) and a mean T1D duration of 7.8 years (table 1). Six weight-glycemia phenotypic clusters were identified based on measures of the joint distribution of BMIz and HbA1c (figure 1).
Based on mean measures, cluster 1 (n=618, 34.0%) was normal weight with moderate glycemic control (mean BMIz 0.59±0.59, mean HbA1c 61±12 mmol/mol (7.7%±1.1%)); although the mean HbA1c was classified as moderate, this subgroup had the highest proportion of youth (38%) with HbA1c <58 mmol/mol (7.5%). Cluster 2 (n=195, 10.7%) was also classified as normal weight with moderate glycemic control but showed a slightly lower mean BMIz and higher mean HbA1c than cluster 1 (mean BMIz −0.68±0.66, mean HbA1c 68±10 mmol/mol (8.4%±0.9%)). Cluster 3 (n=509, 28.0%) was normal weight with poor glycemic control (mean BMIz 0.56±0.62, mean HbA1c 86±12 mmol/mol (10.0%±1.1%)). Cluster 4 (n=79, 4.4%) was normal weight with very poor glycemic control (mean BMIz −1.05±0.83, mean HbA1c 113±15 mmol/mol (12.5%±1.4%)). Cluster 5 (n=137, 7.5%) was overweight with very poor glycemic control (mean BMIz 1.29±0.69, mean HbA1c 109±15 mmol/mol (12.1%±1.5%)). Cluster 6 (n=279, 15.4%) was those with obesity and moderate glycemic control (mean BMIz 1.74±0.42, mean HbA1c 70±11 mmol/mol (8.6%±1.0%)). Figure 2 depicts the density distribution of BMIz and HbA1c within each weight-glycemia cluster.
Cluster 1 (normal weight with moderate glycemic control) was the largest cluster, comprising 34.0% of the sample. Based on mean BMIz and HbA1c measures closest to clinical targets, this group was selected as the referent group for individual comparisons. Tables 1 and 2 depict the sociodemographic characteristics and the diabetes care, psychosocial, and behavioral factors according to the six weight-glycemia clusters. Participants in cluster 1 were 46% female, 88% non-Hispanic white and were characterized by the highest measures of socioeconomic position, including 74% having parents with a bachelor’s degree or more and 85% with private health insurance. This group also had the highest prevalence of insulin pump use and frequency of SMBG, the lowest level of depressive symptoms, the highest quality of life, the highest dietary quality, and the highest levels of physical activity (overall p<0.001).
One cluster emerged with poor glycemic control (cluster 3, normal weight with poor glycemic control) and two with mean HbA1c >108 mmol/mol (12.0%) (cluster 4, normal weight with very poor glycemic control; cluster 5, overweight with very poor glycemic control). The latter two were the smallest subgroups, comprising approximately 4% and 8% of the sample, respectively. Compared with cluster 1, these clusters included a significantly higher proportion of non-white individuals (58% and 50%), with the highest proportion of non-Hispanic black individuals in cluster 4 (28%) and highest prevalence of Hispanic individuals in cluster 5 (23%) (q<0.001). Clusters 4 and 5 also had lower measures of socioeconomic position and significantly lower insulin pump use and less frequent SMBG (table 1; all q<0.001). Approximately 38% of individuals in cluster 4% and 41% in cluster 5 experienced an episode of diabetic ketoacidosis in the past 6 months compared with 10% in cluster 1. Finally, clusters 4 and 5 were characterized by higher depressive symptoms, lower quality of life, poorer dietary quality, and greater a proportion of high screen time (all q<0.001).
Two clusters were classified as overweight and having obesity (cluster 5, overweight with very poor glycemic control; cluster 6, obesity with moderate glycemic control). Compared with cluster 1, both subgroups contained a higher proportion of females (66% in cluster 5 and 55% in cluster 6) and non-white youth. Cluster 6 was also characterized by moderately lower measures of socioeconomic position compared with cluster 1 (all q<0.001).
Additional post hoc pairwise comparisons were made between cluster 5 and cluster 6, the two overweight/obesity subgroups (online supplementary tables S2–S4). Compared with cluster 6 (obesity with moderate glycemic control), cluster 5 (overweight with poor glycemic control) comprised more female (q=0.028) and non-white participants (q<0.001). Individuals in cluster 5 were older at the follow-up visit (q<0.001) and had lower socioeconomic position (q<0.001 for parental education, income, and insurance type) with no significant differences in diabetes duration (p=0.15). These participants were also less likely to use an insulin pump or report frequent SMBG (q<0.001). There was a higher prevalence of high screen time in cluster 5 (p=0.001) with no significant differences in physical activity (q=0.34).
Table 3 depicts other clinical measures across the weight-glycemia clusters. Compared to cluster 1, clusters 3, 4, 5, and 6 showed significantly higher blood lipid levels. Cluster 5 showed higher total cholesterol, low-density lipoprotein cholesterol, and triglycerides compared with both the referent cluster 1 and cluster 6 (obesity). This group also exhibited higher mean systolic and diastolic blood pressure levels than cluster 1 and a higher mean diastolic blood pressure compared with cluster 6 (all q<0.001).
In a large, diverse cohort of youth and young adults with T1D, we found evidence of subgroups that share distinct weight-glycemia phenotypes including varying combinations across BMIz and glycemic control parameters. None of the clusters that were identified had a mean HbA1c <58 mmol/mol to be classified as good glycemic control, underscoring that youth and young adults with T1D are not meeting the targets put forward by the American Diabetes Association and International Society for Pediatric and Adolescent Diabetes.30 Based on mean measures, four clusters were largely normal weight, with the remaining two clusters classified as overweight and having obesity, although there were individuals across all weight status categories who were captured in each cluster based on measures of the joint distribution. Examination of the latter two subgroups reveals that while overweight and poor glycemic control can co-occur in young people with diabetes (ie, the weight-glycemia phenotype of cluster 5, comprising 8% of the sample), obesity is not always associated and does not necessarily account for those with poor or very poor glycemic control (ie, the weight-glycemia phenotype of cluster 6, comprising 15% of the sample). This is consistent with previous work demonstrating a U-shaped association between HbA1c and BMI standard deviation score (SDS), with the highest HbA1c levels among children and adolescents with T1D who were classified as underweight and obese.8
Clinical recommendations for individuals with HbA1c levels above target may differ based on weight status, especially given the complicated physiologic relationships between weight and glycemia.31 For example, approaches for under or normal weight individuals with elevated HbA1c may be centered on insulin intensification, while approaches for overweight individuals could balance the glycemic benefits of insulin intensification with the potential for weight gain via concurrent behavioral, pharmacological, or surgical interventions.32 Although the American Diabetes Association Standards of Care currently state that the risks and benefits of agents adjunctive to insulin therapy in T1D require further evaluation, early studies have shown improvements in glycemic control as well as body weight associated with metformin, GLP1-receptor agonists, and SGLT-2 inhibitors.32 33 Subgroups with adequate glycemic control and overweight or obesity may be specifically targeted for more intensive behavioral modifications such as dietary interventions to address obesity with special attention towards avoiding hypoglycemia associated with decreased caloric or carbohydrate intake.31 Other aspects of clinical care may be tailored according to subgroup as well. For example, while all youth should receive timely and regular screening for complications of diabetes, particular subgroups may warrant earlier interventions, including the introduction of renoprotective regimens or statin medications. Ultimately, larger and intentionally designed trials will be required to move from understanding observational phenotypes to devising their therapeutic approaches. In particular, as the treatment landscape of T1D expands to include novel non-insulin adjuvants32 as well as hybrid closed loop and fully automated insulin delivery systems,34 capturing treatment response across subgroups both in terms of changes in body weight and glycemic control will provide critical data to inform how phenotypes may be modified to better predict heterogeneity in response to specific treatment approaches.
Given the high risk for long-term complications, we focus our discussion on clusters 4 and 5, the subgroups with the poorest glycemic control (HbA1c >108 mmol/mol), as well as cluster 6, the subgroup with obesity and moderate glycemic control. Together, they account for approximately 27% of the study population.
The results corroborate previous reports that glycemic control differs by race and ethnicity among youth and young adults with T1D4 28 and is also associated with lower measures of socioeconomic position including parental education, income, and health insurance type. The results also substantiate other studies showing that lower household income and parental education level associate with overweight/obesity status in T1D35 and are consistent with reports that youth with T1D who are of Hispanic ethnicity and females are at the highest risk of overweight or obesity.36 This finding is particularly concerning given recent data suggesting that the incidence of T1D is increasing most rapidly among Hispanic youth.37
Weight-glycemia clusters also showed significant differences in several aspects of clinical care, psychosocial outcomes, and health behaviors that were measured concurrently with BMIz and HbA1c. In our study, the best mean glycemic control was associated with higher use of insulin pump therapy38 and increased frequency of blood glucose monitoring.39
The psychosocial correlates of clusters were consistent with previous studies, including a positive relationship between mean HbA1c levels and mean depressive symptoms and a negative association between mean HbA1c levels and mean perceived quality of life measures.40 Differences in potentially modifiable behavioral factors underscore that physically active lifestyle and decreased sedentary time are associated with lower BMI and percentage of body fat among people with T1D.35 Unfortunately, overall dietary quality measures were low, even among youth and young adults with the most favorable weight-glycemia phenotype, consistent with previous findings.19
The significant differences in clinical parameters across weight-glycemia clusters implicate potential disparity in long-term cardiovascular disease risk across these subgroups.9 The combination of very high HbA1c and increased insulin needs of cluster 5, the overweight subgroup with very poor glycemic control, is suggestive of insulin resistance.9 36 This group also exhibited the worst cardiovascular disease risk profile including elevated lipid and blood pressure levels. Laboratory measures were significantly elevated compared with cluster 6, despite the higher mean BMIz of cluster 6. More work is needed to understand how adiposity and hyperglycemia jointly contribute to cardiovascular disease risk profiles.
One of the most striking results is the pattern with which all other demographic, socioeconomic, clinical care, psychosocial, and behavioral factors track across the clusters derived from measures of the joint distribution of weight and glycemia. It is relevant to note that non-white race/ethnicity, lower socioeconomic position and healthcare access, and poorer psychosocial well-being have been shown to be associated with each other and with glycemic control elsewhere in SEARCH studies.38 40 A maximally effective precision medicine approach to co-optimize weight and glycemia will concern itself with underlying biology as well as characteristics of individuals and resource constraints that may influence outcomes over time. Although the cross-sectional cluster analysis is not designed for causal conclusions, future research is needed to develop the specific interventional strategies to impact weight and glycemia outcomes that considers the close relationships among these economic, social, and cultural factors. In the meantime, efforts should also be made to identify youth soon after diagnosis who may have sociodemographic risk with subsequent efforts to ensure access and affordability of necessary diabetes medications and supplies.
The study has several weaknesses. Despite the collective use of gap statistics and supporting graphs, selection of the number of clusters is subjective. Additional external validation studies are required to understand the generalizability of major weight-glycemia phenotypes across other observational cohort studies of T1D. In addition, BMIz was used as a proxy for weight status, although this measure may not reflect adiposity,41 42 and the large age range necessitated imputation of BMIz for participants over 20 years of age using z-score data for age 20 years. To assess for differential error of BMIz on participant age (ie, youth vs young adults), we stratified the sample by age at follow-up visit (<21 years, n=1399, ≥21 years, n=418) and independently evaluated clusters in each sample (online supplementary table S5, figure S1). Despite significant differences in sample sizes, we found largely consistent clustering results in both strata, suggesting that the measure of BMIz did not bias the nature of the clusters across different age ranges. Future work could use validated predictive equations to estimate body fat percentage43 and derive ‘adiposity-glycemia’ clusters using the predicted variable. The current study is cross-sectional and cannot elucidate temporal associations with the weight-glycemia phenotypes nor the longitudinal clinical outcomes; future studies should explore whether subgroups develop different rates or patterns in the emergence of T1D complications and cardiovascular disease risk factors to inform clinical utility of this weight-glycemia phenotype. Along these lines, the prognostic value of T1D phenotypes may be enhanced by incorporation of other clinical variables representing cardiovascular disease risk factors to inform risk stratification with regard to cardiovascular disease. The study also has several strengths. One is inherent in the analytic design; this approach to characterize a phenotype based on two outcomes allows real-life phenotypes to emerge rather than forcing a fit based on a priori clinical cut-points for weight and glycemic control. In additional analyses, the six weight-glycemia clusters were compared with strata of the same sample defined by clinical cut-points for overweight/obesity and poor glycemic control (see online supplementary tables S6 and S7). The strata corroborated main descriptive results (ie, differences in sociodemographic characteristics across subgroups with differing levels of glycemic control), providing face validity to the weight-glycemia clusters. However, the use of a priori cut-points was found to be less well suited to identify subgroups sharing clinically significant yet more nuanced weight-glycemia phenotypes who may otherwise distinguish themselves in a clustering approach, such as the subgroups with very poor glycemic control. For example, clinical cut-points collapsed all individuals in clusters 3, 4, and 5 in the same strata of glycemic control, despite the notable differences in glycemia (refer to these subgroups in relation to the dashed line denoting poor glycemic control at HbA1c 75 of mmol/mol (9.0%) in figure 1B.) Given the relatively recent emergence of overweight and obesity in T1D, the data-driven approach is particularly well suited to explore underlying subgroups within the SEARCH cohort, from which the population-specific utility of existing clinical cut-points can be better evaluated. A further strength of the study was the novel semisupervised statistical methods used to identify a phenotype based on two clinical outcomes and their relationship to each other, using all patient information to adjust for potential measurement error and within-person heterogeneity. In additional analyses, the Y-clusters showed multiple nodes of density and larger within-cluster distribution of BMIz and HbA1c (online supplementary table S8, supplementary figure S2), suggesting higher within-cluster variability due to noise in the raw outcomes that obscures underlying clustering structure in the data. The advantage of clusters driven by predicted measures of the joint distribution is that this method uses X-variables to denoise the raw outcome measures, thereby maximizing data available in the cohort study to understand the underlying variance in weight and glycemia, and their relationship as a clinical phenotype. Finally, to our knowledge, this is the first study to investigate the spectrum of the weight-glycemia phenotypes of T1D and their broad epidemiologic correlates using the large, diverse SEARCH cohort. The study complements previous efforts to address heterogeneity in adult diabetes10 with a focus on T1D in a younger age range to inform earlier interventions.
In conclusion, we show that the heterogeneous population of youth and young adults with T1D comprised identifiable subgroups with shared weight-glycemia clinical phenotypes based on measures of the joint distribution of BMIz and HbA1c. Importantly, overweight and obesity present with varying degrees of glycemic control in this population, implicating different therapeutic and clinical strategies to concurrently address weight and glycemia across subgroups. To this end, a precision medicine framework may facilitate a systems-based approach to address health inequity and deliver targeted strategies needed to optimize obesity and dysglycemia, particularly when both are poorly controlled.
The SEARCH for Diabetes in Youth study is indebted to the many youth and their families, and their healthcare providers, whose participation made this study possible.
MRK and EJM-D contributed equally.
Contributors ARK, CTN, XJ, MRK, and EJMD designed the analyses. ARK and CTN conducted the analyses. ARK and EJMD drafted the initial manuscript. LAA, SA, AEA, KSB, JBB, DD, LMD, GI, JML, SM, CP, BAR, and KAS reviewed all analyses and provided critical review of the manuscript. GI—a representative of one of the cosponsors—reviewed all analyses and provided critical review of the manuscript. ARK and EJMD had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. MRK and EJMD are cosenior authors.
Funding Grant support (SEARCH 4): The SEARCH for Diabetes in Youth cohort study (1UC4DK108173) is funded by the National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases and supported by the Centers for Disease Control and Prevention. Grant support (SEARCH 3): The SEARCH for Diabetes in Youth is funded by the Centers for Disease Control and Prevention (PA numbers 00097, DP-05-069, and DP-10-001) and supported by the National Institute of Diabetes and Digestive and Kidney Diseases. Site contract numbers: Kaiser Permanente Southern California (U18DP006133, U48/CCU919219, U01 DP000246, and U18DP002714), University of Colorado Denver (U18DP006139, U48/CCU819241-3, U01 DP000247, and U18DP000247-06A1), Cincinnati's Children's Hospital Medical Center (U18DP006134, U48/CCU519239, U01 DP000248, and 1U18DP002709), University of North Carolina at Chapel Hill (U18DP006138, U48/CCU419249, U01 DP000254, and U18DP002708), Seattle Children's Hospital (U18DP006136, U58/CCU019235-4, U01 DP000244, and U18DP002710-01), Wake Forest University School of Medicine (U18DP006131, U48/CCU919219, U01 DP000250, and 200-2010-35171). The authors wish to acknowledge the involvement of the Kaiser Permanente Southern California’s Clinical Research Center (funded by Kaiser Foundation Health Plan and supported in part by the Southern California Permanente Medical Group); the South Carolina Clinical and Translational Research Institute at the Medical University of South Carolina, NIH/National Center for Advancing Translational Sciences (NCATS) grant numbers UL1 TR000062, UL1Tr001450; Seattle Children's Hospital and the University of Washington, NIH/NCATS grant number UL1 TR00423; University of Colorado Pediatric Clinical and Translational Research Center, NIH/NCATS grant number UL1 TR000154; the Barbara Davis Center at the University of Colorado at Denver (DERC NIH grant number P30 DK57516); the University of Cincinnati, NIH/NCATS grant numbers UL1 TR000077, UL1 TR001425; and the Children with Medical Handicaps program managed by the Ohio Department of Health. This study includes data provided by the Ohio Department of Health, which should not be considered an endorsement of this study or its conclusions. ARK is supported by the National Institute of Diabetes and Digestive and Kidney Disease of the National Institutes of Health under award number F30DK113728. MRK, CTN, XJ, and JBB are supported by funding from NC Tracs (the CTSA at UNC): UL1TR002489.
Disclaimer The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention and the National Institute of Diabetes and Digestive and Kidney Diseases.
Competing interests None declared.
Patient consent for publication Not required.
Ethics approval The study was approved by institutional review boards with jurisdiction, and the parent, adolescent or young adult, or both provided consent or assent for all participants, as appropriate.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data may be obtained from a third party and are not publicly available. The data that support the findings of this study are available from the SEARCH for Diabetes in Youth study, but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the SEARCH for Diabetes in Youth study.