Published on in Vol 14 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/81135, first published .
Predicting Age-Related Hearing Loss in Community-Dwelling Older Adults: Multicenter Retrospective Cohort Study

Predicting Age-Related Hearing Loss in Community-Dwelling Older Adults: Multicenter Retrospective Cohort Study

Predicting Age-Related Hearing Loss in Community-Dwelling Older Adults: Multicenter Retrospective Cohort Study

Original Paper

1School of Nursing, Capital Medical University, Beijing, Beijing, China

2Dongcheng District Community Health Service Management Center, Beijing, China

3Department of Otolaryngology, Beijing Tongren Hospital, Beijing, China

Corresponding Author:

Jun-E Liu, PhD

School of Nursing

Capital Medical University

No. 10, Xitoutiao, You'anmenwai

Fengtai District

Beijing, Beijing, 100069

China

Phone: 86 185 0013 6128

Email: liujune66@ccmu.edu.cn


Background: Age-related hearing loss (ARHL) is associated with severe negative outcomes, including social isolation, depression, and cognitive decline. Despite this, routine ARHL screening is often neglected in primary care due to low awareness, resource limitations, and inefficiencies. A practical risk assessment tool could effectively address this gap.

Objective: This study aims to develop and validate a user-friendly nomogram for identifying older adults at high risk of ARHL in community settings, thereby facilitating targeted screening and timely interventions.

Methods: This multicenter retrospective cohort study included 34,983 older adults from 3 primary health care centers in Beijing (January 2020 to October 2023). Data from center A (n=18,707) were used for model development, with external validation performed on cohorts from center B (n=11,008) and center C (n=5268). Least absolute shrinkage and selection operator and logistic regression identified the final predictors. Model performance was evaluated using discrimination, calibration, and decision curve analysis, leading to the development of an online nomogram.

Results: In the training cohort (center A), 1177 participants (6.3%) had hearing loss. Six key predictors were identified: age, education, exercise frequency, physical function, dietary habits, and hypertension. The multivariate logistic regression model demonstrated good discrimination in internal validation (area under the curve [AUC] 0.806, 95% CI 0.782-0.831; sensitivity 0.774; specificity 0.820). External validation confirmed its generalizability (AUC 0.720, 95% CI 0.670-0.771 and AUC 0.747, 95% CI 0.712-0.782). Decision curve analysis highlighted a substantial clinical net benefit. A user-friendly online prediction web page was also developed.

Conclusions: We successfully developed and validated a dynamic, online nomogram for predicting ARHL in older adults. Comprising 6 readily available predictors, this model shows potential as a practical, online tool for proactive risk identification in primary care. However, further validation in larger and more diverse populations is essential to confirm its generalizability and real-world clinical utility.

Interact J Med Res 2025;14:e81135

doi:10.2196/81135

Keywords



Age-related hearing loss (ARHL) is the third most prevalent health condition among older adults, after hypertension and arthritis, and has become a significant public health issue [1]. The World Health Organization estimates that more than 1.5 billion people worldwide experience hearing loss, a number projected to reach 2.5 billion by 2050 [2]. In China, hearing loss affects 30% of adults older than 60 years and rises to 50% for those older than 70 years, totaling at least 70 million cases [3]. ARHL can hinder communication and social interaction, leading to greater risks of social isolation, depression, cognitive decline [4-6], frailty, and falls [7,8].

Despite these negative health consequences, routine hearing screening remains significantly underused, with rates below 25% in older populations [9,10]. Key barriers include limited patient awareness and financial constraints; for example, a diagnostic audiogram in China can cost ¥100 (US $14.1) to ¥300 (US $42.3), a substantial out-of-pocket expense for many older adults; these challenges are further compounded by limited time, training, and equipment among primary care clinicians [11-14]. Furthermore, high false-positive rates from some hearing screening methods can increase costs, prompting policymakers to adopt a cautious approach [3,15].

Leveraging primary health care centers for the management of hearing loss is the most suitable and cost-effective approach [16,17]. A preliminary prediction tool designed for these centers could accurately identify individuals with high risk for targeted audiometric screening, thereby optimizing referrals and specialist resources. While previous models have been developed using known risk factors for hearing impairment, such as age, environmental noise [18], comorbidities [19], exercise, and nutrition [20], their clinical applicability is often limited by small sample sizes, a lack of external validation, and reliance on complex variables.

Therefore, this study aimed to develop a streamlined nomogram-based prediction model for ARHL among community-dwelling older adults, using a large sample derived from health check-ups conducted in primary health care institutions in Beijing, China. Additionally, the model was externally validated in 2 independent datasets. An online tool was used to deploy the model as an interactive application, enabling primary health care providers to perform risk prediction without requiring any software installation or programming expertise. This study adhered to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis guidelines [21].


Study Population

This multicenter retrospective cohort study used electronic health examination records from 3 community primary health care centers (center A, center B, and center C) in Beijing, China. Data were collected from older adults who participated in the annual health checkup under the National Public Health Service Project between January 1, 2020, and October 30, 2023. The dataset included comprehensive information on demographics, medical history, physical examination results, and chronic disease records. For this study, the dataset from center A, which had the largest sample size, was used for model development and internal validation, while data from center B and center C were used for external validation. A detailed study design and sample selection are shown in Figure 1.

From center A, a total of 20,923 patient records were initially screened. First, 3.94% (825/20,923) records with incomplete data were excluded. After subsequently excluding 0.26% (54/20,923) individuals with major ear conditions, 5.96% (1246/20,923) individuals aged younger than 65 years, and 0.43% (91/20,923) individuals with cognitive or neurological disorders, a final cohort of 89.41% (18,707/20,923) participants was included in the analysis. The external validation cohorts were derived from center B and center C. Initially, 11,454 records from center B and 5522 records from center C were screened. After applying the exclusion criteria, a total of 3.89% (446/11,454) participants were excluded from center B and 4.60% (254/5522) from center C. This process resulted in final validation cohorts of 96.11% (11,008/11,454) participants from center B and 95.40% (5268/5522) from center C.

Figure 1. Flowchart for hearing loss prediction model development and validation.

Assessment of Hearing Loss

The outcome variable for this study was hearing loss, which was assessed using the whispered voice test as specified by the standardized protocol in the National Basic Public Health Service Standards (Third Edition) [22], issued by China’s National Health Commission. This screening was performed by trained primary health care professionals to ensure consistency across all centers.

Specifically, the whispered voice test, a widely recognized and validated screening tool, was used due to its simplicity, cost-effectiveness, and reliability in detecting hearing loss in clinical and community settings [23,24]. The protocol dictates the following procedure: the examiner stands approximately 0.5 m from the participant’s tested ear, positioned slightly behind them to prevent lip-reading. The participant’s nontested ear is occluded (blocked) by the examiner to isolate the tested ear. After a full exhalation, the examiner whispers a simple, standardized phrase (eg, What is your name?). Failure to correctly repeat the phrase was classified as the presence of hearing loss. For the purpose of our model, this binary outcome (presence vs absence of hearing loss) was used as the dependent variable [22].

Sample Size Estimation

The determination of an adequate sample size for a multivariate predictive model requires a sufficient representation of individuals and outcome events relative to predictor parameters [25]. The combined sample size from 3 primary health care institutions provided robust support for using 33 predictive variables in both modeling and validation. Specifically, according to the widely applied 10 events per variable principle, at least 330 events (33 variables×10 events) are required [26]. Given an event rate of 0.063, this translates to approximately 5239 cases. The modeling dataset comprised 18,707 cases and 1177 events, which is more than 3 times the minimum requirement of 330 events. This large number of events per variable provides sufficient statistical power and minimizes the risk of model overfitting.

Model Development and Validation

Variable selection was conducted using the least absolute shrinkage and selection operator (LASSO) regression, which applies a Least Absolute Deviations penalty to shrink some regression coefficients to 0, simplifying the model while preserving variables with strong predictive power. Tenfold cross-validation determined the optimal penalty parameter λ (λ_min), and variables with nonzero coefficients in the final model were chosen as candidate predictors [27,28]. Next, univariate analysis was performed on the LASSO-selected variables. Variables with P values <0.05 were considered for inclusion in the multivariate logistic regression model. A forward stepwise regression approach was applied to identify the final predictors to be included in the model. To handle missing data, a complete case analysis was used. This approach, which excludes records with any missing data on key variables, was considered appropriate because the proportion of such records was low across all cohorts (3.94% in center A, 1.62% in center B, and 1.68% in center C), consistently remaining below 5%.

Statistical analysis was performed using R software (version 4.2.2; R Foundation for Statistical Computing). To facilitate clinical application of the prediction model, we developed an interactive online application using the Shiny framework (RStudio, PBC). The final logistic regression model and coefficients were embedded into the server logic using the rms package to ensure consistency with the nomogram. A user-friendly interface was created with Shiny, shiny Widgets, and shiny themes packages, allowing real-time input and calculation of predicted risk. Dynamic visualizations of individualized risk were implemented using plotly, providing immediate graphical feedback. The application was deployed on a cloud-based Shiny server to allow online access without requiring local installation.

Model validation included 3 aspects: discrimination, calibration, and clinical utility [29]. Discrimination was assessed using the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and Matthews correlation coefficient to evaluate the model’s ability to distinguish between individuals with hearing loss and those without hearing loss. The calibration of the model was assessed using calibration plots generated through 1000 bootstrap resamples, in addition to the Brier score and mean absolute error, to evaluate the consistency between the predicted and actual observed hearing loss outcomes. Risk stratification strategy was based on the optimal cutoff value (the hearing loss risk probability when the Youden index is maximum, Youden=sensitivity+specificity−1) [30]. Finally, the clinical utility of the final model was evaluated using decision curve analysis.

Ethical Considerations

The study was approved by the ethics committee of Capital Medical University, Beijing, China (Z2024SY071). Informed consent was obtained from participants during routine health examinations, and no additional consent was required for this secondary analysis. All data were anonymized and securely stored with access restricted to the research team. No financial or other compensation was provided, as the data were derived from routine public health services.


Population Characteristics

A total of 18,707 participants were included in the analysis in center A. Of these, 44.20% (8268/18,707) were male and 55.80% (10,439/18,707) were female, with a mean age of 72.89 (SD 7.15, range 65-99) years. A total of 6.29% (1177/18,707) participants were identified as having hearing loss, while 93.71% (17,530 /18,707) had normal hearing. The participants were then randomly divided into training (13,094/18,707, 70%) and internal validation sets (5613/18,707, 30%) in a 7:3 ratio. Additionally, 96.11% (11,008/11,454) and 95.40% (5268/5522) of participants were included from center B and center C, respectively, for external validation.

Model Development and Performance

The most significant features selected by LASSO from the training set were used to identify potential predictors for the model. The screening process is detailed in Figure 2. Initially, 33 variables were identified, guided by systematic reviews, meta-analyses, and expert clinical opinions. These factors encompass demographic characteristics, lifestyle factors, physiological measurements, and chronic diseases. The model’s bias showed minimal variation within the interval (λ1se, λmin). When λmin was set to 0.002 in the LASSO model, 17 variables exhibited nonzero coefficients, as shown in Table 1.

Figure 2. Feature selection using the least absolute shrinkage and selection operator (LASSO) regression. (A) The LASSO coefficient profile plot. Each colored line represents the coefficient path for 1 of the 33 candidate predictor variables as the regularization penalty (log lambda) increases. As the penalty grows, the coefficients of less important variables are shrunk toward 0. (B) Identification of the optimal penalization coefficient (λ) was achieved by 10-fold cross-validation and the minimum criterion. The left vertical line represents the mean square error, and the right vertical line represents the cross-validated mean square error within 1 SE of the minimum.
Table 1. Univariate analysis of potential predictors for hearing loss in the training set (n=13,094).
VariablesNo hearing loss, n (%)Hearing loss, n (%)Chi-square (df)P value
Sex0.5 (1).49

Male6832 (52.2)453 (3.5)


Female5448 (41.6)361 (2.8)

Age (years)677.3 (2)<.001

65-706467 (49.4)152 (1.2)


71-803785 (28.9)242 (1.8)


≥812028 (15.5)420 (3.2)

Education level33.3 (2)<.001

Associate degree or above2815 (21.5)207 (1.6)


Middle or high school6395 (48.8)343 (2.6)


Primary school or below3070 (23.4)264 (2)

Marital status1.8 (1).19

Married9470 (72.3)611 (4.7)


Other (Never married, divorced, or separated, widowed)2810 (21.5)203 (1.6)

Waist circumference0.3 (1).56

Normal7737 (59.1)504 (3.8)


Abnormal4543 (34.7)310 (2.4)

BMI (kg/m2)

8.0 (2).02

18.5-29.911,708 (89.4)759 (5.8)


<18.5181 (1.4)15 (0.1)


>30.0391 (3)40 (0.3)

Physical function483.1 (1)<.001

Able to perform smoothly12,203 (93.2)739 (5.6)


Unable to perform any task independently75 (0.6)77 (0.6)

Exercise frequency886.0 (3)<.001

Everyday7592 (58)205 (1.6)


More than once a week1826 (13.9)79 (0.6)


Occasionally970 (7.4)75 (0.6)


Not exercising1892 (14.4)455 (3.5)

Dietary habits57.2 (3)<.001

Balanced diets11,411 (87.1)703 (5.4)


Primarily vegetarian diet527 (4)59 (0.5)


Primarily meat-based diet182 (1.4)21 (0.2)


Preference for sugar, salt, and oil160 (1.2)31 (0.2)

Smoking11.9 (2).003

Never smoked11,107 (84.8)760 (5.8)


Former smoker360 (2.7)25 (0.2)


Current smoker813 (6.2)29 (0.2)

Alcohol consumption frequency13.5 (2).001

Never11,138 (85.1)768 (5.9)


Occasionally486 (3.7)15 (0.1)


Frequently or daily656 (5)31 (0.2)

Hypertension19.7 (1)<.001

No4785 (36.5)253 (1.9)


Yes7495 (57.2)561 (4.3)

Chronic obstructive pulmonary disease0.01 (1).90

No12,115 (92.5)804 (6.1)


Yes165 (1.3)10 (0.1)

Diabetes0.8 (1).37

No3730 (28.5)260 (2)


Yes8550 (65.3)554 (4.2)

Stroke2.1 (1).15

No10,365 (79.2)671 (5.1)


Yes1915 (14.6)143 (1.1)

Left systolic blood pressure (mm Hg)5.4 (1).02

90-14010,573 (80.7)725 (5.5)


<90 or >1401707 (13)89 (0.7)

Right systolic blood pressure (mm Hg)5.7 (1).02

90-14010,597 (80.9)727 (5.6)


<90 or >1401683 (12.9)87 (0.7)

First, univariate analysis was conducted on the 17 variables identified through LASSO regression to evaluate their independent associations with hearing loss. The detailed results of this analysis, including odds ratios and P values for all variables, are presented in Table 1. The results indicated that age, education level, marital status, BMI, physical function, exercise frequency, dietary habits, smoking, alcohol consumption frequency, hypertension, left systolic blood pressure, and right systolic blood pressure were significantly associated with hearing loss.

Subsequently, a forward stepwise regression approach selected 6 predictors—age, education, exercise frequency, physical function, dietary habits, and hypertension—which were then included in the multivariate logistic regression model (Table 2). A user-friendly nomogram was developed, as illustrated in Figure 3. The optimal cutoff value was 0.805 based on the maximum Youden index. Accordingly, the population was classified into low (predicted probability <80.5%) and high (predicted probability≥80.5%) hearing loss risk groups. A comparison of the model’s predicted risk classifications against the observed outcomes is presented in Table 3. For example, in the training set, of the 814 individuals, the model correctly identified 639 (78.5%) with hearing loss (true positives) and of the 12,280 individuals, 10,099 (82.2%) with normal hearing (true negatives), corresponding to the sensitivity and specificity values reported.

In the training set, the model achieved an AUC of 0.828 (95% CI 0.813-0.842), sensitivity was 0.785 (639 out of 814 participants who actually developed hearing loss were correctly predicted as high-risk), specificity was 0.822 (10,099 of the 12,280 participants who did not develop hearing loss were correctly predicted as low-risk), and balanced accuracy was 0.804. In the internal validation set, AUC was 0.806 (95% CI 0.782-0.831), sensitivity was 0.774, specificity was 0.820, balanced accuracy was 0.797, and Matthews correlation coefficient was 0.350.

Table 2. Results of the multivariate logistic regression model for predicting hearing loss.
Predictorβ (SE)ORa (95% CI)P value
Intercept−4.46 (0.131)b<.001
Age (years)

65-70 (reference)

71-801.15(0.108)3.16 (2.56-3.91)<.001

≥811.97 (0.108)7.19 (5.81-8.89)<.001
Education level

Primary school or below (reference)

Middle or high school−0.34 (0.095)0.71 (0.59-0.86)<.001

Associate degree or above−0.49 (0.107)0.61 (0.50-0.76)<.001
Exercise frequency

Every day (reference)

More than once a week0.39 (0.133)1.48 (1.14-1.91).003

Occasionally0.75 (0.144)2.13 (1.61-2.81)<.001

Almost no exercise1.84 (0.091)6.3 (5.27-7.52)<.001
Physical function

Able to perform smoothly (reference)

Unable to perform any action1.87 (0.176)6.47 (4.58-9.14)<.001
Dietary habits

Balanced diets (reference)

Primarily meat-based or vegetarian0.37 (0.135)1.45 (1.11-1.89).006

Sugar, salt, and oil addiction0.89 (0.23)2.43 (1.55-3.82)<.001
Hypertension

Negative (reference)

Positive0.26 (0.083)1.29 (1.09-1.52).002

aOR: odds ratio.

bNot available.

Figure 3. Nomogram to predict the probability of hearing loss in community-dwelling older adults.
Table 3. Comparison of model-predicted risk stratification and observed hearing loss across training and validation cohorts.
Predicted risk groupObserved hearing loss, nObserved no hearing loss, nRow total, n
Training set

Model-predicted high riska63921812820

Model-predicted low riskb17510,09910,274

Column total81412,28013,094
Internal validation set

Model-predicted high riska2819461227

Model-predicted low riskb8243044386

Column total36353505613
External validation setA

Model-predicted high riska8712851372

Model-predicted low riskb3995979636

Column total12610,88211,008
External validation setB

Model-predicted high riska148791939

Model-predicted low riskb8942404329

Column total23750315268

aThe model-predicted low-risk group was defined as predicted probability <80.5%.

bThe model-predicted high-risk group was defined as predicted probability ≥80.5%. Percentages are omitted because row percentages and column percentages represent different concepts. Absolute counts and all marginal totals are provided to allow direct calculation of any desired proportion.

External Validation

In external validation set A, the AUC was 0.720 (95% CI 0.670-0.771), with a sensitivity of 0.690, a specificity of 0.882, a balanced accuracy of 0.786, and a Matthews correlation coefficient of 0.204. In external validation set B, the model achieved an AUC of 0.747 (95% CI 0.712-0.782), with a sensitivity of 0.624, a specificity of 0.843, a balanced accuracy of 0.734, and a Matthews correlation coefficient of 0.270. These results highlight the model’s good predictive performance for identifying hearing loss in older adults. The receiver operating characteristic curves are shown in Figure 4.

The nomogram’s calibration plots are shown in Figure 5. Quantitative metrics supported the visual assessment of calibration. In the external validation set A, the Brier score was 0.125 and the mean absolute error was 0.002. In the external validation set B, these values were 0.221 and 0.003, respectively. Lower Brier scores and mean absolute error values suggest that the model provides probability estimates close to the true risk, helping clinicians avoid unnecessary interventions for patients with low risk and missed diagnoses in patients participants with high risk.

Figure 6 presents the decision curves for the nomogram. The high-risk threshold probability represents the predicted probability of hearing loss, above which clinicians would consider routine audiometric screening and early interventions, such as lifestyle modifications. In the decision curve analysis plots for both cohorts, the model (red line) consistently demonstrated greater net benefit than the default strategies (treat all or none) across a conservative overlapping threshold range of approximately 5% to 75%. Using the model to guide ARHL prediction and implement routine screening and early interventions provided an additional net benefit of up to 20% compared with alternative strategies that assume all patients are either severely or nonseverely affected.

Figure 4. Receiver operating characteristic curves of the prediction model for hearing loss in community-dwelling older adults in the training and validation sets.
Figure 5. Calibration plots of the prediction model for hearing loss in community-dwelling older adults in external validations sets. (A) Calibration plot for external validation set A. (B) Calibration plot for external validation set B.
Figure 6. Decision curve analysis of the prediction model for hearing loss in community-dwelling older adults in external validation sets. (A) Decision curve analysis for external validation set A. (B) Decision curve analysis for external validation set B.

Visualization of Predictive Models: Development of a User-Friendly Web Page

To facilitate the clinical application of our nomogram, we developed an interactive and user-friendly web application using the R Shiny framework. The application features a simple, single-panel user interface where clinicians can input a patient’s 6 predictor variables—age, education, exercise frequency, physical function, dietary habits, and hypertension—using a combination of numeric input fields and dropdown menus (Figure 7).

Upon clicking a Predict button, the tool instantly computes and displays the patient’s predicted probability of hearing loss. The output is presented as a clear percentage, along with a corresponding risk stratification (High risk or Low risk based on the 80.5% cutoff) to aid in immediate clinical decision-making. The web application is publicly accessible and requires no software installation or programming knowledge, ensuring its broad utility for primary care providers [31].

Figure 7. Web page–based estimation of hearing loss likelihood in community-dwelling older adults.

Principal Findings

The principal finding of this large, multicenter study is the development and external validation of a practical, 6-predictor nomogram for predicting hearing loss in community-dwelling older adults. Our model, which uses readily available data from routine health checkups (age, education, exercise frequency, physical function, dietary habits, and hypertension), demonstrated good discriminative ability (AUC>0.720 in all cohorts) and calibration.

A key clinical implication of this finding is the model’s potential to shift primary care from reactive secondary screening to proactive primary prevention. The online tool, developed as a proof of concept, illustrates how this model could be integrated into clinical workflows. For example, if embedded within electronic medical record systems, it could provide real-time risk scores, creating a risk-stratified screening pathway. This would allow general practitioners to prioritize formal audiometry and targeted interventions for individuals at the highest risk, potentially improving the efficiency of health care resource allocation.

Comparison With Prior Work

Our risk prediction model incorporates demographic, lifestyle, and chronic disease factors, aligning broadly with previous studies. It is well established that age is strongly associated with hearing loss. Hearing decline along with aging is primarily attributed to progressive degeneration of cochlear sensory hair cells, stria vascularis, and spiral ganglion neurons, leading to impaired sound transmission [18,32]. Consistent with previous studies, we found a higher risk of hearing loss in individuals with lower education levels [33,34]. This may be attributed to its correlation with lower socioeconomic status [35], which can limit health care access and use, ultimately leading to poorer health status. Additionally, lower education levels are often associated with unhealthy lifestyle attributes (eg, smoking, alcohol consumption, and exercise habits) that were included in our model [36]. Interestingly, while recent studies have suggested a potential link between diabetes and hearing loss [37,38], our study did not identify a statistically significant association between the two. Instead, hypertension was included as a predictor in the model. It may contribute to hearing decline by reducing blood supply to the stria vascularis, a cochlear structure dependent on terminal arteries without collateral circulation [39,40].

Recent studies have focused on a prediction model for hearing loss. Ge et al [41] developed a logistic regression model for ARHL using 18 key factors based on 401 older adults. Their model achieved an AUC value of 0.777, and the sensitivity and specificity were 70.9% and 75.3%, respectively; however, due to limited sample size and absence of internal and external validation, further scrutiny is required regarding its predictive performance. Similarly, Wang et al [42] developed a predictive model comprising 22 features. While the model achieved an AUC of 0.864 and 0.844 during training and internal validation, the large number of variables and complex operation may hinder its clinical applicability and generalizability. Predictive models should strike a balance between performance and practicality, aiming to achieve the highest possible predictive accuracy with the fewest possible variables [43].

Strengths and Limitations

The primary strengths of this study include its large, multicenter sample, which enhances the robustness and generalizability of our findings, and a rigorous variable selection process. The resulting 6-predictor model is simple, and its variables are accessible from routine physical examinations without requiring specialized tests. Furthermore, the development of an online tool highlights its potential feasibility for real-world implementation. Finally, the selection of an interpretable white-box model (logistic regression) was a deliberate choice. While more complex black-box algorithms may offer marginal gains in accuracy, their lack of transparency is a significant barrier to clinical adoption. Our model achieves an optimal balance between its robust, externally validated performance and the essential need for interpretability, making it well suited for guiding real-world clinical decisions.

However, this study is not devoid of limitations. First, our study was conducted exclusively in primary health care centers within Beijing. While we attempted to mitigate this by validating across 3 distinct centers, the shared urban setting may influence the results by limiting the model’s generalizability. Second, our model intentionally omits key risk factors such as genetic predisposition, long-term noise exposure, and ototoxic medication use. This was a necessary trade-off to create a practical tool based solely on variables routinely collected in health checkups. However, we acknowledge that the absence of these predictors may limit the model’s accuracy in populations where such factors are highly prevalent. Third, hearing loss was assessed using the whispered voice test rather than pure-tone audiometry. To mitigate this, all professionals followed a standardized protocol. However, this method may influence the results by underestimating mild hearing loss and being subject to interexaminer variability. Fourth, several predictors (eg, physical function, exercise frequency, and dietary habits) were based on self-report, which may be prone to information bias and reduce predictive precision. Finally, as a retrospective cohort study, our analysis can suggest temporal relationships between exposures and outcomes but cannot establish definitive causal inferences due to potential confounding and selection biases.

Future Directions and Conclusions

Building on this work, several avenues for future research are recommended. The immediate priority is to validate the current nomogram in broader and more diverse geographic and demographic cohorts, using pure-tone audiometry as the gold standard outcome for a more precise assessment. Future prospective studies could also enhance the model’s accuracy by incorporating key variables not available in our dataset, such as genetic predisposition, long-term noise exposure, and objective measures of physical activity. Finally, as the current online tool is a proof of concept, dedicated implementation research is required. This should address the practical challenges of integrating the tool into diverse electronic medical record systems and evaluate its real-world impact on clinical decision-making, patient outcomes, and cost-effectiveness.

In conclusion, this study provides a validated, practical tool that shows promise for identifying older adults at high risk of hearing loss. If confirmed in future studies, this nomogram could become a valuable asset in the primary prevention of ARHL.

Acknowledgments

The authors would like to thank the study participants, as well as the nurses and general practitioners from the primary health care institutions, for their invaluable support. The authors gratefully acknowledge the financial support from the Beijing Municipal Social Science Foundation (23SRB007). The authors used the generative AI tool (Google Gemini), to assist with language editing and improve the text’s readability. The authors take full responsibility for the final publication.

Data Availability

The datasets generated or analyzed during this study are not publicly available due to patient privacy regulation but are available from the corresponding author on reasonable request.

Authors' Contributions

Conceptualization: JL, SJ, LS, and JEL

Data curation: QS, HW, and YZ

Formal analysis: JL and SJ

Funding acquisition: LS

Writing original draft: JL, SJ, LS, JEL, QS, MS, HW, and YZ

Writing–review & editing: SJ, LS, MS and JEL

Conflicts of Interest

None declared.

  1. GBD 2015 Disease Injury Incidence and Prevalence Collaborators. Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet. Oct 08, 2016;388(10053):1545-1602. [FREE Full text] [CrossRef] [Medline]
  2. World report on hearing. World Health Organization (WHO). URL: https://www.who.int/publications/i/item/9789240020481 [accessed 2025-05-29]
  3. Long M, Zheng X, Bu X. China hearing health report. Social Science Academic Press. 2020. URL: https://www.pishu.cn/zxzx/xwdt/574203.shtml [accessed 2025-05-29]
  4. Lin FR, Pike JR, Albert MS, Arnold M, Burgard S, Chisolm T, et al. ACHIEVE Collaborative Research Group. Hearing intervention versus health education control to reduce cognitive decline in older adults with hearing loss in the USA (ACHIEVE): a multicentre, randomised controlled trial. Lancet. Sep 02, 2023;402(10404):786-797. [CrossRef] [Medline]
  5. Rutherford BR, Brewster K, Golub JS, Kim AH, Roose SP. Sensation and psychiatry: linking age-related hearing loss to late-life depression and cognitive decline. Am J Psychiatry. Mar 01, 2018;175(3):215-224. [FREE Full text] [CrossRef] [Medline]
  6. Loughrey DG, Kelly ME, Kelley GA, Brennan S, Lawlor BA. Association of age-related hearing loss with cognitive function, cognitive impairment, and dementia: a systematic review and meta-analysis. JAMA Otolaryngol Head Neck Surg. Feb 01, 2018;144(2):115-126. [FREE Full text] [CrossRef] [Medline]
  7. Jiam NT, Li C, Agrawal Y. Hearing loss and falls: a systematic review and meta-analysis. Laryngoscope. Nov 2016;126(11):2587-2596. [CrossRef] [Medline]
  8. Yévenes-Briones H, Caballero FF, Struijk EA, Rey-Martinez J, Montes-Jovellar L, Graciani A, et al. Association between hearing loss and impaired physical function, frailty, and disability in older adults: a cross-sectional study. JAMA Otolaryngol Head Neck Surg. Nov 01, 2021;147(11):951-958. [FREE Full text] [CrossRef] [Medline]
  9. Lin FR, Niparko JK, Ferrucci L. Hearing loss prevalence in the United States. Arch Intern Med. Nov 14, 2011;171(20):1851-1852. [FREE Full text] [CrossRef] [Medline]
  10. Agrawal Y, Platz EA, Niparko JK. Prevalence of hearing loss and differences by demographic characteristics among US adults: data from the National Health and Nutrition Examination Survey, 1999-2004. Arch Intern Med. Jul 28, 2008;168(14):1522-1530. [CrossRef] [Medline]
  11. Contrera KJ, Wallhagen MI, Mamo SK, Oh ES, Lin FR. Hearing loss health care for older adults. J Am Board Fam Med. May 11, 2016;29(3):394-403. [FREE Full text] [CrossRef] [Medline]
  12. Danhauer JL, Celani KE, Johnson CE. Use of a hearing and balance screening survey with local primary care physicians. Am J Audiol. Jun 2008;17(1):3-13. [CrossRef] [Medline]
  13. Zuo JL, Lan YN, Sun W, Qiao YF, Li J, Shang YY. [Preliminary study on hearing screening for the elderly population undergoing physical examinations]. Zhonghua Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. Feb 07, 2025;60(2):166-171. [CrossRef] [Medline]
  14. US Preventive Services Task Force, Krist AH, Davidson KW, Mangione CM, Cabana M, Caughey AB, et al. Screening for hearing loss in older adults: US preventive services task force recommendation statement. JAMA. Mar 23, 2021;325(12):1196-1201. [CrossRef] [Medline]
  15. GBD 2019 Hearing Loss Collaborators. Hearing loss prevalence and years lived with disability, 1990-2019: findings from the Global Burden of Disease Study 2019. Lancet. Mar 13, 2021;397(10278):996-1009. [FREE Full text] [CrossRef] [Medline]
  16. Bu XK. Primary prevention and treatment of otological diseases and hearing loss: WHO's latest program and China’s response and reflections. Chin Sci J Hear Speech Rehab. 2014;2:40-41. [FREE Full text]
  17. Sydlowski SA, Marinelli JP, Lohse CM, Carlson ML, Hearing Health Collaborative. Hearing health perceptions and literacy among primary healthcare providers in the united states: a national cross-sectional survey. Otol Neurotol. Sep 01, 2022;43(8):894-899. [FREE Full text] [CrossRef] [Medline]
  18. Lin FR. Age-related hearing loss. N Engl J Med. Apr 25, 2024;390(16):1505-1512. [CrossRef]
  19. Lin FR, Thorpe R, Gordon-Salant S, Ferrucci L. Hearing loss prevalence and risk factors among older adults in the United States. J Gerontol A Biol Sci Med Sci. May 2011;66(5):582-590. [FREE Full text] [CrossRef] [Medline]
  20. Tang D, Tran Y, Dawes P, Gopinath B. A narrative review of lifestyle risk factors and the role of oxidative stress in age-related hearing loss. Antioxidants (Basel). Apr 04, 2023;12(4):878. [FREE Full text] [CrossRef] [Medline]
  21. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. Jan 07, 2015;350:g7594. [FREE Full text] [CrossRef] [Medline]
  22. National essential public health service standards (3rd Edition). National Health Commission of the People's Republic of China. URL: https://www.nhc.gov.cn/ewebeditor/uploadfile/2017/04/20170417104506514.pdf#page=93.13 [accessed 2025-05-29]
  23. Eekhof JA, de Bock GH, de Laat JA, Dap R, Schaapveld K, Springer M. The whispered voice: the best test for screening for hearing impairment in general practice? Br J Gen Pract. Aug 1996;46(409):473-474. [FREE Full text] [Medline]
  24. Pirozzo S, Papinczak T, Glasziou P. Whispered voice test for screening for hearing impairment in adults and children: systematic review. BMJ. Oct 25, 2003;327(7421):967. [FREE Full text] [CrossRef] [Medline]
  25. Riley RD, Snell KI, Ensor J, Burke DL, Harrell FE, Moons KG, et al. Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes. Stat Med. Mar 30, 2019;38(7):1276-1296. [FREE Full text] [CrossRef] [Medline]
  26. Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. Dec 1996;49(12):1373-1379. [FREE Full text] [CrossRef] [Medline]
  27. Wang J, Xu Y, Liu L, Wu W, Shen C, Huang H, et al. Comparison of LASSO and random forest models for predicting the risk of premature coronary artery disease. BMC Med Inform Decis Mak. Dec 20, 2023;23(1):297. [FREE Full text] [CrossRef] [Medline]
  28. Kang J, Choi YJ, Kim I, Lee HS, Kim H, Baik SH, et al. LASSO-based machine learning algorithm for prediction of lymph node metastasis in T1 colorectal cancer. Cancer Res Treat. Jul 2021;53(3):773-783. [FREE Full text] [CrossRef] [Medline]
  29. Alba AC, Agoritsas T, Walsh M, Hanna S, Iorio A, Devereaux PJ, et al. Discrimination and calibration of clinical prediction models: users' guides to the medical literature. JAMA. Oct 10, 2017;318(14):1377-1384. [CrossRef] [Medline]
  30. Schisterman EF, Perkins NJ, Liu A, Bondell H. Optimal cut-point and its corresponding Youden Index to discriminate individuals using pooled blood samples. Epidemiology. Jan 2005;16(1):73-81. [CrossRef] [Medline]
  31. Li J. Nomogram prediction tool for hearing loss. shinyapps. URL: https://hearpredict.shinyapps.io/data/ [accessed 2025-05-29]
  32. Ren HM, Ren J, Liu W. Recognition and control of the progression of age-related hearing loss. Rejuvenation Res. Dec 2013;16(6):475-486. [FREE Full text] [CrossRef] [Medline]
  33. Chou C, Beckles GL, Zhang X, Saaddine JB. Association of socioeconomic position with sensory impairment among US working-aged adults. Am J Public Health. Jun 2015;105(6):1262-1268. [CrossRef]
  34. Cruickshanks KJ, Dhar S, Dinces E, Fifer RC, Gonzalez F, Heiss G, et al. Hearing impairment prevalence and associated risk factors in the Hispanic community health study/study of Latinos. JAMA Otolaryngol Head Neck Surg. Jul 01, 2015;141(7):641-648. [FREE Full text] [CrossRef] [Medline]
  35. Emmett SD, Francis HW. The socioeconomic impact of hearing loss in U.S. adults. Otol Neurotol. Mar 2015;36(3):545-550. [FREE Full text] [CrossRef] [Medline]
  36. Cruickshanks KJ, Nondahl DM, Tweed TS, Wiley TL, Klein BE, Klein R, et al. Education, occupation, noise exposure history and the 10-yr cumulative incidence of hearing impairment in older adults. Hear Res. Jun 01, 2010;264(1-2):3-9. [FREE Full text] [CrossRef] [Medline]
  37. Gupta S, Eavey RD, Wang M, Curhan SG, Curhan GC. Type 2 diabetes and the risk of incident hearing loss. Diabetologia. Feb 6, 2019;62(2):281-285. [FREE Full text] [CrossRef] [Medline]
  38. Akinpelu OV, Mujica-Mota M, Daniel SJ. Is type 2 diabetes mellitus associated with alterations in hearing? A systematic review and meta-analysis. Laryngoscope. Mar 07, 2014;124(3):767-776. [CrossRef] [Medline]
  39. Jin X, Xu X, Wang J, Liu X, Deng X, Xie H. Association between hypertension and hearing loss: a systemic review and meta-analysis. Front Neurol. Jan 7, 2024;15:1470997. [FREE Full text] [CrossRef] [Medline]
  40. Li P, Pang K, Zhang R, Zhang L, Xie H. Prevalence and risk factors of hearing loss among the middle-aged and older population in China: a systematic review and meta-analysis. Eur Arch Otorhinolaryngol. Nov 13, 2023;280(11):4723-4737. [CrossRef] [Medline]
  41. Ge J, Yan Y, Zhu Y, Cheng X, Li H, Sun X, et al. Development and validation of the screening tool for age-related hearing loss in the community based on the information platform. Eur Arch Otorhinolaryngol. Jun 11, 2024;281(6):2893-2903. [FREE Full text] [CrossRef] [Medline]
  42. Wang Y, Ye C, Wang D, Li C, Wang S, Li J, et al. Construction and evaluation of a high-frequency hearing loss screening tool for community residents. Int J Environ Res Public Health. Nov 23, 2021;18(23):12311. [FREE Full text] [CrossRef] [Medline]
  43. Efthimiou O, Seo M, Chalkou K, Debray T, Egger M, Salanti G. Developing clinical prediction models: a step-by-step guide. BMJ. Sep 03, 2024;386:e078276. [FREE Full text] [CrossRef] [Medline]


ARHL: age-related hearing loss
AUC: area under the receiver operating characteristic curve
LASSO: least absolute shrinkage and selection operator


Edited by N Cahill; submitted 23.Jul.2025; peer-reviewed by P Sooful, P Usuemerai, O Mbanefo; comments to author 17.Sep.2025; revised version received 15.Nov.2025; accepted 16.Nov.2025; published 10.Dec.2025.

Copyright

©Jing Li, Shuai Jin, Liu Sun, Jun-E Liu, Qiang Shen, Miao Shang, Hanting Wang, Yuanyuan Zhao. Originally published in the Interactive Journal of Medical Research (https://www.i-jmr.org/), 10.Dec.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Interactive Journal of Medical Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.i-jmr.org/, as well as this copyright and license information must be included.