A structural equation model to test a conceptual framework of oral health in Japanese edentulous patients with an item weighting method using factor score weights: a cross-sectional study
BMC Oral Health volume 18, Article number: 71 (2018)
To investigate Locker’s multidimensional model of oral health in Japanese edentulous patients with an item weighting method using factor score weights, which is more accurate than the sum scoring method. A previous study tested Locker’s model in edentulous elders in the UK, using empirical evidence from the Short-Form Oral Health Impact Profile (OHIP-14). Investigating the model using the OHIP for edentulous subjects (OHIP-EDENT), which contains 19 items suitable for these patients, may complement that study. Testing Locker’s model in Japanese patients may support generalization of the model.
A total of 394 patients who were edentulous in both arches and visited the Dental Hospital of Tokyo Medical and Dental University for new complete dentures were recruited. This cross-sectional study had a non-probabilistic sampling design and included the following: data collection; application of the new item weighting method that involves hierarchical confirmatory factor analysis (CFA) to derive factor score weights for each item, using the bootstrap method, to check the significance of the factor score weights; and empirical testing of Locker’s conceptual model of oral health in Japanese edentulous patients, using structural equation modelling analysis with the bootstrap method for precise estimations and model generation.
Factor score weights derived from CFA were significant. After item weighting, the initial model was analyzed and found to have an inconsistent direct path (functional limitation to disability). This path was eliminated from the model and the modified model was re-run. All effects were significant. The model showed acceptable fit on indices including the model chi-squared, standardized root-mean-square residual, root mean-square error of approximation, goodness-of-fit index, comparative fit index, and P-value.
Our findings showed an empirical fit to Locker’s model in Japanese edentulous patients when using the item weighting method, which was more accurate than the sum scoring method. These results could contribute to the generalization of Locker’s model.
The experimental procedures were published in the University hospital Medical Information Network (UMIN) Center (UMINCTR Clinical Trial, Unique trial Number: UMIN000028711).
Oral health-related quality of life (OHRQoL) is a multidimensional construct. OHRQoL has been researched mostly based on Locker’s conceptual model of oral health . Locker proposed a scientific model that aims to specify the complicated consequences of oral disease on quality of life. Nevertheless, no study, except for that by Baker , has investigated Locker’s model explicitly using empirical evidence. In that study , data for three samples (general adults, edentulous elders, and patients with xerostomia) were analyzed and the short version of the Oral Health Impact Profile (OHIP-14)  was used as the measure.
The OHIP  is often used to evaluate the multidimensional construct of OHRQoL. However, the large number of items included makes it difficult for participants to complete the survey. Therefore, the OHIP-14 was designed and has been widely adopted to assess the association between OHRQoL and a clinical intervention . However, because of a floor effect, the OHIP-14 cannot determine improvements in edentulous persons following clinical intervention . The OHIP-EDENT is a shortened version of the OHIP, which includes 19 items suitable for edentulous persons. By including an item on chewing and eating difficulty, the OHIP-EDENT could detect OHRQoL changes in edentulous persons with new or different prostheses . In the present study, the Japanese version of the Oral Health Impact Profile for edentulous subjects (OHIP-EDENT-J), a cross-culturally adapted scale, was used .
Historically, the numbers of edentulous persons in developed countries have been decreasing. However, given the present ageing of societies, the need for treatment of edentulous persons is not anticipated to decrease overall . The World Health Organization recommends that socioepidemiological research focusing on high-risk groups, including edentulous patients, is needed in order to improve the health of older adults . Further, Critchlow and Ellis  concluded that the evidence base in complete denture research suffers from an insufficient number of well-conducted studies. Using the OHIP-14, Baker  succeeded in indicating that Locker’s conceptual model of oral health is supported by empirical evidence in edentulous elders as well as in the general adult population. An investigation applying the OHIP-EDENT to Locker’s model in edentulous patients may complement Baker’s study.
Item weighting is a process by which the relative weight of events can be expressed. Using a weighted scoring system, the discriminant validity of OHIP was improved to a small extent ; however, it does not have good cost-performance . That is, item weighting is a time-consuming process that offers only slight improvement of discriminant validity. On the other hands, DiStefano et al.  reported that sum scoring was a non-refined method because its score does not necessarily indicate adequate contribution to the factor (e.g., negative factor loading). Zucoloto et al.  also regarded sum scoring as an inaccurate method, and proposed a second-order or third-order model for derivation of the scores on the subscales and an overall score for the measure that adequately improves the accuracy of estimation of the construct using the structural equation modelling (SEM) method. SEM is a powerful multi-variable analytical method that can present direct and indirect effects separately and express complicated relationships in a path diagram .
The aim of this study was to investigate Locker’s conceptual model of oral health in Japanese edentulous patients with the OHIP-EDENT-J using SEM with the item weighting method proposed by Zucoloto et al. in order to generalize Locker’s model. The following hypotheses were tested: functional limitations would be related to disability, which would be related to handicap, which in turn would be related to pain and discomfort; both pain and discomfort would be associated with disability; and pain would be related to discomfort. These hypotheses were adopted as the conceptual model of oral health in a sample of edentulous elders in a previous study by Baker .
The study was conducted in three stages: 1) collection of data; 2) deriving weighting formulae from hierarchical confirmatory factor analysis (CFA) to improve the accuracy of the estimation ; and 3) empirical testing of Locker’s conceptual model of oral health in Japanese edentulous patients with the OHIP-EDENT-J  using SEM analysis after item weighting derived from CFA. A cross-sectional design with non-probabilistic sampling was adopted.
The participants were systemically healthy persons who were edentulous in both arches and visited the Dental Hospital of Tokyo Medical and Dental University requesting new complete dentures during the period from January 2009 to April 2015. The exclusion criteria included no existing denture or dentures and non-attendance before measurements. Three hundred and ninety-four patients were recruited for the study. One patient was hospitalized, another one was withdrawn, 49 had missing data, leaving 343 patients (87.1%, mean age 76.3 ± 8.3 years) for analysis. The patient characteristics, oral condition, and quality of previous dentures were investigated by calibrated prosthodontists with more than 4 years of clinical experience, during the creation of the new complete dentures (Table 1). The method devised by Cawood and Howell  was employed to assess the residual ridge forms. Denture stability and retention were estimated using the Kapur method . Jaw relation was estimated by investigating whether premature contact was existing or not in centric relation. The assessments of patient characteristics, oral condition, and quality of previous dentures are part of the screening process for patients requesting new complete dentures, and thus were not purely for purpose of this study. All subjects provided written informed consent to participate in this study.
To investigate the multidimensional construct of OHRQoL, the OHIP was assessed using the OHIP-EDENT-J . The OHIP-EDENT-J has 19 items and consists of seven subscales (functional limitation, pain, psychological discomfort, physical disability, psychological disability, social disability, and handicap) and is based on Locker’s model . Functional limitation is defined as the extent of depression of function of body parts or systems. The definition of discomfort is the self-assessment of physical and psychological distress, including pain and other feelings that are not directly observable. Disability is expressed as three dimensions of well-being (physical, psychological, and social). Handicap is concerned with the social effects of disease, which are broader than those of disability . Participants were asked how many times they had experienced the impact of each item in the previous month using a scale ranging from 0 (never) to 4 (very often).
Factor score weights
To improve the accuracy of estimation of the construct, we employed hierarchical CFA using SEM analysis [14, 15]. The SEM analysis was conducted with AMOS (SPSS Statistics version 17.0, SPSS Inc., Chicago, IL). Given that many authors have indicated their calculation of the OHIP by summing all items, the existence of the third-order factor (OHIP) is presumably assured . Therefore, we performed CFA using a third-order hierarchical CFA model and derived a formula whereby the third-order factor (OHIP) could be estimated. The third-order model has been described in the literature . The scores derived from the formula can obtain a more accurate estimation than the simple summing method. In detail, the weighting formula derived from the third-order model included factor score weights for items 1–19. The product of the factor score weight and average deviation of item score for the raw data was adopted as the final item score to investigate the hypothesized model. Evaluation of the significance of factor score weights was conducted using bias-corrected bootstrapped 95% confidence intervals (CIs)  based on 1000 replications. The method used to assess the model fit of CFA is described in the following paragraph.
Testing the Locker model
Locker’s conceptual model of oral health in edentulous patients was empirically investigated using SEM. The hypothesized model was that used in a previous study of edentulous patients by Baker . The maximum likelihood method is adopted for estimation of free parameters and requires data that have a normal distribution. More than 1.0 of absolute value of kurtosis was regarded as non-normal distribution. The bootstrap method can also be used to determine parameter estimates in data that have a non-normal distribution . Parameter estimates of the direct and indirect effects were determined using the bootstrapping method with 1000 iterations.
Estimation of model fit
We assessed model fit to the data using five indices commonly used in SEM analysis, i.e., the chi-squared test and P-value, the standardized root-mean-square residual (SRMR), the root mean-square error of approximation (RMSEA), the comparative fit index (CFI), and the goodness-of-fit index (GFI) . As the chi-squared value increases and the P-value consequently decreases, the fit of the model becomes increasingly worse. A ‘larger’ P-value indicates a ‘better’ model fit. SRMR values less than 0.08 are generally considered to be favorable [19, 20]. In general, an RMSEA less than 0.05 indicates a close fit, values between 0.05 and 0.08 indicate a reasonable fit, and an RMSEA more than 0.1 indicates a poor fit . A GFI and a CFI of 1.0 indicates a complete model fit. Generally, a GFI and a CFI greater than 0.95 indicates a good fit [19, 20].
Strategy in model specification
There are some strategies involved in specification and evaluation of the model. MacCallum and Austin  proposed three SEM analysis strategies: (a) a strictly confirmatory strategy, in which a single a priori model is investigated; (b) a model generation strategy, in which an initial model is fitted to the data and then modified as necessary until the fit is adequate; and (c) an alternative model strategy, in which various a priori models are studied. We employed (a) a strictly confirmatory strategy for CFA and (b) a model generation strategy for the Baker model.
The means, medians, and standard deviations (SDs) of the observed variables before weighting and Pearson’s correlations between observed variables after weighting are shown in Table 2. There were no correlations with high coefficients (> 0.85), indicating that multicollinearity did not occur in the SEM analysis.
Univariate kurtosis in items 2, 7, 10, 13, and 15–19 (CFA section), handicap (Baker model section after weighting), and multivariate kurtosis (CFA and Baker model section) indicated a non-normal distribution.
Factor score weights
We derived the weighting formula from hierarchical CFA in which the third-order model was employed using raw data (OHIP item score). The CFA model and the bootstrap standardized estimates of direct effect are shown in Fig. 1. The fit indices were as follows: chi-squared = 897.03 (146 degrees of freedom), P < 0.001, CFI = 0.83, GFI = 0.76, RMSEA = 0.12 (90% CI 0.11–0.13), and SRMR = 0.089. The fit of the model was poor. The bootstrap standardized estimates and the standard error and CI values for the factor score weights of each item (OHIP) are shown in Table 3. All factor score weights were significant. Based on the model, the item scores for the third-order factor (OHIP) can be estimated by the following formula :
Testing the locker model
The main (Baker) model for the a priori hypotheses showed an acceptable fit on all indices: the GFI was 1.00, the CFI was 1.00, the RMSEA was 0.00 (90% CI 0.00–0.08), the SRMR was 0.013, the chi-squared value (3 degrees of freedom) was 2.139, and the P-value was 0.544 with weighted data. However, the direct effect of functional limitation on disability was a minus quantity, which was inadequate considering the consistency of association (worse functional limitation was associated with improving disability). Therefore, the path was deleted from the initial hypothesized (modified Baker) model. When the modified Baker model was re-run, the data supported Locker’s conceptual model  in terms of the estimation of effects and fit indices. The fit indices of the modified Baker model were as follows: GFI = 1.00, CFI = 1.00, RMSEA = 0.00 (90% CI 0.00-0.08), SRMR = 0.013, chi-squared value (4 degrees of freedom) = 3.431, and P-value = 0.488. Therefore, all five criteria were met. The modified Baker model accounted for 66% of the variance in pain, 66% in discomfort, 56% in disability, and 57% in handicap. The bootstrap standardized estimates, standard error values, and bias-corrected 95% CIs of direct effects and indirect effects are shown in Fig. 2.
The present findings support Locker’s conceptual model of oral health  and complement a previous well-designed study . Both the study by Baker and the present study show that Locker’s model can be generalized to various samples, including both edentulous patients and the general adult population, and that in both UK and Japanese edentulous sample, Locker’s model can be applied.
By empirical analysis of the structure of a model, a theoretical model may be evaluated as highly sophisticated when compared with models that explain the nature of directional relationships between elements . SEM is a powerful analytical method that is useful for investigating complex relationships like the structure of the elements of OHIP and presents the percentage of variance of the variables. In this study, the final (modified Baker) model explained 66% of the variance in pain, 66% in discomfort, 56% in disability, and 57% in handicap. That is, 34%–44% of the variance was not expressed in the model. Baker  referred to coping strategies, social support, sense of coherence, and negative affectivity as key contextual factors that may have improved interpretability. Moreover, we propose that elements of personality, such as neuroticism and life satisfaction, play an important role in oral health. Fenlon et al.  demonstrated that neuroticism had an influence on satisfaction with complete dentures and Yamaga et al.  indicated that satisfaction with complete dentures was associated with OHIP. Therefore, neuroticism may influence oral health. Locker et al.  showed a significant relationship between life satisfaction and oral health in older adults. Therefore, life satisfaction may be related to oral health, especially in edentulous patients. If these variables had been included in this study model, more variation in OHIP elements may have been obtained.
In the present study, the final (modified Baker) model indicated higher fit indices than those indicated in the previous study  in edentulous patients. The P-value in the previous study was 0.350 and in the present study was 0.488. This may be because we used the OHIP-EDENT, which succeeded in eliminating the ceiling effect by including items relevant to chewing and eating difficulty , and not the sum scoring method but the item weighting method using hierarchical CFA with SEM analysis.
Jenkinson  indicated that the item weighting method is not so useful, whereas Zucoloto et al.  affirmed the correctness of item weighting. Jenkinson showed that measurements of health status are not significantly improved by weighting of items . On the other hand, Zucoloto et al.  referred to the usefulness of the scoring method that adopted CFA with SEM. The theoretical concepts of physical, psychological, and social as second-order, or OHIP as third-order, have been discussed in the literature . However, to date, its construct validity could not be tested by CFA analysis, which is important for accurate estimation. Therefore, further study is needed. The sum scoring method does not necessarily express the degree of effect of the score on the factor (OHIP). On the other hand, this weighting method can reflect how the score contributed to the factor (OHIP).
SEM analysis requires a large sample size (individuals) to obtain a precise estimation in free parameters. No absolute criteria for sample size exist in the literature. However, the complexity of the model is thought to be critical for sample size (individuals). A larger sample (individuals) was needed because the model was more complex and included more free parameters. In general, 20 individuals per free parameter is considered the desirable sample size . Given that the hypothesized (Baker) model in the present study had 12 free parameters to be estimated, 240 individuals was considered the minimum adequate sample size. The third-order hierarchical (CFA) model had 44 free parameters to be estimated. Therefore, 880 individuals were needed. On the other hand, sample size (individuals) more than 200 was recommended in the field of social psychology for SEM analysis in the point of absolute criteria based on the general guide . Both models met this recommendation.
In this study, the third-order model was used to interpret the multidimensional construct of OHRQoL and adjust item scores. It is possible to use various models, including CFA, to derive weight factor scores and understand the construct. For example, Baker  constructed a model for use in housebound edentulous elders in which functional (OHIP) was used as the latent variable (first order), physical, psychological, and social as indicator variables, and the covariance between the residual error of the psychological and social items was added. In the literature, the relevance of general health perception, functional (OHIP), and symptom status was investigated using a two-stage approach to SEM analysis . Therefore, a more macroscopic view might be required to capture the multidimensional construct of OHRQoL rather than detailed elements, such as physical pain, as employed in this study. While a number of possible models exist, the third-order model was used to derive factor score weights because the third-order model covers all possible models and is not perfect but has been adequately tested in the literature . A model fit was poor in the CFA model from which factor score weights were derived. However, bias-corrected bootstrapped 95% confidence intervals showed significance; the sample size recommendation in terms of absolute criteria was met. Moreover, the bootstrapping method had been recommended as the best approach for small-moderate sample sizes .
In the final (modified Baker) model, the direct effect of functional limitation on disability was not examined because of apparent inconsistency in the amount of direct effect. That is, it appears that more functional limitation decreases disability as derived from the initial hypothesized model, whereas functional limitation has a significant large indirect effect on disability. To wit, in edentulous patients, functional limitation influences disability indirectly rather than directly. This is because of the strong direct link between functional limitation and pain (0.81) and the indirect link between pain and discomfort (0.73). Clinically, it may be that functional limitation (e.g., dentures not fitting) has an indirect influence on disability (e.g., avoidance of eating) via pain or discomfort rather than a direct influence. In terms of general statistical principles, not all the potential direct relationships were incorporated (the parsimony principle) .
The main limitation of this study is its cross-sectional rather than longitudinal design. Thereby, a causal relationship could not be shown. Further studies including intervention would be required to determine the relationship between change in scores for before and after outcome variables. According to the theory of response shift , a follow-up response may be influenced by new information not available at the time of the initial response. On outcome evaluation, the response shift causes bias that confuses the meaning of the score. To eliminate this source of bias, future studies should include a longitudinal design.
The results of the present study show an empirical fit to Locker’s model in Japanese edentulous patients by an item weighting method using factor score weights, which has more accuracy than the sum scoring method. This finding may contribute to the generalization of Locker’s model.
Confirmatory factor analysis
Comparative fit index
The Short-Form Oral Health Impact Profile,
OHIP for edentulous subjects
The Japanese version of the Oral Health Impact Profile for edentulous subjects
Oral health-related quality of life
Root mean-square error of approximation
Structural equation modeling
Standardized root-mean-square residual
The University hospital Medical Information Network
Locker D. Measuring oral health: a conceptual framework. Community Dent Health. 1988;5:3–18.
Baker SR. Testing a conceptual model of oral-health: a structural equation modeling approach. J Dent Res. 2007;86:708–12.
Slade GD. Derivation and validation of a short form oral health impact profile. Community Dent Oral Epidemiol. 1997;25:284–90.
Slade GD, Spencer AJ. Development and evaluation of oral health impact profile. Community Dent Health. 1994;11:3–11.
Ikebe K, Watkins CA, Ettinger RL, Sajima H, Nokubi T. Application of short-form oral health impact profile on elderly Japanese. Gerodontology. 2004;21:167–76.
Allen PF, Locker D. A modified short version of the oral health impact profile for assessing health-related quality of life in edentulous adults. Int J Prosthodont. 2002;15:446–50.
Sato Y, Kaiba Y, Yamaga E, Minakuchi S. Reliability and validity of a Japanese version of the oral health impact profile for edentulous subjects. Gerodontology. 2012;29:e1033–7.
Douglass CW, Shih A, Ostry L. Will there be a need for complete dentures in the United States in 2020? J Prosthet Dent. 2002;87:5–8.
Peterson PE, Yamamoto T. Improving the oral health of older people: the approach of the WHO global oral health Programme. Community Dent Oral Epidemiol. 2005;33:81–92.
Critchlow BC, Ellis JS. Prognostic indicators for conventional complete denture therapy: a review of the literature. J Dent. 2010;38:2–9.
Allen PF, Locker D. Do item weights matter? An assessment using the oral health impact profile. Community Dent Health. 1997;14:133–8.
Allen PF, Steele J. Data validity and quality. In: Lesaffre E, Feine J, Leroux B, Declerck D, editors. Statistical and methodological aspects of oral Health Research. Chichester: Wiley; 2009. p. 131–44.
DiStefano C, Zhu M, Mîndrilă D. Understanding and using factor scores: considerations for the applied researcher. Pract Assess Res Eval. 2009;14:1–11.
Zucoloto ML, Maroco J, Campos JADB. Psychometric properties of the oral health impact profile and new methodological approach. J Dent Res. 2014;93:645–50.
Kline RB. Principles and practice of structural equation modeling. 3rd ed. New York: Guilford Press; 2011.
Cawood JI, Howell RA. A classification of the edentulous jaws. Int J Oral Maxillofac Surg. 1988;17:232–6.
Kapur KK. A clinical evaluation of denture adhesives. J Prosthet Dent. 1967;18:550–8.
Efron B, Tibshirani R. Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy. Stat Sci. 1986;1:54–77.
Hu L, Bentler PM. Fit indices in covariance structure modeling: sensitivity to underparameterized model misspecification. Psychol Methods. 1998;3:424–53.
Hu L, Bentler PM. Cutoff criteria for fit indices in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model. 1998;6:1–55.
MacCallum RC, Austin JT. Applications of structural equation modeling in psychological research. Ann Rev Psychol. 2000;51:201–26.
Taillefer MC, Dupuis G, Roberge MA, May SL. Health-related quality of life models: systematic review of the literature. Soc Indic Res. 2003;64:293–323.
Fenlon MR, Sherriff M, Newton JT. The influence of personality on patients’ satisfaction with existing and new complete dentures. J Dent. 2007;35:744–8.
Yamaga E, Sato Y, Minakuchi S. A structural equation model relating oral condition, denture quality, chewing ability, satisfaction, and oral health-related quality of life in complete denture wearers. J Dent. 2013;41:710–7.
Locker D, Clarke M, Payne B. Self-perceived oral health status, psychological well-being, and life satisfaction in an older adult population. J Dent Res. 2000;79:970–5.
Jenkinson C. Why are we weighting? A critical examination of the use of item weights in a health status measure. Soc Sci Med. 1991;32:1413–6.
Slade GD. The oral health impact profile. In: Slade GD, editor. Measuring oral health and quality of life: Department of Dental Ecology, School of Dentistry, Chapel Hill: University of North Carolina; 1997. p. 93–104. https://www.adelaide.edu.au/arcpoh/downloads/publications/reports/miscellaneous/measuring-oral-health-and-quality-of-life.pdf. Accessed 24 Apr 2018.
Baker SR. Testing the applicability of a conceptual model of oral health in housebound edentulous older people. Community Dent Oral Epidemiol. 2008;36:237–48.
Anderson JC, Gerbing DW. Structural equation modelling in practice: a review and recommended two-step approach. Psychol Bull. 1988;103:411–23.
Schwartz CE, Sprangers MAG. Methodological approaches for assessing response shift in longitudinal health-related quality-of-life research. Soc Sci Med. 1999;48:1531–48.
This research was funded by a Grant-in-Aid for Scientific Research from the Japan Society for the Promotion of Science (Grant 26861628). The role of the funding agency was financial support, and it was not involved in the design of the study or collection, analysis, and interpretation of data or in writing the manuscript.
Availability of data and materials
The data that support the findings of this study are available from the corresponding author.
Ethics approval and consent to participate
The experimental procedures were approved by the Ethics Committee of Tokyo Medical and Dental University with registration number 232. All subjects provided written informed consent to participate in this study.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Yamaga, E., Sato, Y. & Minakuchi, S. A structural equation model to test a conceptual framework of oral health in Japanese edentulous patients with an item weighting method using factor score weights: a cross-sectional study. BMC Oral Health 18, 71 (2018). https://doi.org/10.1186/s12903-018-0527-1