Reliability and validity of a questionnaire for self-assessment of complete dentures
© Komagamine et al.; licensee BioMed Central Ltd. 2014
Received: 26 September 2013
Accepted: 25 April 2014
Published: 2 May 2014
Demand for complete denture treatment is expected to rise over several decades. However, to date, no questionnaire on complete dentures, as evaluated by edentulous patients, has been shown to be reliable and valid. This study sought to assess the reliability and validity of Patient’s Denture Assessment (PDA), which provides a multidimensional evaluation of dentures among edentulous patients.
Patients, who had new complete dentures fabricated at the University Hospital of Dentistry, Tokyo Medical and Dental University through 2009 to 2010, were enrolled. The reliability of the PDA was determined by examining internal consistency and test-retest reliability. Internal consistency for all of the question items and the six subscales was measured using Cronbach’s α and average inter-item correlation coefficients among 93 participants. For 33 of these participants, test-retest reliability was determined at a 2 month-interval using the interclass correlation coefficients (ICCs) and 95% confidence interval for the summary scores and the six subscale scores. The PDA was validated in 93 participants by examining the difference in the summary score and the six subscale scores of the PDA before and after replacement with new dentures by the paired t-test. Ability to detect change was also tested in 93 patients using effect size.
The Cronbach’s α for the PDA ranged from 0.56 to 0.93. The average inter-item correlation coefficients ranged from 0.28 to 0.83. ICCs for the PDA ranged from 0.37 to 0.83. The paired t-test showed a significant difference between the summary score and the six subscale scores before and after replacement with new dentures (p < 0.05) and the effect size was 0.97.
The PDA demonstrated good reliability by assessing internal consistency and test-retest reliability. In addition, the PDA demonstrated good validity by assessing discriminant validity. Thus, the PDA could help dentists obtain a detailed understanding of the patients’ perceptions in using their dentures.
KeywordsComplete denture Questionnaire Edentulous Self-assessment Reliability Validity
In recent years, the National Epidemiological Survey data have revealed a decline in the proportion of edentulous people. However, given the simultaneous decline in the percentage of edentulous people and increase in the number of older people, the number of people requiring complete dentures in the United States is predicted to increase over the next 20 years . Thus, the need for complete denture treatment is likely to remain substantial .
Patient-reported outcome measures, as recommended by the United States Food and Drug Administration, is a growing area within research , and the impact of treatment on patients has been attracting attention. This is also the case for prosthetic treatment: the success of complete denture treatment does not depend on the assessment by the dentist, but on patient self-assessment results. The patient self-assessment is an inexpensive and economical method with which to measure the success of the denture and can yield a large amount of data in a short period of time.
Patient satisfaction is rated using a questionnaire that is assessed via a Likert scale or the 100-mm visual analogue scale (VAS). Some patient satisfaction evaluation methods depend on only one question [4, 5], and it is likely that such evaluations may not be reliable. For example, in dissatisfied patients, it is not known whether the dissatisfaction was caused by the lower or upper denture. In addition, patient satisfaction is usually not based on one factor, but on many factors, such as taste, ease of chewing, comfort, retention, pain, fit and aesthetics. Taking those factors into consideration, other evaluation methods have been introduced that comprise several questions to permit the multidimensional evaluation of patient satisfaction [6–9]. Limitations of these questionnaires are the lack of reliabilities and validities. On the other hand, in prosthetic treatment, one of the most widely used evaluation tools for measurements of oral health related quality of life (OHRQoL) is the Oral Health Impact Profile (OHIP). This instrument has been translated into many languages across the globe [10–16], which has allowed for comparisons of reliability and validity between OHIP in different regions. The OHIP assesses the impact of oral conditions on the quality of life using a frequency estimation of the disruption, such as dysfunction, discomfort and disability, in daily activities. Thus, negative-oriented measures composing the OHIP may be unable to capture positive changes in patients with no negative impacts recorded at baseline because the absence of a negative does not necessarily imply a positive .
Consequently, it is necessary to establish a multidimensional patient self-assessment specific to dentures that can capture both positive and negative denture-related effects and have reliability and validity. We first developed a questionnaire for the self-assessment of complete dentures  referred to as the Patient’s Denture Assessment (PDA). The PDA was developed from an original questionnaire that is a subjective assessment about oral, physical, and psychological conditions on daily functioning. In the PDA, some question items contain quality of life similar to the OHIP, but the PDA is absolutely assessment of dentures in themselves. Moreover, the PDA is the bipolar design, which could capture both negative and positive impacts on the patients. The conceptual of framework established for the PDA is in the following. The purposes of complete denture treatment are recoveries of three major functions; chewing, speech, and aesthetics. This three functions influence acceptances and expectations for dentures, which have impacts on patients’ perceptions, consciousness and feelings of dentures. The PDA assesses these impacts of complete denture treatment on the patients’ perceptions, consciousness and feelings of dentures. From the clinical point of view, the PDA will be used for the diagnosis, prognosis, and comparing the efficacy of prosthetic treatment. For example, if the association between the PDA scores before treatment and outcomes after treatment, such as numbers of denture adjustment, length of denture survival and denture survival rate is assessed, the difficulties of treatments and longevities of dentures as prognosis will be predictable from the PDA scores at pre-treatment diagnosis. However, at this point, reliability and validity of the PDA has not been confirmed yet. Thus, the purpose of this study is to demonstrate that the PDA was a reliable and valid instrument for application in clinical settings.
Distribution of patients’ previous and current denture status according to upper and lower dentures (N = 93)
Period of edentulousness
1 year ≤ 3 years
3 years ≤ 5 years
5 years ≤ 10 years
Number of previous complete dentures
1 – 3
4 – 6
Time since current complete dentures
0.5 years ≤ 10 years
Sample size and characteristics of the participants for each investigation
Type of investigation
Mean age (years)
Year of enrolment
2009 & 2010
2009 & 2010
Ability to detect change
2009 & 2010
Development of the patient denture assessment (PDA)
Question items of the patient’s denture assessment (PDA)
Q1. How much pain do you feel?
Q2. How easy is it for you to swallow food boluses and water?
Q3. How well do you enjoy your meals?
Q4. How worn out does your jaw feel?
Aesthetics & speech
Q5. How worried are you about other people watching?
Q6. How easy is it for you to speak?
Q7. How worried are you about your mouth?
Q8. How often do your dentures click when chewing?
Q9. How often does food debris get stuck under your lower denture?
Q10. How is your lower denture retained on the ridge?
Q11. How does your lower denture fit?
Q12. How uncomfortable is your lower denture?
Q13. How satisfactory will the new dentures be?
Q14. How problematic will the new dentures be?
Q15. How well will the new dentures fit?
Q16. How often does food debris get stuck under your upper denture?
Q17. How does your upper denture fit
Q18. How often does your upper denture fall down?
Q19. How much do you consider your dentures as part of your body?
Q20. How important are your dentures to you?
Q21. How much can you care for your dentures without any difficulty?
Q22. How at ease do you feel when wearing your dentures?
All 93 participants were instructed to complete the PDA twice: before replacement (Before-1 PDA), and on completion of denture adjustment after replacement (After PDA). An additional PDA was performed before replacement (Before-2 PDA) for 33 participants randomly selected among the 93 participants. The interval between Before-1 PDA and Before-2 PDA was two months. Each subscale score was calculated by summing the values of the question items corresponding to each subscale. Moreover, the summary score of all the question items was calculated.
Reliability and validity
Internal consistency of the PDA was assessed by Cronbach’s α and average inter-item correlation. Before-1 PDA scores from the 93 participants were used for an assessment of internal consistency. Cronbach’s α for summary score of 0.70-0.80 are considered satisfactory for a reliable comparison between groups and more than 0.90 are required for clinical usefulness of the instrument . The test-retest reliability was assessed by determining the interclass correlation coefficients (ICCs) and 95% confidence interval of the test-retest difference for 33 participants. Factor analysis is reported in previous study . To evaluate discriminant validity, we assessed the difference between Before-1 PDA and After PDA scores in 93 patients using a paired t-test. Values of p < 0.05 were considered significant for the paired t-test.
Ability to detect change
Effect size was used to investigate the ability to detect change to the treatment. Ability to detect change to the questionnaire was assessed in 93 participants by calculating the effect size. Effect size is one of the most commonly used methods for interpreting a change in scores . The effect size was evaluated by calculating the difference between the means of the summary scores of Before-1 PDA and After PDA and dividing the difference by the standard deviation (SD) of the summary scores of Before-1 PDA . To facilitate decisions regarding the clinical importance of the observed change in the measure, some benchmarks have been proposed; a value of 0.2 or less, 0.5 and 0.8 or greater has been proposed to represent low, moderate and high ability to detect change, respectively .
Internal consistency of the PDA was assessed by Cronbach’s α and average inter-item correlation. Test-retest reliability was assessed by the interclass correlation coefficients (ICCs) and 95% confidence interval. Discriminant validity was assessed by a paired t-test between PDA scores before and after treatment. Values of p < 0.05 were considered significant for the paired t-test. Ability to detect change was investigated by effect size. SPSS version 16.0 (SPSS Japan, Tokyo, Japan) was used for all analyses.
Mean values and standard deviations (SDs) for before-1 and after patient’s denture assessment (PDA) scores (N = 93)
Mean ± SD
74 ± 30
83 ± 24
78 ± 24
87 ± 18
72 ± 28
83 ± 23
74 ± 29
87 ± 21
Aesthetics & speech
67 ± 34
88 ± 20
66 ± 33
85 ± 22
58 ± 37
86 ± 21
60 ± 37
72 ± 31
41 ± 35
70 ± 30
45 ± 34
77 ± 28
43 ± 35
82 ± 23
47 ± 36
79 ± 26
64 ± 38
87 ± 20
51 ± 40
80 ± 28
54 ± 42
82 ± 27
60 ± 36
83 ± 24
71 ± 33
89 ± 18
72 ± 33
91 ± 17
88 ± 23
90 ± 20
95 ± 11
97 ± 6
75 ± 26
84 ± 21
79 ± 27
88 ± 18
Reliability and validity
Cronbach’s α and average inter-item correlation coefficients assessed by Before-1 PDA scores (N = 93)
Average inter-item correlation coefficient
Aesthetics & speech
Test-retest reliability assessed by before-1 and before-2 patient’s denture assessment (PDA) scores (N = 33)
Aesthetics & speech
Results of the paired t-test for before-1 and after patient’s denture assessment (PDA) scores
Mean ± SD
1433 ± 430
1850 ± 340
298 ± 93
340 ± 74
Aesthetics & speech
251 ± 116
331 ± 73
176 ± 122
307 ± 93
169 ± 113
249 ± 69
202 ± 90
263 ± 43
336 ± 59
360 ± 48
Ability to detect change
The mean summary scores changed from 1433 for the Before-1 PDA score to 1850 for the After PDA score. The SD of the summary scores of the Before-1 PDA was 430 and the effect size for the summary score was 0.97.
In the present study, both the reliability assessed by internal consistency and test-retest reliability and the validity assessed by discriminant validity of the PDA were satisfactory. Therefore, the PDA is suitable for use in clinical settings.
The internal consistency of the PDA was assessed using two measures: average inter-item correlation and Cronbach’s α. Cronbach’s α is computed from the SDs of the question items and the SD of the total scores. As the difference between the SDs for question items of a subscale becomes larger, the Cronbach’s α of the subscale decreases . In the PDA, each subscale comprised three or four question items, which often resulted in a smaller value of Cronbach’s α in each subscale. Nevertheless, five out of the six subscales presented with a value of 0.8-0.9. Only the ‘importance’ subscale presented with a low Cronbach’s α of 0.56.
It is important to note that, in this study, in describing the importance of dentures, Q20 exhibited a ‘ceiling effect’. This was because all patients had used complete dentures previously and required new dentures, thus complete dentures were important for them. For this reason, the SDs of Q20 were small for most of the patients as compared with that in the other three question items for this subscale. Moreover, the difference between the Before-1 PDA and After PDA scores mean values of Q21 or Q22 was large with wide variance. On the other hand, the difference of Q19 or Q20 was small with narrow variance. The question items of Q21 and Q22 were added after second factor analysis. Thus, the directions of the construct of Q19 or Q20 might be different from Q21and Q22. These led to a small Cronbach’s α, the average inter-item correlation and the impact in the ‘importance’ subscale. By comparison, the highest average inter-item correlation coefficient was in the ‘expectation’ subscale. It is likely that Q13 and Q15 (Table 3) deliver similar content, which may account for the highest score of the average inter-item correlation coefficients in the ‘expectation’ subscale.
The ICCs of the test-retest reliability for the ‘expectation’ and ‘importance’ subscales were low as compared with other studies [10, 11]. The reason for this discrepancy may be that the interval between the test-retest in our study was 2 months while other studies used a 1- or 2-week-interval. The ‘expectation’ subscale consisted of question items related to expectancy of improvement with new dentures and the ‘importance’ subscale consisted of question items related to the importance of dentures to patients, depending on the patients’ perception. The 2-month interval was sufficient time to enhance the relationship and trust between the patients and their operators, which may have influenced their expectation perceptions to new dentures and the importance of wearing dentures.
For the assessment of validity, there was a significant difference in the summary score and the six subscale scores before and after replacement of new dentures. Thus, it is suggested that the PDA can detect differences in patients’ self-assessment between previous and new dentures.
The ‘lower denture’ and ‘expectation’ subscales had lower means for the Before-1 PDA scores than that seen with the other subscales. This suggests that most of the participants selected for the study wore insufficient lower dentures and they had low expectations for the efficacy of their new dentures. Table 1 shows larger number of patients whose period of lower edentulousness was less than one year than upper edentulousness. Moreover, the previous study has shown that patients have a lower expectation for their new dentures when their rating for their previous dentures is low . Therefore, it is suggested that shorter period of edentulousness and insufficient dentures might cause problems with lower dentures and following low expectations in the study.
In the present study, the effect size was 0.97. According to the already-described statement, it is suggested that 0.97 of the ability to detect change in the study represents high ability to detect change. The PDA after treatment was requested after the dentures had been completely adjusted, with all participants given adequate time for these adjustments. Thus, the negative effects of the previous dentures were no longer present and positive effects of the new complete dentures might explain the large effect size reported in the study.
This study demonstrated good reliability and validity of the PDA. Thus, this questionnaire could help dentists obtain a detailed understanding of the patients’ perceptions in using their dentures, which is considered valuable information for dentists when designing new dentures. In our previous study, we demonstrated that sufficient retention of lower dentures and appropriate appearance would lead to improved OHRQoL in edentulous patients who had previously been wearing insufficient lower dentures and/or dentures with an unsatisfactory appearance . The PDA can be thus applied in the clinical situation. However, dentists should carefully interpret the results specific to the ‘importance’ subscale because of the ‘ceiling effect’ and the possibility of the difference direction between Q19 or Q20 and Q21 or Q22 seen in this study.
All participants in the study were recruited in a university hospital setting. Therefore, the reliability and validity of our test must be extrapolated to the general population besides patients in university hospitals. In addition, the questionnaire is presently written in Japanese only. It is recommended that the questionnaire is translated into other languages and evaluated for its reliability and validity with a view for its wider implementation in future studies.
When used among edentulous patients requiring new complete dentures, the PDA demonstrated good reliability by assessing internal consistency and test-retest reliability. In addition, the PDA demonstrated good validity by assessing discriminant validity.
United States food and drug administration
Oral health-related quality of life
Visual analogue scale
Oral health impact profile
Patient’s denture assessment.
This work was supported by a Japanese Society for the Promotion of Sciences KAKENHI Grant-in-Aid for Scientific Research (C) 20592257.
- Douglass CW, Shih A, Ostry L: Will there be a need for complete dentures in the United States in 2020?. J Prosthet Dent. 2002, 87: 5-8. 10.1067/mpr.2002.121203.View ArticlePubMed
- Carlsson GE, Omar R: The future of complete dentures in oral rehabilitation. A critical review. J Oral Rehabil. 2010, 37: 143-156. 10.1111/j.1365-2842.2009.02039.x.View ArticlePubMed
- U.S. Department of Health and Human Services FDA Center for Drug Evaluation and Research; U.S. Department of Health and Human Services FDA Center for Biologics Evaluation and Research; U.S. Department of Health and Human Services FDA Center for Devices and Radiological Health: Guidance for industry: patient-reported outcome measures: use in medical product development to support labeling claims: draft guidance. Health Qual Life Out. 2006, 4: 79-View Article
- Divaris K, Ntounis A, Marinis A, Polyzois GL, Polychronopoulou A: Patients’ profiles and perceptions of complete dentures in a university dental clinic. Int J Prosthodont. 2012, 25: 145-147.PubMed
- Stober T, Danner D, Lehmann F, Séché AC, Rammelsberg P, Hassel AJ: Association between patient satisfaction with complete dentures and oral health-related quality of life: two-year longitudinal assessment. Clin Oral Invest. 2012, 16: 313-318. 10.1007/s00784-010-0483-x.View Article
- Gjengedal H, Berg E, Boe OE, Trovik TA: Self-reported oral health and denture satisfaction in partially and completely edentulous patients. Int J Prosthodont. 2011, 24: 9-15.PubMed
- Turker SB, Sener ID, Ozkan YK: Satisfaction of the complete denture wearers related to various factors. Arch Gerontol Geriat. 2009, 49: e126-e129. 10.1016/j.archger.2008.11.003.View Article
- Forgie AH, Scott BJ, Davis DM: A study to compare the oral health impact profile and satisfaction before and after having replacement complete dentures in England and Scotland. Gerodontology. 2005, 22: 137-142. 10.1111/j.1741-2358.2005.00066.x.View ArticlePubMed
- Kawai Y, Murakami H, Shariati B, Klemetti E, Blomfield JV, Billette L, Lund JP, Feine JS: Do traditional techniques produce better conventional complete dentures than simplified techniques?. J Dent. 2005, 33: 659-668. 10.1016/j.jdent.2005.01.005.View ArticlePubMed
- Bae KH, Kim HD, Jung SH, Park DY, Kim JB, Paik DI, Chung SC: Validation of the Korean version of the oral health impact profile among the Korean elderly. Community Dent Oral Epidemiol. 2007, 35: 73-79. 10.1111/j.1600-0528.2007.00331.x.View ArticlePubMed
- Wong MC, Lo EC, McMillan AS: Validation of a Chinese version of the oral health impact profile (OHIP). Community Dent Oral Epidemiol. 2002, 30: 423-430. 10.1034/j.1600-0528.2002.00013.x.View ArticlePubMed
- van der Meulen MJ, John MT, Naeije M, Lobbezoo F: The Dutch version of the oral health impact profile (OHIP-NL): translation, reliability and construct validity. BMC Oral Health. 2008, 8: 11-10.1186/1472-6831-8-11.PubMed CentralView ArticlePubMed
- Papagiannopoulou V, Oulis CJ, Papaioannou W, Antonogeorgos G, Yfantopoulos J: Validation of a Greek version of the oral health impact profile (OHIP-14) for use among adults. Health Qual Life Out. 2012, 10: 7-10.1186/1477-7525-10-7.View Article
- Sato Y, Kaiba Y, Yamaga E, Minakuchi S: Reliability and validity of a Japanese version of the oral health impact profile for edentulous subjects. Gerodontology. 2012, 29: e1033-e1037. 10.1111/j.1741-2358.2011.00606.x.View ArticlePubMed
- Saub R, Locker D, Allison P: Derivation and validation of the short version of the Malaysian oral health impact profile. Community Dent Oral Epidemiol. 2005, 33: 378-383. 10.1111/j.1600-0528.2005.00242.x.View ArticlePubMed
- Souza RF, Patrocínio L, Pero AC, Marra J, Compagnoni MA: Reliability and validation of a Brazilian version of the oral health impact profile for assessing edentulous subjects. J Oral Rehabil. 2007, 34: 821-826. 10.1111/j.1365-2842.2007.01749.x.View ArticlePubMed
- McGrath C, Broder H, Wilson-Genderson M: Assessing the impact of oral health on the life quality of children: implications for research and practice. Community Dent Oral Epidemiol. 2004, 32: 81-85. 10.1111/j.1600-0528.2004.00149.x.View ArticlePubMed
- Komagamine Y, Kanazawa M, Kaiba Y, Sato Y, Minakuchi S, Sasaki Y: Association between self-assessment of complete dentures and oral health-related quality of life. J Oral Rehabil. 2012, 39: 847-857. 10.1111/joor.12004.View ArticlePubMed
- Bland JM, Altman DG: Cronbach’s alpha. BMJ. 1997, 314: 572-10.1136/bmj.314.7080.572.PubMed CentralView ArticlePubMed
- Szentpétery A, Szabó G, Marada G, Szántó I, John MT: The Hungarian version of the oral health impact profile. Eur J Oral Sci. 2006, 114: 197-203. 10.1111/j.1600-0722.2006.00349.x.View ArticlePubMed
- Allen PF, McMillan AS, Locker D: An assessment of sensitivity to change of the oral health impact profile in a clinical trial. Community Dent Oral Epidemiol. 2001, 29: 175-182. 10.1034/j.1600-0528.2001.290303.x.View ArticlePubMed
- Cohen J: A power primer. Psychol Bull. 1992, 112: 155-159.View ArticlePubMed
- Cortina JM: What is coefficient alpha? An examination of theory and applications. J Appl Psychol. 1993, 78: 98-104.View Article
- Gaspar MG, Dos Santos MB, Dos Santos JF, Marchini L: Correlation of previous experience, patient expectation and the number of post-delivery adjustments of complete dentures with patient satisfaction in a Brazilian population. J Oral Rehabil. 2013, 40: 590-594. 10.1111/joor.12070.View ArticlePubMed
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6831/14/45/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.