Skip to main content

Genome-Wide Association Study (GWAS) of dental caries in diverse populations



Dental caries is one of the most common chronic diseases and is influenced by a complex interplay of genetic and environmental factors. Most previous genetic studies of caries have focused on identifying genes that contribute to dental caries in specific ethnic groups, usually of European descent.


The aim of this study is to conduct a genome-wide association study (GWAS) to identify associations affecting susceptibility to caries in a large multiethnic population from Argentina, the Philippines, Guatemala, Hungary, and the USA, originally recruited for studies of orofacial clefts (POFC, N = 3686). Ages of the participants ranged from 2 to 12 years for analysis of the primary dentition, and 18–60 years for analysis of the permanent dentition. For each participant, dental caries was assessed by counts of decayed and filled teeth (dft/DFT) and genetic variants (single nucleotide polymorphisms, SNPs) were genotyped or imputed across the entire genome. Caries was analyzed separately for the primary and permanent dentitions, with age, gender, and presence/absence of any type of OFC treated as covariates. Efficient Mixed-Model Association eXpedited (EMMAX) was used to test genetic association, while simultaneously accounting for relatedness and stratification.


We identified several suggestive loci (5 × 10−8 < P < 5 × 10−6) within or near genes with plausible biological roles for dental caries, including a cluster of taste receptor genes (TAS2R38, TAS2R3, TAS2R4, TASR25) on chromosome 7 for the permanent dentition analysis, and DLX3 and DLX4 on chromosome 17 for the primary dentition analysis. Genome-wide significant results were seen with SNPs in the primary dentition only; however, none of the identified genes near these variants have known roles in cariogenesis.


The results of this study warrant further investigation and may lead to a better understanding of cariogenesis in diverse populations, and help to improve dental caries prediction, prevention, and/or treatment in future.

Peer Review reports


Dental caries is a common multifactorial disease, in which environment and genetics each play important roles. Dental caries is a global public health problem and if left untreated, can lead to serious complications including pain, infection, abscess and loss of teeth [1] There are at least 2.3 billion people globally affected with caries of the permanent dentition, and more than 530 million children with caries of the primary dentition [2]. The 2010 Global Burden of Disease, Injury and Risk Factors (GBD) study estimated that oral diseases, such as untreated dental caries and severe periodontitis, accounted for 15 million disability-adjusted life-years (DALYs) globally among people older than 60 years old; and untreated dental caries in permanent dentition was the most common condition in the entire 2010 GBD study (global prevalence for all ages is 35%) [3].

The role of environmental risk factors in causing dental caries is very well established, and includes factors such as lack of fluoride, poor oral hygiene habits, and consuming a diet high in sucrose [4, 5]. In addition, disparities in terms of income, educational level and access to dental services in areas such as Appalachia in the United States could be associated with an increased risk of oral diseases, especially dental caries [6]. Moreover, a systematic review study indicated that socioeconomic factors, such as educational level, occupation and income, are highly associated with dental caries among adults [7].

Over the last decade, human genetic studies have made significant progress identifying genetic risk factors associated with dental caries. Previous genome-wide association studies (GWAS) of dental caries, summarized in Table 1, have reported numerous associations. However, these prior studies were limited to a single ethnic group and/or in a single dentition [8,9,10]. Therefore, the aim of the current GWAS study is to investigate both primary and permanent dentitions in a multiethnic cohort of children and adults.

Table 1 Genome-wide studies of dental caries



The study sample (N = 3686) comprised individuals recruited as part of a large international study of orofacial clefting. Individuals recruited for this study included affected cases, their unaffected family members, and unrelated healthy controls from the United States (Colorado, Iowa, Pennsylvania, Texas, and Puerto Rico) and internationally (Argentina, the Philippines, and Hungary). All participant provided written informed consent for themselves and for children younger than 18 years, informed consent was obtained from their parents or their legally authorized representative. Local ethical approval was obtained at each site and all methods in this study were performed in accordance with the Institutional Review Board policies and guidelines of the University of Pittsburgh and all of the other sites.

The total sample for the primary dentition analysis included 1116 children (age 2–12 years) and 2570 adults for the permanent dentition analysis (ages 18–60 years). Table 2 summarizes the sex and cleft status breakdown for each analysis. Since our participants came from multiple sites, Table 3 summarizes the distribution of participants for the different recruitment sites.

Table 2 Basic characteristics of the study cohorts
Table 3 Distribution of participants across different sites

Dental caries assessment

The same data collection protocols were used for every site. Dental caries scores (dft/DFT) were assessed by trained dentists or dental hygienists, with data collected either through in-person dental examinations and/or intraoral photos (at least 5 per participant, maxillary and mandibular occlusal, right and left lateral, anterior biting) to appropriately cover the entire oral cavity, as previously reported [11]. The dft/DFT index was calculated as the total number of teeth with decayed, and/or filled/restored surfaces. In addition, questionnaires recording dental history were collected for all participants in the study. We used the dft/DFT index instead of dmft/DMFT due to incomplete information regarding missing teeth for the majority of the participants. Note that for the children, primary dentition caries scores (dft) were obtained only from the primary teeth, any permanent teeth in these children were not included. Table 3 summarizes the dental caries mean scores for the different recruitment sites in analyses of both the primary and permanent dentitions.

All of the training and calibration for the intra-oral photos and the in-person dental exam was completed prior to the start of data collection. Each photo rater (BJH, LMU, and ARV) rated randomly selected participants (n = 15) for calibration twice. Results from ratings by LMU and ARV were calibrated against the gold-standard rater, BJH (note BJH’s intra-rater reliability (kappa) = 0.95)). The Inter-rater reliability (kappa) between all the 3 raters = 0.91–0.93. Data from 158 participants who had both intra-oral photos and in-person dental exams were used to measure the consistency between in-person dental exams and the ratings from intra-oral photos; the results showed excellent agreement (kappa > 90%) between the two methods [11].


Participants were genotyped on the Illumina Human610-Quadv1_B BeadChip (Illumina, Inc., San Diego, CA, USA) and Illumina Infinium II assay protocol by the Center for Inherited Disease Research (CIDR) at Johns Hopkins University. Data cleaning and quality assurance procedures were conducted in conjunction with the CIDR data cleaning center at the University of Washington. 529,285 SNPs (single nucleotide polymorphism) were genotyped, and additional SNPs were imputed based on the 1000 Genomes Project reference panel resulting in 35,153,445 SNPs [12]. Of those, 8,859,951 SNPs passed quality control and analysis filters (participant call rates > 90%; SNP call rates > 99%; Hardy–Weinberg p values > 0.0001; MAF > 2%) and were therefore analyzed in this study.

Statistical analysis

Genetic analyses

Caries scores in the primary dentition (dft) were analyzed separately from caries scores in the permanent dentition (DFT), both as quantitative traits. The analysis of dft in the primary dentition included participants of aged 2–12 years, and the analysis of DFT in the permanent dentition included participants aged 18–60 years. The statistical software used, namely, the Efficient Mixed-Model Association eXpedited (EMMAX) [13] uses a variance component-based method for testing the SNP main effects along with a genetic relationship matrix (GRM) and covariates (in this case, sex, recruitment site, age, age2, and cleft status). Results of modeling of those covariates to determine if they are needed to be included to adjust our primary dft and permanent DFT analyses are shown in Table 4. Adjustment for those covariates were included in the final analyses. The genomic inflation factor (λ), and Manhattan plots were generated using the R v3.4.1 statistical analysis environment (R Core Team, 2019). Given the issue of multiple comparisons, conservative p-value thresholds for genome-wide statistical significance were set to P < 5 × 10−8. The p-value thresholds for suggestive significance were set equal to 5 × 10−8 < P < 1 × 10−6.

Table 4 Regression results for the covariates

Bioinformatic approaches for the genetic results

Different bioinformatics and visualization tools were used in this study to interpret the GWAS results at both the SNP and gene level. Loci/genes of interest were identified and reported based on physical proximity of ± 500 kb windows from the lead SNP at each of the loci associated with dental caries. We used LocusZoom [14], to visualize the associations results in the regions around the lead SNPs at each locus. To gain more information regarding the biological consequences of the lead SNPs in the risk loci, variant annotation tools, such as HaploReg, were used to annotate regulatory information, such as enhancer/promoter regions, expression quantitative trait loci (eQTLs), and transcription factor binding sites [15]. Other regulatory databases, such as the Genotype-Tissue Expression (GTEx) [16], and Encyclopedia of DNA Elements [17] were used, too, to help in interpreting the associations results.


Cohort characteristics and covariate modeling

The total study sample comprised 3686 genotyped and phenotyped individuals. The primary dentition cohort (n = 1116) included 616 males (55.5%), and 497 females (44.5%) with an age range of 2–12 years and a mean age of 6.9 years (Table 2), 46.1% of the participants were from U.S recruitment sites, and 53.1% from international sites. The permanent dentition cohort (n = 2570) included 1001 males (38.9%), and 1569 females (61.1%) with an age range of 18–60 years and a mean age of 34.4 years (Table 2), with 33.2% of the participants from U.S recruitment sites, and 66.8% of participants from international sites. General linear regression analysis (details in Table 4) showed that age, recruitment site, cleft status (affected vs. unaffected with cleft) and sex had an impact on the DFT and dft mean scores; therefore, these factors were included as covariates in the genetic analyses.

Genetic results

Manhattan plots illustrating the GWAS results for the primary dft and permanent DFT are shown in Fig. 1a, c. The genomic inflation factor (λ) for the primary dentition analysis was 0.98, and for the permanent dentition analysis was 0.99, indicating no inflation of p values (Fig. 1b, d). Note that associations identified under the GWAS approach do not specify which SNP at that locus is the causal SNP (cause the association) or which gene is affected by the causal SNP. Here we report any known biological functions of genes near our significant and suggestive results that have plausible roles in dental caries or have some biological relevance to tooth development and/or oral health.

Fig. 1
figure 1

Manhattan plots and (Q-Q) plots showing GWAS results for the analyses. Red lines represent thresholds for genome-wide significance (p value < 5 × 10−8). Blue lines represent thresholds for suggestive significance (p value < 5 × 10−6). a Manhattan plots for primary dft show negative log10-transformed p values (y-axis) across the whole genome (x-axis). Genotyped and imputed SNPs are plotted together. b The quantile–quantile plot (Q-Q) for GWAS of Primary dft. The genomic inflation factor (λ) is 0.98. c Manhattan plots for permanent DFT show negative log10-transformed p values (y-axis) across the whole genome (x-axis). Genotyped and imputed SNPs are plotted together. d The quantile–quantile plot (Q–Q) for GWAS of permanent DFT. The genomic inflation factor (λ) is 0.99

Primary dentition

For the primary dft, SNPs at multiple loci exceeded the threshold for genome-wide significance (P < 5 × 10−8) and others exhibited suggestive significance (5 × 10−8 < P < 1 × 10−6). The biological roles of potential candidate genes at these loci are described in Table 5. Genes at two of the genome-wide significant loci have no known role in cariogenesis and warrant further investigation. One of the lead SNPs (rs113021760, P = 3.36 × 10−8), is located in the intron of the SUSD1 gene (Sushi Domain Containing 1), so it might play a regulatory role. In addition, this SNP shows enhancer chromatin marks in different tissues, including osteoblasts, which are found in the periodontal ligament and alveolar bone in the oral cavity.

Table 5 Significant and suggestive SNPs across the analyses

One of the strongest suggestive association signals for the primary dft was at 17q21.33 (rs16948495, P = 5.16 × 10−8) which is less than 100 kb downstream from DLX3 and upstream from DLX4 (see Fig. 2a). These genes belong to the Distal-less homeobox family, known to be expressed in cranial neural crest cells and in the craniofacial mesenchyme; and important in tooth development [18,19,20]. DLX3, which plays an essential role in skeletal formation and development, is also required for tooth morphogenesis [21, 22]. Furthermore, a recent study also showed that DLX3 plays an important role in dentinogenesis. Mutations in this gene cause tricho-dento-osseous (TDO) syndrome, a rare syndrome that affects the teeth and bones, including enamel hypoplasia and dentin hypoplasia [23]. These severe outcomes of DLX3 mutations emphasize the important role DLX3 plays in the development of bones and teeth.

Fig. 2
figure 2

LocusZoom of regions of Interest. Panels a, b and c are association results from the primary dentition GWAS, panels d and e are from the permanent dentition GWAS. a suggestive locus near DLX3 and DLX4 genes on chromosome 17. b suggestive locus near TLN1 and CA9 on chromosome 9. c suggestive locus near NFX1 gene on chromosome 9. d suggestive locus near REL gene on chromosome 2. e suggestive locus near Taste receptor genes (TAS2R38, TAS2R3, TAS2R4, TASR25) on chromosome 7. The genome build used for the recombination rate was based on 1000 Genomes November 2014 EUR data. All of the gene positions and directions of transcription are annotated on the plots

DLX4 is highly expressed in human dental pulp cells (DPCs) and is associated with abnormal human tooth formation [24]. DLX4 also plays an important role in forebrain and craniofacial and development. A mutation in DLX4 was reported to cause a non-syndromic form of cleft lip and palate [25]. Rs16948495 shows enhancer chromatin marks in osteoblasts. These lines of evidence suggest that both DLX3 and DLX4 may play important roles in the dental caries process by the regulation tooth development [21,22,23, 25].

There was also a strong suggestive association signal for primary dft and a SNP at 9p13.3 (chr9:35,753,170, P = 1.89 × 10−7, Fig. 2b). The gene CA9 (CA IX) is located less than 200 kb upstream from the lead SNP at this locus: CA9 encodes a family of enzymes with at least 14 different isoforms that participate in several biological processes including the secretion of saliva and pH homeostasis [26]. In candidate gene studies, a different member of the CA family (CA VI) was investigated due to its role in saliva secretion and regulating salivary pH and was found to be significantly associated with dental caries in children [27,28,29]. When we looked-up CA6 gene in our association results, we did find a SNP (rs58579969), 300 kb downstream from it. However, this SNP only showed nominal evidence of an association with primary dft (P = 1.2 × 10−3).

Another gene at the 9p13.3 locus is TLN1; a cytoskeletal protein that has been implicated in several biological functions, including migration of fibroblasts and osteoclasts [30, 31]. The expression of tln1 in zebrafish was observed in the craniofacial cartilage structures, including the palate. Tln1 mutant zebrafish had malformation in their palate and craniofacial muscles, which indicate the possible role of tln1 in craniofacial morphogenesis [32]. Neither TLN1 nor CA9 genes have been implicated in dental caries before, however, their plausible roles in craniofacial morphogenesis and pH regulation could suggest that they may influence cariogenesis.

Permanent dentition

In the permanent dentition analysis, there were no SNPs meeting the genome-wide significance level, but there were results with multiple loci that were within the range for the genome-wide suggestive level (5 × 10−8 < P < 1 × 10−6). Summaries of the lead SNPs and nearby genes from the permanent analysis are presented in Table 5.

One of the strongest suggestive association signals for the permanent DFT was at the 9p21.1 locus (rs17226825, P = 7.47 × 10−8) which is located within APTAX (Fig. 2c), and the encoded protein of this gene is involved in the repair of DNA damage in cells [33]; however, no role is currently known for this gene in odontogenesis or dental caries. The lead SNP is a strong eQTL (expression quantitative trait locus) for APTAX (per an (eQTL) database [34]. It is also a strong eQTL for other genes (B4GALT1, SMU1, RP11-54K16.2) [16]. The lead SNP is also approximately 300 kb upstream from the NFX1 gene, which plays an important role in regulating the duration of inflammatory responses [35].

Another suggestive association signal at 2p16.1 (rs11686767, P = 3.11 × 10−8), is intronic to PAPOLG (Fig. 2d) which plays a role in catalyzing template-independent extension of a DNA/RNA strand [36] The SNP is a strong eQTL for PAPOLG [16], however, no known role yet of this gene in odontogenesis or dental caries. Finally, rs11686767 was also 100 kb upstream to REL gene. The protein encoded by this gene, proto-oncogene c-Rel, is involved in many important cellular processes such as apoptosis, inflammation, and the immune response [37]. Those genes; NFX1 and REL could contribute to dental caries by influencing host susceptibility to oral microorganism.

An interesting suggestive association for the permanent dentition DFT was at 7q34 (lead SNP rs11197981; P = 1.11 × 10−6), approximately 200 kb downstream of taste receptor genes (TAS2R38, TAS2R3, TAS2R4, TASR25) (Fig. 2e), which have plausible roles in dental caries. Specifically, TAS2R38 as has been associated with dental caries in previous studies [38,39,40,41]. Further, taste receptor genes, specifically TAS2R38, influence bitter perception and dietary habits that are hypothesized to lead to consuming more sweets in the diet and thus leading to dental caries. The lead SNP at this locus is also within the OR9A4 gene, which is responsible for activating the neural response that initiates the perception of smell [42].

To further investigate these significant or suggestive SNPs, we used the FUMA platform (Functional Mapping and Annotation of Genome-Wide Association Studies) [43] to prioritize, annotate, and interpret the genomic variants and genes from the GWAS results of both primary and permanent analyses. Gene-based testing using GWAS results was computed by MAGMA (Multi-marker Analysis of GenoMic Annotation) using the default settings implemented in FUMA. After mapping the top SNPs from the GWAS analyses to 19,182 protein coding genes, the genome-wide significance level was calculated based on the number of tested genes and it was set at 0.05/19182 = 2.61 × 10−6. However, none of the genes in both analyses reached the genome wide significance level. FUMA results of the risk genomic loci for both analyses are presented in the Additional file 1: Tables S1 and S2.


Dental caries is one of the most common chronic diseases that affects both children and adults, and is influenced by a complex interplay of diet, bacteria, salivary flow, genetic factors and other environmental factors. Untreated dental caries could lead to tooth loss which can negatively impact the oral health quality of life (OHRQoL) because it affects function, aesthetics and satisfaction. OHRQoL is a multidimensional concept that includes the impact of dental health on the individual’s functional well-being, emotional health, satisfaction, and self-esteem. OHRQoL is very important because of its usefulness in addressing the impact of oral health disparities and access to care on overall health and quality of life to communicate it with policymakers to improve oral health by increasing access to care [44, 45].

There are now a number of effective preventive methods for dental caries (good oral hygiene, balanced diet, regular dental care) [46, 47]. Most notably, stannous fluoride, such as in toothpaste and water, has been an important public health advance by reducing the prevalence of dental caries [48]. However, dental caries is still wide spread worldwide [3], requiring additional approaches to understanding risk factors for dental caries.

The current study took a genome-wide approach to investigate genetic factors contributing to dental caries in the primary and permanent dentitions. Several previous genetic studies have investigated and identified the role of different genes in cariogenesis. Most of these past studies involved samples that were ethnically homogeneous, mainly of European descent [49,50,51,52,53], unlike the current multiethnic study (see also Table 1).

Notably, results in the primary dentition nominated a few SNPs that showed association at genome-wide significance, and multiple loci that showed suggestive evidence for association. In the permanent dentition, no variants reached genome-wide significance, but several reached suggestive significances. Importantly, none of the top associated SNPs in either of the analyses (dft and DFT) were replicated in the other one, full results are presented in the Additional file 1: Tables S3 and S4. This confirms that genetic loci that influence the susceptibility to dental caries differ between primary and permanent dentitions [54]. Although none of the genome-wide significant association signals were near genes with known roles in dental caries or tooth development, some of the suggestive signals were near genes with potential roles in dental caries.

Some of the new associations identified in this study were in or near genes that have important roles in the inflammatory process, such as NFX1 and REL genes (permanent dentition). Those genes could contribute to dental caries by influencing host susceptibility to oral microorganism. This study also identified association signals near genes that have a confirmed role in tooth and craniofacial development, such as TLN1 and DLX genes. Heterozygous variants in DLX4 have been implicated before in Orofacial clefting [25]. This could indicate that both dental caries and OFC share common genetic risk loci. This is plausible because some of the genes that had been previously implicated in dental caries and OFCs are craniofacial development genes. PKD2 gene is one of the genes that has been associated with dental caries and had an implication in craniofacial development [8] and mutation in PKD2 has been linked to craniofacial anomalies and tooth loose in mice [55]. Thus, our future investigations will focus on exploring the relationship between dental caries and OFCs on both phenotypic and genetic levels.

This study also supported the role of taste receptor genes in dental caries (TAS2R38, TAS2R3, TAS2R4, TASR25, see Table 5), which have been associated with dental caries in previous studies [38,39,40,41]. On the other hand, several loci/genes associated with dental caries from previous large meta-GWAS studies of dental caries [50, 53] were reviewed in our results (the lead SNPs ± 500 kb, a total of 5935 SNPs), but none of these variants reached statistical significance (see Additional file 1: Tables S5, S6 for details).

In the permanent dentition subset of the current study population, females had a significantly higher rate of dental caries than males (P = 0.000234) consistent with earlier studies [56, 57]. However, there was no gender difference in the primary dentition. Possible explanations for the increased rates of dental caries among adult females include (i) the fact that teeth tend to erupt earlier in females, which allows for a longer exposure to a cariogenic oral environment, and (ii) hormonal changes during pregnancy affect saliva pH leading to a decrease in the concentration of calcium, phosphorus, magnesium, and chloride, which, in turn might lead to increased caries. (iii) other behavioral/environmental factors during pregnancy such as consuming a diet high in sucrose and poor oral hygiene could also increase the risk of developing more dental caries.


In summary, this study found multiple associations near genes that may play important roles in dental caries, such as TAS2R38, TAS2R3, TAS2R4, TASR25, DLX3 and DLX4, several of which have plausible biological functions relevant to tooth development and cariogenesis. These findings contribute to our understanding of the genetic mechanisms of cariogenesis, providing targets for follow-up translational studies that will eventually improve prevention and treatment of this highly prevalent chronic disease worldwide.

Availability of data and materials

The datasets generated and/or analysed during the current study are available in the dbGaP repository,



Number of decayed or filled primary tooth


Number of decayed or filled permanent teeth


Genome-wide association study


Minor allele frequency


Single nucleotide polymorphism


  1. Leal SC, Bronkhorst EM, Fan M, Frencken JE. Untreated cavitated dentine lesions: impact on children’s quality of life. Caries Res. 2012;46(2):102–6.

    Article  PubMed  Google Scholar 

  2. Disease GBD, Injury I, Prevalence C. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392(10159):1789–858.

    Article  Google Scholar 

  3. Marcenes W, Kassebaum NJ, Bernabe E, Flaxman A, Naghavi M, Lopez A, Murray CJ. Global burden of oral conditions in 1990–2010: a systematic analysis. J Dent Res. 2013;92(7):592–7.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Ahluwalia M, Brailsford SR, Tarelli E, Gilbert SC, Clark DT, Barnard K, Beighton D. Dental caries, oral hygiene, and oral clearance in children with craniofacial disorders. J Dent Res. 2004;83:175–9.

    Article  PubMed  Google Scholar 

  5. Shaffer JR, Wang X, DeSensi RS, Wendell S, Weyant RJ, Cuenco KT, Crout R, McNeil DW, Marazita ML. Genetic susceptibility to dental caries on pit and fissure and smooth surfaces. Caries Res. 2012;46:38–46.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Martin B, Wilkerson AH, Patterson G, Nahar VK, Sharma M. Need for addressing oral health disparities in rural Appalachia. Health Promot Perspect. 2017;7(4):178–80.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Costa SM, Martins CC, Bonfim Mde L, Zina LG, Paiva SM, Pordeus IA, Abreu MH. A systematic review of socioeconomic indicators and dental caries in adults. Int J Environ Res Public Health. 2012;9(10):3540–74.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Shaffer JR, Feingold E, Wang X, Lee M, Tcuenco K, Weeks DE, Weyant RJ, Crout R, McNeil DW, Marazita ML. GWAS of dental caries patterns in the permanent dentition. J Dent Res. 2013;92(1):38–44.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Shaffer JR, Polk DE, Feingold E, Wang X, Cuenco KT, Weeks DE, DeSensi RS, Weyant RJ, Crout R, McNeil DW, et al. Demographic, socioeconomic, and behavioral factors affecting patterns of tooth decay in the permanent dentition: principal components and factor analyses. Community Dent Oral Epidemiol. 2013;41(4):364–73.

    Article  PubMed  Google Scholar 

  10. Shaffer JR, Wang X, Feingold E, Lee M, Begum F, Weeks DE, Cuenco KT, Barmada MM, Wendell SK, Crosslin DR, et al. Genome-wide association scan for childhood caries implicates novel genes. J Dent Res. 2011;90:1457–62.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Howe BJ, Cooper ME, Wehby GL, Resick JM, Nidey NL, Valencia-Ramirez LC, Lopez-Palacio AM, Rivera D, Vieira AR, Weinberg SM, et al. Dental decay phenotype in nonsyndromic orofacial clefting. J Dent Res. 2017;96:1106–14.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Leslie EJ, Carlson JC, Shaffer JR, Feingold E, Wehby G, Laurie CA, Jain D, Laurie CC, Doheny KF, McHenry T, et al. A multi-ethnic genome-wide association study identifies novel loci for non-syndromic cleft lip with or without cleft palate on 2p24.2, 17q23 and 19q13. Hum Mol Genet. 2016;25(13):2862–72.

  13. Zhou X, Stephens M. Efficient multivariate linear mixed model algorithms for genome-wide association studies. Nat Methods. 2014;11:407–9.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, Gliedt TP, Boehnke M, Abecasis GR, Willer CJ. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics. 2010;26(18):2336–7.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Ward LD, Kellis M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 2012;40(Database issue):D930-934.

    Article  PubMed  Google Scholar 

  16. Consortium GT. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science. 2015;348(6235):648–60.

    Article  Google Scholar 

  17. Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74.

    Article  Google Scholar 

  18. Chai Y, Jiang X, Ito Y, Bringas P Jr, Han J, Rowitch DH, Soriano P, McMahon AP, Sucov HM. Fate of the mammalian cranial neural crest during tooth and mandibular morphogenesis. Development. 2000;127(8):1671–9.

    Article  PubMed  Google Scholar 

  19. Duverger O, Zah A, Isaac J, Sun HW, Bartels AK, Lian JB, Berdal A, Hwang J, Morasso MI. Neural crest deletion of Dlx3 leads to major dentin defects through down-regulation of Dspp. J Biol Chem. 2012;287(15):12230–40.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Thomas BL, Tucker AS, Qui M, Ferguson CA, Hardcastle Z, Rubenstein JL, Sharpe PT. Role of Dlx-1 and Dlx-2 genes in patterning of the murine dentition. Development. 1997;124(23):4811–8.

    Article  PubMed  Google Scholar 

  21. Zhao Z, Stock D, Buchanan A, Weiss K. Expression of Dlx genes during the development of the murine dentition. Dev Genes Evol. 2000;210(5):270–5.

    Article  PubMed  Google Scholar 

  22. Isaac J, Erthal J, Gordon J, Duverger O, Sun HW, Lichtler AC, Stein GS, Lian JB, Morasso MI. DLX3 regulates bone mass by targeting genes supporting osteoblast differentiation and mineral homeostasis in vivo. Cell Death Differ. 2014;21(9):1365–76.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Li Y, Han D, Zhang H, Liu H, Wong S, Zhao N, Qiu L, Feng H. Morphological analyses and a novel de novo DLX3 mutation associated with tricho-dento-osseous syndrome in a Chinese family. Eur J Oral Sci. 2015;123(4):228–34.

    Article  PubMed  Google Scholar 

  24. Tamaoki N, Takahashi K, Aoki H, Iida K, Kawaguchi T, Hatakeyama D, Inden M, Chosa N, Ishisaki A, Kunisada T, et al. The homeobox gene DLX4 promotes generation of human induced pluripotent stem cells. Sci Rep. 2014;4:7283–7283.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Wu D, Mandal S, Choi A, Anderson A, Prochazkova M, Perry H, Gil-Da-Silva-Lopes VL, Lao R, Wan E, Tang PL, et al. DLX4 is associated with orofacial clefting and abnormal jaw development. Hum Mol Genet. 2015;24(15):4340–52.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Pastorekova S, Parkkila S, Pastorek J, Supuran CT. Review article. J Enzyme Inhibit Med Chem. 2004;19(3):199–229.

    Article  Google Scholar 

  27. Frasseto F, Parisotto TM, Peres RC, Marques MR, Line SR. Nobre Dos Santos M: Relationship among salivary carbonic anhydrase VI activity and flow rate, biofilm pH and caries in primary dentition. Caries Res. 2012;46(3):194–200.

    Article  PubMed  Google Scholar 

  28. Peres RC, Camargo G, Mofatto LS, Cortellazzi KL, Santos MC, Nobre-dos-Santos M, Bergamaschi CC, Line SR. Association of polymorphisms in the carbonic anhydrase 6 gene with salivary buffer capacity, dental plaque pH, and caries index in children aged 7–9 years. Pharmacogenomics J. 2010;10(2):114–9.

    Article  PubMed  Google Scholar 

  29. Picco DCR, Lopes LM, Rocha Marques M, Line SRP, Parisotto TM. Nobre dos Santos M: Children with a higher activity of carbonic anhydrase VI in saliva are more likely to develop dental caries. Caries Res. 2017;51(4):394–401.

    Article  PubMed  Google Scholar 

  30. Monkley SJ, Pritchard CA, Critchley DR. Analysis of the mammalian talin2 gene TLN2. Biochem Biophys Res Commun. 2001;286(5):880–5.

    Article  PubMed  Google Scholar 

  31. Zou W, Izawa T, Zhu T, Chappel J, Otero K, Monkley SJ, Critchley DR, Petrich BG, Morozov A, Ginsberg MH, et al. Talin1 and Rap1 are critical for osteoclast function. Mol Cellul Biol. 2013;33(4):830–44.

    Article  Google Scholar 

  32. Ishii K, Mukherjee K, Okada T, Liao EC. Genetic requirement of talin1 for proliferation of cranial neural crest cells during palate development. Plast Reconstr Surg Glob Open. 2018;6(3):e1633.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Sano Y, Date H, Igarashi S, Onodera O, Oyake M, Takahashi T, Hayashi S, Morimatsu M, Takahashi H, Makifuchi T, et al. Aprataxin, the causative protein for EAOH is a nuclear protein with a potential role as a DNA repair protein. Ann Neurol. 2004;55(2):241–9.

    Article  PubMed  Google Scholar 

  34. Westra HJ, Peters MJ, Esko T, Yaghootkar H, Schurmann C, Kettunen J, Christiansen MW, Fairfax BP, Schramm K, Powell JE, et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet. 2013;45(10):1238–43.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Yamashita S, Fujii K, Zhao C, Takagi H, Katakura Y. Involvement of the NFX1-repressor complex in PKC-δ-induced repression of hTERT transcription. The J Biochem. 2016;160(5):309–13.

    Article  PubMed  Google Scholar 

  36. Topalian SL, Kaneko S, Gonzales MI, Bond GL, Ward Y, Manley JL. Identification and functional characterization of neo-poly(A) polymerase, an RNA processing enzyme overexpressed in human tumors. Mol Cell Biol. 2001;21(16):5614–23.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Shono Y, Tuckett AZ, Ouk S, Liou H-C, Altan-Bonnet G, Tsai JJ, Oyler JE, Smith OM, West ML, Singer NV, et al. A small-molecule c-Rel inhibitor reduces alloactivation of t cells without compromising antitumor activity. Cancer Discov. 2014;4(5):578–91.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Keller KL, Tepper BJ. Inherited taste sensitivity to 6-n-propylthiouracil in diet and body weight in children. Obes Res. 2004;12(6):904–12.

    Article  PubMed  Google Scholar 

  39. Mennella JA, Pepino MY, Reed DR. Genetic and environmental determinants of bitter perception and sweet preferences. Pediatrics. 2005;115(2):e216–22.

    Article  PubMed  Google Scholar 

  40. Wendell S, Wang X, Brown M, Cooper ME, DeSensi RS, Weyant RJ, Crout R, McNeil DW, Marazita ML. Taste genes associated with dental caries. J Dent Res. 2010;89(11):1198–202.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Yildiz G, Ermis RB, Calapoglu NS, Celik EU, Türel GY. Gene-environment interactions in the etiology of dental caries. J Dent Res. 2016;95(1):74–9.

    Article  PubMed  Google Scholar 

  42. Niimura Y. Olfactory receptor multigene family in vertebrates: from the viewpoint of evolutionary genomics. Curr Genomics. 2012;13(2):103–14.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8(1):1826.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Fiorillo L. Oral health: the first step to well-being. Medicina (Kaunas). 2019;55(10):676.

    Article  Google Scholar 

  45. Sischo L, Broder HL. Oral health-related quality of life: what, why, how, and future implications. J Dent Res. 2011;90(11):1264–70.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Grigalauskiene R, Slabsinskiene E, Vasiliauskiene I. Biological approach of dental caries management. Stomatologija. 2015;17(4):107–12.

    PubMed  Google Scholar 

  47. Twetman S. Prevention of dental caries as a non-communicable disease. Eur J Oral Sci. 2018;126(Suppl 1):19–25.

    Article  PubMed  Google Scholar 

  48. Fiorillo L, Cervino G, Herford AS, Laino L, Cicciu M. Stannous fluoride effects on enamel: a systematic review. Biomimetics (Basel). 2020;5(3):41.

    Article  Google Scholar 

  49. Gasse B, Grabar S, Lafont AG, Quinquis L, Opsahl Vital S, Davit-Béal T, Moulis E, Chabadel O, Hennequin M, Courson F, et al. Common SNPs of AmelogeninX (AMELX) and dental caries susceptibility. J Dent Res. 2013;92(5):418–24.

    Article  PubMed  Google Scholar 

  50. Haworth S, Shungin D, van der Tas JT, Vucic S, Medina-Gomez C, Yakimov V, Feenstra B, Shaffer JR, Lee MK, Standl M, et al. Consortium-based genome-wide meta-analysis for childhood dental caries traits. Human Mol Genet. 2018;27(17):3113–27.

    Article  Google Scholar 

  51. Izakovicova Holla L, Borilova Linhartova P, Lucanova S, Kastovsky J, Musilova K, Bartosova M, Kukletova M, Kukla L, Dusek L. GLUT2 and TAS1R2 polymorphisms and susceptibility to dental caries. Caries Res. 2015;49(4):417–24.

    Article  PubMed  Google Scholar 

  52. Kastovsky J, Borilova Linhartova P, Musilova K, Zackova L, Kukletova M, Kukla L, Izakovicova Holla L. Lack of association between BMP2/DLX3 gene polymorphisms and dental caries in primary and permanent dentitions. Caries Res. 2017;51(6):590–5.

    Article  PubMed  Google Scholar 

  53. Shungin D, Haworth S, Divaris K, Agler CS, Kamatani Y, Keun Lee M, Grinde K, Hindy G, Alaraudanjoki V, Pesonen P, et al. Genome-wide analysis of dental caries and periodontitis combining clinical and self-reported data. Nat Commun. 2019;10(1):2773–2773.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Wang X, Shaffer JR, Zeng Z, Begum F, Vieira AR, Noel J, Anjomshoaa I, Cuenco KT, Lee MK, Beck J, et al. Genome-wide association Scan of dental caries in the permanent dentition. BMC Oral Health. 2012.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Khonsari RH, Ohazama A, Raouf R, Kawasaki M, Kawasaki K, Porntaveetus T, Ghafoor S, Hammond P, Suttie M, Odri GA, et al. Multiple postnatal craniofacial anomalies are characterized by conditional loss of polycystic kidney disease 2 (Pkd2). Hum Mol Genet. 2013;22(9):1873–85.

    Article  PubMed  Google Scholar 

  56. Ferraro M, Vieira AR. Explaining gender differences in caries: a multifactorial approach to a multifactorial disease. Int J Dent. 2010.

    Article  PubMed  PubMed Central  Google Scholar 

  57. Salvolini E, Di Giorgio R, Curatola A, Mazzanti L, Fratto G. Biochemical modifications of human whole saliva induced by pregnancy. Br J Obstet Gynaecol. 1998;105(6):656–60.

    Article  PubMed  Google Scholar 

Download references


Many thanks to the participating families world-wide who made this project possible. The tireless efforts by the dedicated field staff and collaborators was similarly essential to the success of this study. Special thanks to Margaret Cooper, Judith Resick, Dr. Eduardo Castilla and Dr. Andrew Czeizel.


This work was supported by grants from the National Institutes of Health (NIH) including: R01-DE016148 [MLM, SMW], X01-HG007485 [MLM, EF], U01-DE024425 [MLM], R37-DE008559 [JCM, MLM], R21-DE016930 [MLM], R01-DE012472 [MLM], R01-DE011931 [JTH], U01-DD000295 [GLW], R00-DE024571 [CJB], S21-MD001830 [CJB], U54-MD007587 [CJB], R56-DE027055 [JRS]. Genotyping and data cleaning were provided via an NIH/NIDCR contract to the Johns Hopkins Center for Inherited Disease Research: HHSN268201200008I. Additional support provided by an intramural grant from the Research Institute of the Children's Hospital of Colorado [FWD]; additional operating costs support in the Philippines was provided by the Institute of Human Genetics, National Institutes of Health, University of the Philippines, Manila [CP].

Author information

Authors and Affiliations



RNA generated the data set for the analyses. RNA, JMC analyzed data. RNA, MLM, JMC, JRS, SMW helped define outcomes to be studied. RNA wrote the first draft of the manuscript. MLM, JRS, SMW, BJH, LMU, NM, and KN critically reviewed the manuscript. RNA & MLM designed the study. RNA interpreted data. MLM, CS, FWBD, KN, CP, FAP, IMO, CJB, JTH, GLW, SMW, ARV, LMU, REL collected and interpreted data. RNA & MLM generated the final draft of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Rasha N. Alotaibi.

Ethics declarations

Ethics approval and consent to participate

Study approval was granted by the University of Pittsburgh Institutional Review Board (coordinating center approval # CR19030367-003, Pittsburgh site approval # CR19080127-00). All participant provided written informed consent for themselves and for children younger than 18 years, informed consent was obtained from their parents or their legally authorized representative. Local ethical approval was obtained at each site and all methods in this study were performed in accordance with the Institutional Review Board policies and guidelines of the University of Pittsburgh and all of the other sites.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Supplementary material.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alotaibi, R.N., Howe, B.J., Chernus, J.M. et al. Genome-Wide Association Study (GWAS) of dental caries in diverse populations. BMC Oral Health 21, 377 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: