- Research article
- Open Access
- Open Peer Review
Predicting oral malodour based on the microbiota in saliva samples using a deep learning approach
BMC Oral Health volume 18, Article number: 128 (2018)
Oral malodour is mainly caused by volatile sulphur compounds produced by bacteria and bacterial interactions. It is difficult to predict the presence or absence of oral malodour based on the abundances of specific species and their combinations. This paper presents an effective way of deep learning approach to predicting the oral malodour from salivary microbiota.
The 16S rRNA genes from saliva samples of 90 subjects (45 had no or weak oral malodour, and 45 had marked oral malodour) were amplified, and gene sequence analysis was carried out. Deep learning classified oral malodour and healthy breath based on the resultant abundances of operational taxonomic units (OTUs)
A discrimination classifier model was constructed by profiling OTUs and calculating their relative abundance in saliva samples from 90 subjects. Our deep learning model achieved a predictive accuracy of 97%, compared to the 79% obtained with a support vector machine.
This approach is expected to be useful in screening the saliva for prediction of oral malodour before visits to specialist clinics.
Oral malodourous compounds are reportedly produced by periodontitis-associated bacteria in the oral cavity, such as those belonging to the genera Porphyromonas and Prevotella, which produce volatile sulphur compounds (VSCs) [1–4]. Fusobacterium nucleatum and Treponema denticola also produce VSCs, but additionally produce butyric acid and other volatile organic compounds that cause oral malodour [3, 5–7]. Nevertheless, because there are over 700 known bacterial species in the human oral cavity [8, 9], and because several species produce VSCs to varying degrees, the presence of VSCs in the breath cannot be predicted by the presence of specific species. Concentrations of oral malodourous compounds produced by oral bacteria vary according to the type and abundance of species. Interactions between bacterial species may play important roles in the production of VSCs.
Analysis of the oral microbiota reveals several signals from various bacterial species present in various numbers, but we cannot directly or indirectly distinguish bacterial species producing oral malodourous compounds from non-producing bacteria; the bacterial cells form complicated networks in the oral cavity. Machine learning is suitable for prediction from such complicated data, and we previously reported some success in predicting oral malodour using support vector machines (SVMs) .
Machine learning algorithms use training data to uncover underlying patterns, build models, and make predictions based on the best fit models. Indeed, some well-known algorithms, such as SVMs, random forests, Bayesian networks, and Gaussian networks, have been applied in genomics, proteomics, systems biology, and numerous other domains . We previously reported prediction of oral malodour from oral microbiota in saliva by using an SVM based on peak areas of terminal restriction fragment length polymorphisms (T-RFLPs) of the 16S rRNA gene as data for supervised machine-learning methods . Using this training data, the SVM achieved a high classification accuracy of 82%, with a sensitivity of 51% and specificity of 95%. Currently, T-RFLP does not provide economic advantages over 16S RNA sequence analysis. In this study, we devised a more precise classification system using a deep-learning approach based on 16S rRNA sequences with a higher resolution than that of T-RFLP analysis, and compared it with SVM-based prediction.
The study population consisted of 90 patients (37 men and 53 women, mean age of 50.0 ± 14.7 years) who had visited the Oral Malodour Clinic of Fukuoka Dental College Medical and Dental Hospital between August 2011 and October 2016 with a complaint of halitosis. They had not consumed antibiotics within 3 months and had no otorhinolaryngological illness or metabolic disease. Of the 90 patients, 45 had no or weak oral malodour and 45 had marked oral malodour. All participating subjects understood the nature of the research project and provided written, informed consent. Permission for this study was obtained from the Ethics Committee for Clinical Research of Fukuoka Dental College and Fukuoka College of Health Sciences (approval numbers 89, 233, and 249). All study methods were carried out in accordance with the approved guidelines.
The severity of oral malodorour was determined in each patient using an organoleptic test (OLT) and gas chromatography. Malodour assessment and clinical examination, including tests of salivary flow and mucosal moisture level, were performed at least 5 h after eating, drinking, chewing, smoking, and brushing or rinsing the mouth. The OLT scores were estimated by two of three evaluators using a scale of 0 to 5 , and the mean of the scores given by the evaluators was used. The presence of OLT scores ≥ 2 among the three evaluators always exceeded 75% (= 0.50). Gas chromatography (model GC2014; Shimazu Works, Kyoto, Japan) was used to measure the concentrations of hydrogen sulphide (H2S), methyl mercaptan (CH3SH), and dimethyl sulphide (CH3SCH3) in the breath. The value for total VSCs was defined as the sum of the H2S, CH3SH, and CH3SCH3 concentrations. The threshold for marked oral malodour was defined as an OLT score of ≥ 3 and total VSC concentration of ≥ 0.3 ppm. The threshold for no or weak oral malodour was defined as an OLT score of < 3 and total VSC concentration of < 0.3 ppm.
Sample collection and pyrosequencing analysis
Saliva samples were collected from subjects using chewing gum. Subjects were asked to spit into a vessel throughout the a 5 min collection period. Samples (0.5 ml) were collected and were transferred to sterile plastic tubes. Bacteria were harvested by centrifugation (20,400 × g, 15 min at 4°C), and the resulting pellets were resuspended in 150 μl of buffer containing 50 mM Tris-HCl, 1 mM EDTA, and 1% sodium dodecyl sulfate (SDS; pH 7.6). The suspension was added to plastic tubes containing 0.3 g zirconia-silica beads (bead size, 0.1 mm; Biospec Products, Bartlesville, OK, USA) and one tungsten-carbide bead (bead size, 3 mm; Qiagen, Hilden, Germany). The samples were heated at 90°C for 10 min and then vigorously agitated for 3 min in a cell disruptor (Disruptor Genie, Scientific Industries, Inc., Bohemia, NY, USA). After centrifugation at 6000 × g for a few seconds, 200 μl of 1% SDS was added, and the samples were incubated at 70°C for 10 min. The mixtures were extracted using 400 μl of phenol–chloroform–isoamyl alcohol (25:24:1), and the nucleic acids were precipitated with 100% ethanol. Following centrifugation, the DNA was washed with 70% ethanol, resuspended in 100 μl of TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH 7.6), and frozen until subsequent analysis. After extraction, samples were PCR-amplified under permissive conditions using primers to amplify the 508-807 region in prokaryotic 16S rDNA containing the MiSeq sequencing adapters and an 8-nucleotide barcode on the forward primer, followed by the bases matching the 16S rRNA gene. The analysis was performed using the forward primer AA TGA TAC GGC GAC CAC CGA GAT CTA CAC XXXXXXXX TCG TCG GCA GCG TCA GAT GTG TAT AAG AGA CAG and the reverse primer GTT CGT CTT CTG CCG TAT GCT CTA CAA GCA GAA GAC GGC ATA CGA GAT XXXXXXXX CAG AGC ACC CGA GCC TCT ACA CAT ATT CTC TGT C. Pyrosequencing was conducted at Hokkaido System Science Co., Ltd. (Sapporo, Japan) on an Illumina MiSeq sequencer (Illumina, San Diego, CA, USA) using a paired-end 300 bp sequence read run with the Miseq Reagent Kit v3 and MiSeq Control Software version 188.8.131.52 (Illumina).
Data analysis and taxonomy assignment
Putative chimera sequences were removed by UCHIME v6.1.544 , and sequences with 80% of their nucleotides of fragment quality score 20 or lower were removed. The remaining sequences were assigned to OTUs using cd-hit with a 98% threshold of pairwise identity . Each representative sequence was compared using the BLAST algorithm with 998 sequences of the oral bacterial 16S rRNA gene (Human Oral Microbiome Database [HOMD] 16S rRNA RefSeq Version 15.1) deposited in HOMD and assigned to the best BLAST hit with a 97% identity. A total of 3000 sequences were randomly extracted from each sample and used for the following analyses (Additional files 1 and 2). LDA effect size (LEfSe)  analysis was used to detect significant differences between the relative abundances of OTUs in samples from patients with healthy and malodourous breath.
Learning and classification of the bacterial composition of each sample were accomplished using R (http://www.r-project.org) with the h2o package for deep learning and the activation type RectifierWithDropout, and the e1071 package for the SVM with the radial basis function (RBF). The radial kernel function transformed the data using the non-linear function k(x1,x2)=exp(−γ|x1−x2|2), where γ determines the RBF width, unless otherwise specified. Classification by machine learning was evaluated by leave-one-out cross-validation, i.e., one sample was classified by supervised machine learning using the other 89 samples for training. The commands used in this study are showed in Additional file 3.
Evaluation of microbiome compositions based on 16S rRNA sequences
Nucleotide sequences of 16S rRNAs were determined and their taxa were estimated by BLAST analysis using HOMD. A total of 3000 sequences from each sample were analysed and the results showed a typical bacterial composition profile (Fig. 1) when compared to the 16S rRNA sequences of known oral microbes. OTUs with ≤ 0.1% frequency, found in ≤ 4 samples, were omitted from the following calculation. In total, 108 distinct OTUs were noted, and the minimum, maximum, and mean numbers of OTUs per sample were 20, 66, and 38.9, respectively (Additional file 1).
Some OTUs belonged to the same genus, so we re-examined the composition of the genera (Additional file 2). Thirty-seven were present in the samples. Genera characteristic of healthy and malodourous mouth breath were analysed using the Mann-Whitney U test. Table 1 shows the significance of bacteria between two groups.
Linear discriminant analysis (LDA) was performed using LEfSe  to detect OTUs with significantly different relative abundances in oral malodourous and healthy breath. A total of 108 OTUs were found to be significantly differences between bacteria in the groups (Fig. 2). OTUs identified as most strongly associated with oral malodour were from the Bacteroides, Prevotella, and Porphyromonas genera.
Classification of the presence of oral malodour by SVM and deep learning
To evaluate the classification performance of SVM and deep learning for oral malodour, proportions of OTUs associated with oral malodour in 16S rRNA analysis were used for classification by the support vector machine and deep learning (Table 2). Deep learning discriminated oral malodour from normal breath with 96.7% accuracy as compared to SVM, which discriminated between them with 78.9% accuracy.
We previously reported that SVM discriminated oral malodour from normal breath based on T-RFLP analysis with 81% accuracy . T-RFs are generated by digestion with restriction enzyme(s); therefore, the resolution for discrimination of bacterial species is limited by the size of recognition sites within the target fragments and lacks quantitative ability. The costs for 16S rRNA sequence analyses have been reduced by pyrosequencing and other next-generation sequencing techniques. We expected that the higher resolution would improve the recognition rate by SVM and other machine learning systems. Contrary to our expectations, the SVM classifier discriminated between the presence and absence of oral malodour using the amplified 16S rRNA sequences from the saliva samples with an accuracy of 78.9%, similar to that of T-LFs (Table 2).
The bacterial composition in the saliva samples showed a typical oral microbiota profile (Fig. 1), and statistical analysis showed some genera specific to healthy and malodourous breath (Table 1). Of the genera present at over 5% abundance, Streptococcus, Granulicatella and Rothia were more abundant in the healthy group, whereas Veillonella, Peptostreptococcus, and Porphyromonas were more abundant in the malodourous group. These genera contain oral malodour-associated species, and Porphyromonas spp, in particular, as periodontal bacteria, are strongly suggested to be an oral malodour-producing bacteria [4, 16, 17]. In addition, LEfSe analysis revealed many bacterial species, genera, and families with significantly different relative abundances in oral malodourousand healthy breath. A total of 74 OTUs or OTU groups were noted to be significantly differentially abundant in these groups, with an LDA threshold of 4.0 (Fig. 2). The Bacteroidales order, which includes the genera Prevotella and Porphyromonas, was strongly associated with oral malodour. The results of LEfSe analysis include a cladogram showing the significant differences at different hierarchical levels (Fig. 2b). In this study, the relative abundance of Fusobacterium was an averaged 0.12%, and Treponema was detected in almost no samples. Thus, VSC concentrations cannot be predicted based on the abundances of some known VSC-producing bacteria, and machine learning techniques are useful.
A large proportion of oral microbiota, particularly in the saliva, does not produce oral malodourous compounds, but may indicate the presence of organisms specific to oral malodour owing to the interactions among bacterial species. Thus, machine learning can be expected to classify oral microorganisms as oral malodourous or non-malodourous, although our first trial with SVM did not show high classification accuracy (Table 2). We next focused on deep learning, which has advanced significantly in solving problems that have resisted the best attempts of the artificial intelligence community for several years . Application of deep learning in bioinformatics has become a focal point of research .
Deep learning classified oral malodour and healthy breath with 97% accuracy, improving accuracy by 15% over that obtained by SVM classification (Table 2). Notably, the sensitivity is 100%, or false negative rate is 0%. That is, anyone whose oral microbiota is classified as oral malodourous has a high risk of oral malodour. Classification with high sensitivity is desirable for screening or preliminary tests because of the lower number of false negatives, though 97% accuracy is not always expected for such biological data. ROC curve analysis (Fig. 3) suggested that this approach is effective and reliable. Patients concerned about oral malodour could send saliva samples for analysis before medical treatment. Packaging breath is impractical, whereas preparation of a saliva sample is simple, as DNA from oral bacterial cells in the saliva can be dried and transported at room temperature by using FTA cards (GE Healthcare, Little Chalfont, UK).
We demonstrated deep learning-based classification of oral malodourous and healthy breath with high accuracy (97%) based on profiling of 16S rRNA sequences from microbiota in saliva samples. Classification using saliva samples is suitable for screening of oral malodour risk. In addition, treatment for oral malodour can be monitored using this classification system.
Linear discriminant analysis
LDA effect size
Operational taxonomic units
Radial basis function
Receiver operating characteristic
Support vector machine
Terminal restriction fragment length polymorphisms
Volatile sulphur compound
Kleinberg I, Westbay G. Oral malodor,. Crit Rev Oral Biol Med. 1990; 1(4):247–59.
Scully C, Porter S, Greenman J. What to do about halitosis. BMJ. 1994; 308(6923):217–8.
Hughes FJ, McNab R. Oral malodour–a review. Arch Oral Biol. 2008; 53 Suppl 1:1–7.
Suzuki N, Yoneda M, Hirofuji T. Relationship Between Oral Malodor and Oral Microbiota. In: Oral Heal. Care - Prosthodont. Periodontol. Biol. Res. Syst. Cond.London: InTech. p. 123–30. Chap. 9.
Nakano Y, Yoshimura M, Koga T. Methyl mercaptan production by periodontal bacteria. Int Dent J. 2002; 52 Suppl 3:217–20.
Tanaka M, Yamamoto Y, Kuboniwa M, Nonaka A, Nishida N, Maeda K, Kataoka K, Nagata H, Shizukuishi S. Contribution of periodontal pathogens on tongue dorsa analyzed with real-time PCR to oral malodor. Microbes Infect. 2004; 6(12):1078–83.
Moore WE, Moore LV. The bacteria of periodontal diseases. Periodontol 2000. 1994; 5:66–77.
Aas JA, Paster BJ, Stokes LN, Olsen I, Dewhirst FE. Defining the Normal Bacterial Flora of the Oral Cavity. J Clin Microbiol. 2005; 43(11):5721–32.
Dewhirst FE, Chen T, Izard J, Paster BJ, Tanner ACR, Yu W-H, Lakshmanan A, Wade WG. The Human Oral Microbiome. J Bacteriol. 2010; 192(19):5002–17.
Nakano Y, Takeshita T, Kamio N, Shiota S, Shibata Y, Suzuki N, Yoneda M, Hirofuji T, Yamashita Y. Supervised machine learning-based classification of oral malodor based on the microbiota in saliva samples. Artif Intell Med. 2014; 60(2):97–101.
Larrañaga P, Calvo B, Santana R, Bielza C, Galdiano J, Inza I, Lozano JA, Armañanzas R, Santafé G, Pérez A, Robles V. Machine learning in bioinformatics. Brief Bioinform. 2006; 7(1):86–112.
Yaegaki K, Coil JM. Examination, classification, and treatment of halitosis; clinical perspectives. J Can Dent Assoc. 2000; 66(5):257–61.
Edgar RC, Haas BJ, Clemente JC, Quince C, Knight R. UCHIME improves sensitivity and speed of chimera detection. Bioinformatics. 2011; 27(16):2194–200.
Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006; 22(13):1658–9.
Segata N, et al. Metagenomic biomarker discovery and explanation. Genome Biol. 2011; 12(6):60.
Mashima I, Nakazawa F. A review on the characterization of a novel oral Veillonella species, V. Tobetsuensis, and its role in oral biofilm formation. J Oral Biosci. 2013; 55(4):184–90.
Nakano Y, Yoshimura M, Koga T. Correlation between oral malodor and periodontal bacteria. Microbes Infect. 2002; 4(6):679–83.
LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015; 521(7553):436–44.
Min S, Lee B, Yoon S. Deep learning in bioinformatics. Brief Bioinform. 2017; 18(5):851–69.
This study was supported in part by Grants-in-Aid for Scientific Research 15K14423 (F. K), 16K07205 (Y. N.), 26463175 (N. S.); and by Sato Fund (2015–2016) from Nihon University School of Dentistry. The funding agencies had no role in the design of the study and collection, analysis, and interpretation of data or in the writing of the manuscript.
Availability of data and materials
The dataset used is attached as Additional file 1.
Ethics approval and consent to participate
All participating subjects understood the nature of the research project and provided written, informed consent. Permission for this study was obtained from the Ethics Committee for Clinical Research of Fukuoka Dental College and Fukuoka College of Health Sciences (approval numbers 89, 233, and 249). All the study methods were carried out in accordance with the approved guidelines.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Dataset of numbers of OTUs in samples. A total of 3000 sequences were randomly extracted from each sequence data of a sample. The second column, “Malodour”, shows malodourous (P, positive) or normal (N, negative) breath. (CSV 24 kb)
Dataset of numbers of genera in samples. OTUs in Additional file 1 were combined into genera and counted again. (CSV 10 kb)
The commands of h2o and e1071 in R. (TXT 1 kb)