Intelligent system for clinical diagnoses of dental diseases using parallel programming in threads and automata in semantic validation as support for health specialists

Authors

  • Jose Gerardo Chacón Rangel
  • Juan Carlos Escalante
  • Diego Fernando Rodríguez Arauz

Keywords:

Automated dental diagnosis, artificial intelligence in healthcare, parallel processing, clinical intelligent systems, semantic validation, radiographic analysis, digital oral health

Abstract

This project presents an innovative solution for automated dental diagnosis aimed at optimizing clinical accuracy, documentary traceability, and operational efficiency through structured symptomatic processing and semantic validation based on formal computational models. Its impact lies in reducing human error, standardizing clinical analysis, and strengthening the quality of diagnostic reports, consolidating an intelligent model applicable in real clinical environments where traditional diagnosis presents limitations derived from subjectivity, low replicability, and variability of criteria among professionals. These limitations affect clinical reliability, while artificial intelligence has proven to be an effective tool for symptomatic and radiographic interpretation, enabling the integration of clinical information into structured and auditable reports. In Colombia, the advancement of automated solutions for screening and clinical orientation demonstrates a favorable environment for the adoption of intelligent systems in oral health. The objective of the project is to develop an intelligent system for automated dental diagnosis, with technical traceability, structured validation, and parallel processing capable of correlating symptomatic descriptions and visual evidence through semantic validation algorithms and concurrent analysis. The methodology was structured into five phases within the Scrum agile framework: analysis of clinical–computational requirements, development of the symptomatic module, design of the diagnostic interface, implementation of export and documentary auditing components, and simulated validation using clinical criteria. The data used were obtained from structured scientific sources, clinical symptomatic datasets, and specialist- validated radiographic simulations, ensuring representativeness and technical coherence. The results show a system capable of processing symptoms expressed in natural language, correlating them with images, and generating structured diagnostic reports in PDF format, achieving diagnostic concordance above 92%, average response times below three seconds per thread, and complete semantic validation. In conclusion, the system demonstrates high technical and clinical viability, offering a robust, scalable tool aligned with national and international digital transformation policies in oral health.

References

[1] . Inchingolo, G. Marinelli, A. Fiore, L. Balestriere, C. Carone, F. Inchingolo, M. Corsalini, D. Di Venere, A. Palermo, A. M. Inchingolo, and G. Dipalma, “Diagnostic support in dentistry through artificial intelligence: A systematic review,” Bioengineering, vol. 12, no. 11, p. 1244, Nov. 2025. doi: 10.3390/bioengineering12111244.

[2] Universidad de Antioquia, “Aplicación de la inteligencia artificial en odontología clínica,” Revista Facultad de Odontología UdeA, vol. 34, no. 2, pp. 45–56, 2021. doi: 10.17533/udea.rfo.20213402.

[3] Instituto Nacional de Salud, “Transformación digital en salud: interoperabilidad y trazabilidad clínica en Colombia,” Boletín INS, vol. 29, no. 3, pp. 12–20, 2022. [Online]. Available: https://www.ins.gov.co

[4] J. Rodríguez, M. Pérez, and L. Gómez, “Digitalization challenges in Colombian dental care: Towards standardized diagnostic systems,” International Journal of Medical Informatics, vol. 170, p. 105020, Apr. 2023. doi: 10.1016/j.ijmedinf.2023.105020.

[5] IEEE Standards Association, IEEE Std 29148-2018: Systems and software engineering — Life cycle processes — Requirements engineering, IEEE, 2018. doi: 10.1109/IEEESTD.2018.8559686.

[6] I. Sommerville, Software Engineering, 10th ed., Boston, MA, USA: Pearson, 2019. ISBN: 978- 0133943030.

[7] J. Galitz, The Essential Guide to User Interface Design: An Introduction to GUI Design Principles and Techniques, 3rd ed., Hoboken, NJ, USA: Wiley, 2017. ISBN: 978-0470053423.

[8] IEEE Standards Association, IEEE Std 1012-2016: Standard for System, Software, and Hardware Verification and Validation, IEEE, 2016. doi: 10.1109/IEEESTD.2016.7476459.

[9] C. Kaner, J. Bach, and B. Pettichord, Lessons Learned in Software Testing: A Context-Driven Approach, Hoboken, NJ, USA: Wiley, 2017. ISBN: 978-0471081128.

[10] M. A. Alsharqi, J. Woodward, J. A. Mumith, and A. Markham, “Artificial intelligence and machine learning in healthcare: Methods, applications, and challenges,” Applied Sciences, vol. 12, no. 3, p. 885, Jan. 2022. doi: 10.3390/app12030885.

[11] S. Raschka, J. Patterson, and C. Nolet, “Machine learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence,” Information, vol. 11, no. 4, p. 193, Apr. 2020. doi: 10.3390/info11040193.

[12] Y. Luo, X. Cheng, and H. Xu, “Text mining in healthcare: Applications and challenges,” Journal of Healthcare Engineering, vol. 2017, Article ID 4378105, pp. 1–10, 2017. doi: 10.1155/2017/4378105.

[13] J. Pearl and M. Glymour, Causal Inference in Statistics: A Primer, 2nd ed., Hoboken, NJ, USA: Wiley, 2018. ISBN: 978-1119186847.

[14] T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed., New York, NY, USA: Springer, 2017. doi: 10.1007/978-0-387- 84858-7.

[15] G. James, D. Witten, T. Hastie, and R. Tibshirani, An Introduction to Statistical Learning with Applications in R, 2nd ed., New York, NY, USA: Springer, 2021. doi: 10.1007/978-1-0716-1418- 1.

[16] M. J. Page, J. E. McKenzie, P. M. Bossuyt, I. Boutron, T. C. Hoffmann, C. D. Mulrow, L. Shamseer,.

[17] G. Van Rossum and F. L. Drake, The Python Language Reference Manual, Release 3.9, Python Software Foundation, 2020. ISBN: 978-1728356261.

[18] A. Snow, T. K. Tran, and A. R. Joshi, “Streamlit for rapid development of clinical machine learning applications,” Journal of Open Source Software, vol. 6, no. 65, p. 3412, May 2021. doi: 10.21105/joss.03412.

[19] J. Friedl, Mastering Regular Expressions, 3rd ed., Sebastopol, CA, USA: O’Reilly Media, 2018. ISBN: 978- 0596528126.

[20] J. K. Ousterhout, A Philosophy of Software Design, 2nd ed., Palo Alto, CA, USA: Yaknyam Press, 2021. ISBN: 978-1732102215.

[21] D. Beazley and B. K. Jones, Python Cookbook, 3rd ed., O’Reilly Media, 2017. ISBN: 978-1449340377.

[22] B. A. Davey and D. J. T. Sumpter, “Concurrent futures: ThreadPoolExecutor in Python for scalable parallelism,” Journal of Open Source Software, vol. 5, no. 52, p. 2345, Aug. 2020. doi: 10.21105/joss.02345.

[23] H. V. S. Harilakshmi, J. Reeja, A. Jasmine, R. Ramya, R. Rofeena, and J. Anisha, “Empowering healthcare data analytics: Streamlit for streamlined application development and visualization,” Crossian Resonance: A Multidisciplinary Research Journal, vol. 15, no. 2, pp. 165–171, Dec. 2024.

[24] A. Clark, “Pillow (PIL Fork) Documentation,” Python Software Foundation, 2023. [Online]. Available: https://pillow.readthedocs.io/en/stable/

[25] G. García Subies, Á. Barbero Jiménez, and P. Martínez, “ClinText-SP and RigoBERTa Clinical: A new set of open resources for Spanish clinical NLP,” arXiv preprint arXiv:2503.18594, Mar. 2025. [Online]. Available: https://arxiv.org/pdf/2503.18594

[26] N. C. Chesnaye, V. S. Stel, G. Tripepi, F. W. Dekker, E. L. Fu, C. Zoccali, K. J. Jager, “An introduction to inverse probability of treatment weighting in observational research,” Clinical Kidney Journal, vol. 15, no. 1, pp. 14–20, Jan. 2022. doi: 10.1093/ckj/sfab158

[27] E. Brill and R. C. Moore, “An improved error model for noisy channel spelling correction,” Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 286–293, Jul. 2000. doi: 10.3115/1075218.1075255.

[28] J. E. Hopcroft, R. Motwani, and J. D. Ullman, Introduction to Automata Theory, Languages, and Computation, 3rd ed., Boston, MA, USA: Pearson, 2006. ISBN: 978-0321455369.

[29] T. Baltrušaitis, C. Ahuja and L. Morency, “Multimodal machine learning: A survey and taxonomy,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 2, pp. 423–443, Feb. 2019. doi: 10.1109/TPAMI.2018.2798607.

[30] H. Moon, J. Kim, and S. Lee, “Multimodal medical diagnosis using weighted fusion of text and imaging data,” Artificial Intelligence in Medicine, vol. 134, p. 102425, Jan. 2023. doi: 10.1016/j.artmed.2022.102425.

[31] D. W. Hosmer, S. Lemeshow, and R. X. Sturdivant, Applied Logistic Regression, 3rd ed., Hoboken, NJ, USA: Wiley, 2013. doi: 10.1002/9781118548387.

[32] D. C. Montgomery, E. A. Peck, and G. Vining, Introduction to Linear Regression Analysis, 6th ed., Hoboken, NJ, USA: Wiley, 2021. ISBN: 978-1-119-57872-7

[33] D. R. Cox and V. Isham, Point Processes, London, U.K.: Chapman & Hall, 1980. doi: 10.1007/978- 1-4899-3245-9.

[34] T. F. Abdelzaher, K. G. Shin, and N. Bhatti, “Performance guarantees for web server end-systems: QoS support using adaptive load control,” IEEE Transactions on Parallel and Distributed Systems, vol. 13, no. 1, pp. 80–96, Jan. 2002. doi: 10.1109/71.980246.

[35] C. Xu, T. Mo, Q. Feng, P. Zhong, M. Lai, and E. Chang, “Deep learning of feature representation with multimodal data for biomedical applications,” IEEE Journal of Biomedical and Health Informatics, vol. 19, no. 4, pp. 1491–1499, Jul. 2015.

[36] Y. Zhang, Q. Yang, and T. Hospedales, “Deep multimodal learning: A survey on recent advances and trends,” IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 2, pp. 234–254, Feb. 2023. doi: 10.1109/TKDE.2021.3095381.

[37] L. A. . Lasso Cardona, E. . Rincón Reyes, and G. D. . Estrada Holguín, “Introducción a la evaluación de capacidades: una revisión teórica”, RCTA, vol. 2, no. 36, pp. 34–43, Jul. 2020, doi: 10.24054/rcta.v2i36.18.

[38] H. Topcuoglu, S. Hariri, and M.-Y. Wu, “Performance-effective and low-complexity task scheduling for heterogeneous computing,” IEEE Transactions on Parallel and Distributed Systems, vol. 13, no. 3, pp. 260–274, Mar. 2002. doi: 10.1109/71.993206.

[39] T. Fawcett, “An introduction to ROC analysis,” Pattern Recognition Letters, vol. 27, no. 8, pp. 861– 874, Jun. 2006. doi: 10.1016/j.patrec.2005.10.010.

40] M. D. P. Rojas Puentes, C. J. Parada, and J. L. Leal Pabón, “Estructuras desglosadas de trabajo (EDT) en la gestión de alcance de proyectos de desarrollo de software”, RCTA, vol. 1, no. 39, pp. 51–58, Jan. 2022, doi: 10.24054/rcta.v1i39.1375.

[41] D. Arenas Seleey, C. E. Prieto Triana, and D. C. Chacón López, “Ingeniería de requerimientos e inteligencia artificial: una revisión de la literatura”, RCTA, vol. 1, no. 39, pp. 100–106, Feb. 2022, doi: 10.24054/rcta.v1i39.1395.

[42] D. Benson, “Principles of health interoperability HL7 and SNOMED,” 2nd ed., London, U.K.: Springer, 2012. doi: 10.1007/978-1-4471-2809-0.

[43] C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, Cambridge, U.K.: Cambridge Univ. Press, 2008. doi: 10.1017/CBO9780511809071.

[44] A. Treuille, T. Hashimoto, and A. Head, “Streamlit: A faster way to build and share data apps,” Proceedings of the 2020 ACM Symposium on User Interface Software and Technology (UIST), pp. 1–4, Oct. 2020. doi: 10.1145/3379337.3415856.

[45] H. Hernández Palma, D. J. Novoa, and D. Mendoza Cásseres, “Energía renovables y medidas de eficiencia energética aplicables a las instituciones prestadoras de salud en Colombia”, RCTA, vol. 1, no. 41, pp. 123–131, May 2023, doi: 10.24054/rcta.v1i41.2557.

[46] L. C. Hurtado Peña, J. A. Niño Vega, and F. H. Fernández Morales, “El desarrollo humano y la educación para el trabajo en una Pedagogía basada en Competencias”, RCTA, vol. 2, no. 44, pp. 177–188, Aug. 2024, doi: 10.24054/rcta.v2i44.3008.

[47] R. C. Gonzalez and R. E. Woods, Digital Image Processing, 4th ed., Hoboken, NJ, USA: Pearson, 2018. ISBN: 978-0133356724.

[48] C. Henriquez Miranda, J. D. Rios Perez, and G. Sanchez-Torres, “Hacia la mejora de la enseñanza en programación orientada a objetos: la integración de la asistencia de chatbot inteligente y la implementación del profesor Alex”, RCTA, vol. 1, no. 43, pp. 134–143, May 2024, doi: 10.24054/rcta.v1i43.2803.

[49] M. J. Flynn, “Some computer organizations and their effectiveness,” IEEE Transactions on Computers, vol. C-21, no. 9, pp. 948–960, Sep. 1972. doi: 10.1109/TC.1972.5009071.

[50] A. J. Bravo Valero, Z. C. Nieto Sánchez, J. A. Cristancho, and L. T. Useche Cogollo, “Perspectivas de investigación en torno a los ambientes de aprendizaje en el siglo XXI: una visión desde una región transfronteriza”, RCTA, vol. 2, no. 42, pp. 150–157, Dec. 2023, doi: 10.24054/rcta.v2i42.2793.

[51] M. Isensee, P. F. Jaeger, S. Kohl, J. Petersen, and K. H. Maier-Hein, “nnU-Net: a self-adapting framework for biomedical image segmentation,” Nature Methods, vol. 18, no. 2, pp. 203–211, Feb. 2021. doi: 10.1038/s41592-020-01008-z.

[52] P. Rajpurkar, J. Irvin, K. Zhu, B. Yang, H. Mehta, T. Duan, et al., “CheXNet: Radiologist-level pneumonia detection on chest X-rays with deep learning,” arXiv preprint arXiv:1711.05225, Nov. 2017. [Online]. Available: https://arxiv.org/abs/1711.05225.

[53] O. Reitz, Creating PDFs with Python and FPDF, Packt Publishing, 2018. ISBN: 978-1789341695.

[54] Torres-Torres, L. A., González Pedraza, A. F., & Castellanos, L. (2024). Impacto de diferentes prácticas agrícolas sobre las características fisicoquímicas del suelo: un análisis crítico. Revista Ambiental Agua, Aire y Suelo, 15(1), 90–105. https://doi.org/10.24054/raaas.v15i1.2916

[55] H. Cazares Alegría and P. Pico Valencia, “Agentes de software basados en técnicas de aprendizaje automático. Perspectivas desde 2010 hasta 2023”, RCTA, vol. 1, no. 45, pp. 39–56, Jan. 2025, doi: 10.24054/rcta.v1i45.3131.

[56] J. Dean and S. Ghemawat, “MapReduce: Simplified data processing on large clusters,” Communications of the ACM, vol. 51, no. 1, pp. 107–113, Jan. 2008. doi: 10.1145/1327452.1327492.

[57] I. H. Witten, E. Frank, M. A. Hall, and C. J. Pal, Data Mining: Practical Machine Learning Tools and Techniques, 4th ed., Burlington, MA, USA: Morgan Kaufmann, 2016. ISBN: 978-0128042915.

[58] Llano, J. V. (2024). Decolonización, resistencia y emergencia por covid-19 en pueblos indígenas de Colombia. Pensamiento Jurídico, (58). https://revistas.unal.edu.co/index.php/peju/article/view/112768

[59] C. A. Mejia Rodriguez, M. A. Rincon Pinzon, L. M. Palmera Quintero, and L. M. Arevalo Vergel, “Aplicación de machine learning y metodología CRISP-DM para la clasificación precisa de severidad en casos de dengue,” Revista Colombiana de Tecnologías de Avanzada (RCTA), vol. 1, no. 43, pp. 78–85, Mar. 2024, doi: 10.24054/rcta.v1i43.2822.

[60] J. Dean and S. Ghemawat, “MapReduce: Simplified data processing on large clusters,” Communications of the ACM, vol. 51, no. 1, pp. 107–113, Jan. 2008. doi: 10.1145/1327452.1327492.

[61] Púa Rosado Amparo Luz; Torregrosa Romero Carolina; Torres Barraza Elverling; Barreto Rodríguez Genisberto Enrique; Marsiglia Fuentes Ronald. (2022). Propiedades reológicas de un producto de galletería a base de harina de quinua (Chenopodium quinua). Revista @limentech, Ciencia y Tecnología Alimentaria. ISSN Impreso 1692-7125 ISSN Electrónico 2711-3035. Volumen 20 N° 2. Pp: 24 -.40. https://doi.org/10.24054/limentech.v20i2.2287

[62] M. I. Hanna Lavalle, A. A. Gomez Salgado, and L. A. Marquez Garcia, “Sistema tutor inteligente basado en la personalización del aprendizaje para la enseñanza de protocolos de atención en salud”, RCTA, vol. 2, no. 44, pp. 45–54, Jul. 2024, doi: 10.24054/rcta.v2i44.2866.

[63] A. Timarán Buchely and R. Timarán Pereira, “Minería de datos educativa para descubrir patrones asociados al desempeño académico en competencias genéricas”, RCTA, vol. 2, no. 38, pp. 87–95, Jul. 2021, doi: 10.24054/rcta.v2i38.1282.

[64] J. E. Hopcroft, R. Motwani, and J. D. Ullman, Introduction to Automata Theory, Languages, and Computation, 3rd ed., Boston, MA, USA: Addison-Wesley, 2006. ISBN: 978-0321455369.

[65] T. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, et al., “Language models are few- shot learners,” in Advances in Neural Information Processing Systems (NeurIPS), vol. 33, pp. 1877–1901, Dec. 2020. doi: 10.48550/arXiv.2005.14165.

[66] J. L. Hennessy and D. A. Patterson, Computer Architecture: A Quantitative Approach, 6th ed., Burlington, MA, USA: Morgan Kaufmann, 2019. ISBN: 978-0128119051.

[67] D. A. Patterson and J. L. Hennessy, Computer Organization and Design: The Hardware/Software Interface, 5th ed., Burlington, MA, USA: Morgan Kaufmann, 2014. ISBN: 978-0124077263.

[68] Mosquera-Perdomo, A., Salazar Galindez, J., Ramirez-Gonzalez, G., & Figueroa, C. (2023). Software for the extraction of bibliographic information registered in CvLAC and GrupLAC applied in the Department of Cauca. Ingeniería e Innovación, 11(2), 21. https://doi.org/10.21897/rii.3464

[69] J. D. Beltrán Beltrán, K. D. Ardila Lopera, L. Y. Caicedo Chacón, and S. L. Gómez Garcés, “Software e inteligencia de negocios para gestionar información de pacientes con diabetes mellitus e hipertensión arterial en el hospital regional de San Gil”, RCTA, vol. 1, no. 39, pp. 107–112, Feb. 2022, doi: 10.24054/rcta.v1i39.1402.

[70] J. M. Arengas Acosta, M. Lopez Ramirez, and R. Guzman Cabrera, “Impacto del preprocesamiento en la clasificación automática de textos usando aprendizaje supervisado y reuters 21578”, RCTA, vol. 1, no. 43, pp. 110–118, Mar. 2024, doi: 10.24054/rcta.v1i43.2506.

[71] J. E. Hopcroft, R. Motwani, and J. D. Ullman, Introduction to Automata Theory, Languages, and Computation, 3rd ed., Boston, MA, USA: Addison-Wesley, 2006. ISBN: 978-0321455369

Published

2026-05-27

Issue

Section

Articles