On Determining Multiple Languages through Technological Examination for Conservation Management Using Machine Learning

Authors

  • Nguyen Minh Tuan

    Faculty of Information Technology, Posts and Telecommunications Institute of Technology, 11 Nguyen Dinh Chieu, Dakao Ward, 1 District, Ho Chi Minh City 700000, Viet Nam

  • Phan Thi Thanh Thuy

    University of Foreign Language Studies, The University of Danang (UFLS), Le Duan, Hai Chau, Da Nang 50000, Viet Nam

  • Ha Huy Nguyen Cuong

    Software Development Center, University of Danang, Le Duan, Hai Chau, Da Nang 50000, Viet Nam

  • Nguyen Trong Hien

    Faculty of Public Health, Pham Ngoc Thach University of Medicine, 2 Duong Quang Trung, 10 District, Ho Chi Minh City 700000, Viet Nam

DOI:

https://doi.org/10.30564/fls.v7i5.9110
Received: 16 March 2025 | Revised: 18 April 2025 | Accepted: 23 April 2025 | Published Online: 8 May 2025

Abstract

This study delves into the intricate linguistic history of Quang Nam province, a region rich in cultural and linguistic layers that existed long time ago. By analyzing place names and linguistic strata, the research uncovers traces of Mon Khmer and Austronesian languages, highlighting the region’s deep historical connections and multilingualism within Cham Pa society. To further explore these linguistic complexities, we employed an interdisciplinary approach that integrates insights from comparative linguistics, archaeology, and cultural studies. A key contribution of this research is the application of advanced information technology, specifically data mining and artificial intelligence, to the preservation and analysis of this linguistic heritage. Using the YOLO-v8 deep learning model, we developed a system capable of accurately recognizing and classifying handwritten place names. The YOLO-v8 model, renowned for its powerful object detection capabilities, was instrumental in automating the analysis of large datasets, achieving high levels of accuracy in the recognition of diverse linguistic characters. This integration of AI not only enhances our ability to study historical language use but also provides an efficient and scalable solution for archiving and managing valuable cultural data. The results of this study contribute to both the field of linguistic conservation and the development of AI-based tools for heritage preservation, ensuring the longevity and accessibility of the rich linguistic diversity of the region for future research.

Keywords:

Place Name; Origin; Champa; Multilingual; Quang Nam Province; AI; Dataset

References

[1] Trần, T.D., 2012. Họ ngôn ngữ và văn hóa tiền sử: Trường hợp văn hóa Đông Sơn và họ Thái - Kađai. In: Cộng đồng các tộc người ngữ hệ Thái - Kađai ở Việt Nam. Nhà xuất bản Thế giới: Hanoi, Vietnam. pp. 337–346.

[2] Trần, T.D., 2009. Về mối quan hệ giữa các ngôn ngữ Nam Á và Nam Đảo ở Đông Nam Á. VNU Journal of Science: Social Sciences and Humanities. 25(3), 121–126.

[3] Po, D., 2006. Cham language and script in historical process. Proceedings of the History of Language and Cham Script; 21–22 September 2006; Kuala Lumpur, Malaysia. EFEO & Tokyo University of Foreign Studies: Paris, France; Tokyo Japan. pp. 1–10.

[4] Bùi, T.N., 2017. Another Hypothesis on the Semantics and Etymology of the Name 'Đà Nẵng'. Language & Life Journal. 9(263), 95–101.

[5] Trần, T.D., 2016. Stopization of initial sounds in the history of the vietnamese language. Journal of Linguistics. 5(324), 9–15.

[6] Trần, T.D., 2013. Tên gọi thánh "Dóng" và lễ hội "Phù Đổng": góc nhìn từ ngữ âm lịch sử tiếng Việt. Journal of Linguistics. 2(285), 3–10.

[7] Dinh, H.H., 2020. The symbol of Saint Gióng and the Gióng festival in the historical context of Vietnam. Asian Education and Development Studies. 9(1), 37–45. DOI: https://doi.org/10.1108/AEDS-01-2018-0015

[8] Parmentier, H., 1935. LA CONSTRUCTION DANS L'ARCHITECTURE KHMÈRE CLASSIQUE. Bulletin of the French School of the Far East (BEFEO). 35(2), 243-311.

[9] Hardy, A., Griffiths, A., Baptiste, P., et al., 2019. Champa Territories and Networks of a Southeast Asian Kingdom. Available from: https://publications.efeo.fr/en/livres/932_champa (cited 20 April 2025).

[10] Sastri, K.A., 1935. The Origin of the Champa Alphabet. Bulletin de l'École française d'Extrême-Orient. Bulletin of the French School of the Far East (BEFEO). 35, 233–241.

[11] Liu, C., Yin, F., Wang, D., et al., 2013. Online and offline handwritten Chinese character recognition: benchmarking on new databases. Pattern Recognition. 46(1), 155–162. DOI: https://doi.org/10.1016/j.patcog.2012.06.021

[12] Xiao, X., Jin, L., Yang, Y., et al., 2017. Building Fast and Compact Convolutional Neural Networks for Offline Handwritten Chinese Character Recognition. Available from: https://doi.org/10.48550/arXiv.1702.07975 (cited 20 April 2025).

[13] Wu, Y., & Yin, F., Chen, Z., et al., 2017. Handwritten Chinese Text Recognition Using Separable Multi-Dimensional Recurrent Neural Network. Available from: https://ieeexplore.ieee.org/document/8269953 (cited 20 April 2025).

[14] Zhang, Y., 2015. Deep Convolutional Network for Handwritten Chinese Character Recognition. Available from: https://paperswithcode.com/paper/deep-convolutional-network-for-handwritten (cited 20 April 2025).

[15] Zhuang, Y., Liu, Q., Qiu, C., et al., 2021. A Handwritten Chinese Character Recognition based on Convolutional Neural Network and Median Filtering. Journal of Physics: Conference Series. DOI: https://doi.org/10.1088/1742-6596/1820/1/012162

[16] Huang, J., Tan, J., 2019. DropDist: A CNN-based model for large-scale handwritten Chinese character recognition with small training dataset. IEEE Transactions on Image Processing. 28(6), 2993–3003

[17] Ahlawat, S., Choudhary, A., 2020. Hybrid CNN-SVM Classifier for Handwritten Digit Recognition. Procedia Computer Science. 167, 2554-2560. DOI: https://doi.org/10.1016/j.procs.2020.03.309

[18] Kavitha, B.R., Srimathi, C., 2019. Benchmarking on offline Handwritten Tamil Character Recognition using convolutional neural networks. Journal of King Saud University - Computer and Information Sciences. 34(4), 1183-1190. DOI: https://doi.org/10.1016/j.jksuci.2019.06.004

[19] He, K., Zhang, X., Ren, S., et al., 2016. Deep Residual Learning for Image Recognition. Available from: https://doi.org/10.1109/CVPR.2016.90 (cited 20 April 2025).

[20] Liu, X., Zhou, Y., Wang, Z., 2020. Deep neural network-based recognition of entities in Chinese online medical inquiry texts. Future Generation Computer Systems. 114, 581-604. DOI: https://doi.org/10.1016/j.future.2020.08.022

[21] Peng, N., Dredze, M., 2016. Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning. Available from: https://doi.org/10.18653/v1/P16-2025 (cited 20 April 2025).

[22] Tuan, N.M., Meesad, P., 2025. A Bilinear Neural Network Method for Solving a Generalized Fractional (2+1)-Dimensional Konopelchenko-Dubrovsky-Kaup-Kupershmidt Equation. International Journal of Theoretical Physics. 64, 1–17. DOI: https://doi.org/10.1007/s10773-024-05855-w

[23] Tuan, N.M., Meesad, P., 2025. Bilinear Recurrent Neural Network for a Modified Benney-Luke Equation. International Journal of Applied and Computational Mathematics. 11(2), 1–35. DOI: https://doi.org/10.1007/s40819-025-01851-8

[24] Tuan, N.M., Meesad, P., Hieu, D.V., et al., 2024. On Students' Sentiment Prediction Based on Deep Learning: Applied Information Literacy. SN Computer Science. 5, 928. DOI: https://doi.org/10.1007/s42979-024-03281-7

[25] Tuan, N.M., Meesad, P., Nguyen, H.H.C., 2024. English–Vietnamese Machine Translation Using Deep Learning for Chatbot Applications. SN Computer Science. 5(1), 1–5. DOI: https://doi.org/10.1007/s42979-023-02339-2

[26] Papadakis, S., Kiv, A., Kravtsov, H., et al., 2023. Unlocking the power of synergy: The joint force of cloud technologies and augmented reality in education. Proceedings of the 10th Workshop on Cloud Technologies in Education, and 5th International Workshop on Augmented Reality in Education (CTE+AREdu 2022); 23 May 2022; Kryvyi Rih, Ukraine. 3364, pp. 1–23.

[27] Papadakis, S., Ків, А.Ю., Kravtsov, H.M., et al., 2023. Revolutionizing education: Using computer simulation and cloud-based smart technology to facilitate successful open learning. Proceedings of the 10th Illia O. Teplytskyi Workshop on Computer Simulation in Education, and Workshop on Cloud-based Smart Technologies for Open Education (CoSinEi and CSTOE 2022) co-located with ACNS Conference on Cloud and Immersive Technologies in Education (CITEd 2022); 22 December 2022; Kyiv, Ukraine. 3358, pp. 1–18.

[28] Nguyen, M.T., Meesad, P., Nguyen, H.S., 2024. On a Stock Prediction Aligned to Natural Language Sentiments. Proceedings of the 2024 8th International Conference on Natural Language Processing and Information Retrieval; 13–15 December 2024; Okayama, Japan. ACM: New York, NY, USA. pp. 395–400.

[29] Nguyen, M.T., Phayung, M., Duong, V.H., et al., 2023. New data about library service quality and convolution prediction. CTU Journal of Innovation and Sustainable Development. 15(ISDS), 30–38. DOI: https://doi.org/10.22144/ctujoisd.2023.032

Downloads

How to Cite

Minh Tuan, N., Thi Thanh Thuy, P., Cuong, H. H. N., & Hien, N. T. (2025). On Determining Multiple Languages through Technological Examination for Conservation Management Using Machine Learning. Forum for Linguistic Studies, 7(5), 643–654. https://doi.org/10.30564/fls.v7i5.9110

Issue

Article Type

Article