System for Medical Documentation Filling based on Automatic Recognition of Audio Recordings
DOI:
https://doi.org/10.15407/intechsys.2026.01.043Keywords:
intelligent software system, automatic recognition of audio recordings, medical records administration, automatic speech recognition, generative language models, structured reportingAbstract
The investigation aims to address the pressing issue of excessive administrative burden on medical personnel in Ukraine, which leads to significant time expenditures and the risk of errors in manual documentation. As a result, an intelligent software system was created that can automatically convert audio recordings of medical consultations into structured reporting adapted to national standards. The developed system is adapted to the linguistic and regulatory environment of Ukraine, ensures an unprecedented level of data confidentiality by localizing the transcription process, and directly generates reports in accordance with national standards.
References
Arndt B.G., Beasley J.W., Watkinson M.D., et al. Tethered to the EHR: Primary Care Physician Workload Assessment Using EHR Event Log Data and Time-Motion Observations. Annals of Family Medicine, 2017, Vol. 15 (5), 419–426. https://doi.org/10.1370/afm.2121
Mamykina L., Vawdrey D.K., Hripcsak G. How Do Residents Spend Their Shift Time? A Time and Motion Study With a Particular Focus on the Use of Computers. Academic Medicine, 2016, Vol. 91 (6), 827–832. https://doi.org/10.1097/ACM.0000000000001148
Shanafelt T.D., West C.P., Sinsky C., et al. Changes in Burnout and Satisfaction With Work-Life Integration in Physicians and the General US Working Population Between 2011 and 2020. Mayo Clinic Proceedings, 2022, Vol. 97 (3), 491–506. https://doi.org/10.1016/j.mayocp.2021.11.021
LeCun Y., Bengio Y., Hinton G. Deep learning. Nature, 2015, Vol. 521, 436–444. https://doi.org/10.1038/nature14539
Radford A., Kim J. W., Xu T., et al. Robust Speech Recognition via Large-Scale Weak Supervision. arXiv, 2022, Article 2212.04356. https://doi.org/10.48550/arXiv.2212.04356
OpenAI Whisper Model Card. URL: https://github.com/openai/whisper/blob/main/model-card.md [Accessed 07 Nov. 2025]
Google Cloud Speech-to-Text Documentation. URL: https://cloud.google.com/speech-to-text/docs [Accessed 07 Nov. 2025]
Vosk Offline Speech Recognition API. URL: https://alphacephei.com/vosk/ [Accessed 07 Nov. 2025]
Microsoft Azure Speech Services Documentation. URL: https://azure.microsoft.com/en-us/products/ai-foundry/tools/speech/ [Accessed 07 Nov. 2025]
Jensen P. B., Jensen L. J., Brunak S. Mining electronic health records… Nature Reviews Genetics, 2012, Vol. 13, 395–405. https://doi.org/10.1038/nrg3208
Yadav V., Bethard S. A Survey on Recent Advances in Named Entity Recognition… The 27th International Conference on Computational Linguistics, COLING, 2018, 2145–2158.
Touvron H., Lavril T., Izacard G., et al. LLaMA: Open and Efficient Foundation Language Models. ArXiv, 2023, Article 2302.13971.
International Statistical Classification of Diseases… ICD-10, WHO, 2019. URL: https://icd.who.int/ [Accessed 14 Nov. 2025]
Suki AI Platform Overview. URL: https://www.suki.ai/ [Accessed 15 Nov. 2025]
Nuance Dragon Medical One Documentation. URL: https://dragon.nuance.com/en-us/user-documentation [Accessed 16 Nov. 2025]
DeepScribe: Ambient AI Scribe for Healthcare. URL: https://www.deepscribe.ai/ [Accessed 16 Nov. 2025]
Electronic health care system in Ukraine. URL: https://ehealth.gov.ua/ [Accessed 16 Nov. 2025]
Order Of The Ministry Of Health Of Ukraine 14 Feb. 2012 No. 110 On approval of forms of primary accounting documentation and Instructions for their completion, used in healthcare institutions regardless of the form of ownership and subordination. URL: https://zakon.rada.gov.ua/laws/show/z0661-12 [Accessed 16 Nov. 2025]
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Copyright Holder is the publisher of the Paper (The Institute of Information Technologies and Systems of the NAS of Ukraine), and/or the publisher of the Paper (PH "Akademperiodika" of the NAS of Ukraine), to that the The Institute of Information Technologies and Systems of the NAS of Ukraine on the basis of a sublicense publishing agreement granted the right to publish the work and the right to indicate the publisher after the copyright sign.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The paper is an Open Access under the CC BY-NC-ND 4.0 license - Attribution-NonCommercial-NoDerivatives 4.0 International.