System for Medical Documentation Filling based on Automatic Recognition of Audio Recordings

Authors

DOI:

https://doi.org/10.15407/intechsys.2026.01.043

Keywords:

intelligent software system, automatic recognition of audio recordings, medical records administration, automatic speech recognition, generative language models, structured reporting

Abstract

The investigation aims to address the pressing issue of excessive administrative burden on medical personnel in Ukraine, which leads to significant time expenditures and the risk of errors in manual documentation. As a result, an intelligent software system was created that can automatically convert audio recordings of medical consultations into structured reporting adapted to national standards. The developed system is adapted to the linguistic and regulatory environment of Ukraine, ensures an unprecedented level of data confidentiality by localizing the transcription process, and directly generates reports in accordance with national standards.

References

Arndt B.G., Beasley J.W., Watkinson M.D., et al. Tethered to the EHR: Primary Care Physician Workload Assessment Using EHR Event Log Data and Time-Motion Observations. Annals of Family Medicine, 2017, Vol. 15 (5), 419–426. https://doi.org/10.1370/afm.2121

Mamykina L., Vawdrey D.K., Hripcsak G. How Do Residents Spend Their Shift Time? A Time and Motion Study With a Particular Focus on the Use of Computers. Academic Medicine, 2016, Vol. 91 (6), 827–832. https://doi.org/10.1097/ACM.0000000000001148

Shanafelt T.D., West C.P., Sinsky C., et al. Changes in Burnout and Satisfaction With Work-Life Integration in Physicians and the General US Working Population Between 2011 and 2020. Mayo Clinic Proceedings, 2022, Vol. 97 (3), 491–506. https://doi.org/10.1016/j.mayocp.2021.11.021

LeCun Y., Bengio Y., Hinton G. Deep learning. Nature, 2015, Vol. 521, 436–444. https://doi.org/10.1038/nature14539

Radford A., Kim J. W., Xu T., et al. Robust Speech Recognition via Large-Scale Weak Supervision. arXiv, 2022, Article 2212.04356. https://doi.org/10.48550/arXiv.2212.04356

OpenAI Whisper Model Card. URL: https://github.com/openai/whisper/blob/main/model-card.md [Accessed 07 Nov. 2025]

Google Cloud Speech-to-Text Documentation. URL: https://cloud.google.com/speech-to-text/docs [Accessed 07 Nov. 2025]

Vosk Offline Speech Recognition API. URL: https://alphacephei.com/vosk/ [Accessed 07 Nov. 2025]

Microsoft Azure Speech Services Documentation. URL: https://azure.microsoft.com/en-us/products/ai-foundry/tools/speech/ [Accessed 07 Nov. 2025]

Jensen P. B., Jensen L. J., Brunak S. Mining electronic health records… Nature Reviews Genetics, 2012, Vol. 13, 395–405. https://doi.org/10.1038/nrg3208

Yadav V., Bethard S. A Survey on Recent Advances in Named Entity Recognition… The 27th International Conference on Computational Linguistics, COLING, 2018, 2145–2158.

Touvron H., Lavril T., Izacard G., et al. LLaMA: Open and Efficient Foundation Language Models. ArXiv, 2023, Article 2302.13971.

International Statistical Classification of Diseases… ICD-10, WHO, 2019. URL: https://icd.who.int/ [Accessed 14 Nov. 2025]

Suki AI Platform Overview. URL: https://www.suki.ai/ [Accessed 15 Nov. 2025]

Nuance Dragon Medical One Documentation. URL: https://dragon.nuance.com/en-us/user-documentation [Accessed 16 Nov. 2025]

DeepScribe: Ambient AI Scribe for Healthcare. URL: https://www.deepscribe.ai/ [Accessed 16 Nov. 2025]

Electronic health care system in Ukraine. URL: https://ehealth.gov.ua/ [Accessed 16 Nov. 2025]

Order Of The Ministry Of Health Of Ukraine 14 Feb. 2012 No. 110 On approval of forms of primary accounting documentation and Instructions for their completion, used in healthcare institutions regardless of the form of ownership and subordination. URL: https://zakon.rada.gov.ua/laws/show/z0661-12 [Accessed 16 Nov. 2025]

Downloads

Published

2026-04-30

How to Cite

Zatulovskyi, H., & Pіdnebesna H. (2026). System for Medical Documentation Filling based on Automatic Recognition of Audio Recordings. Information Technologies and Systems, 7(1), 43–53. https://doi.org/10.15407/intechsys.2026.01.043