Automatic Classification of Ukrainian Texts by Functional Styles
DOI:
https://doi.org/10.15407/intechsys.2025.02.090Keywords:
functional style, stylemetry, vectorization, machine learning, classificationAbstract
The proposed multilevel method for classifying Ukrainian texts by functional style combines statistical analysis, keyword analysis, and contextual analysis based on the BERT model, which accounts for semantic and contextual dependencies in the text.
The results support the hypothesis that combining contextual features (generated by BERT) with statistical style parameters yields the highest classification accuracy. This highlights the advantage of the proposed model for tasks requiring high precision and stability in identifying functional text styles.
References
What are the advantages and disadvantages of Random Forest? URL: https://aiml.com/what-are-the-advantages-and-disadvantages-of-random-forest/ [Accessed 15 Nov. 2024]
Understanding searches better than ever before. URL: https://web.archive.org/web/20210127042834/https://www.blog.google/products/search/search-language-understanding-bert/ [Accessed 15 Nov. 2024]
mshamrai/bert-base-ukr-eng-rus-uncased. URL: https://huggingface.co/mshamrai/bert-base-ukr-eng-rus-uncased [Accessed 15 Nov. 2024]
Slavic BERT NER. URL: https://github.com/deeppavlov/Slavic-BERT-NER/blob/master/README.md [Accessed 15 Nov. 2024]
multilingual.md. URL: https://github.com/google-research/bert/blob/master/multilingual.md [Accessed 15 Nov. 2024]
Areshenkov Yu. O. Stylistics of the Ukrainian language: lecture notes and lesson plans: teaching and methodological manual. KrDPU, Kryvyy Rih, 2007, 3-th ed., 18p. [In Ukrainian: Арешенков Ю. О. Стилістика української мови: конспект лекцій та плани занять : навч.-метод. посіб.] https://doi.org/10.31812/0564/2140
Artistic style as a type of language. Substyles of artistic style. Genres of artistic style. Colors of artistic style. URL: https://studfile.net/preview/5721078/page:36 [Accessed 15 Nov. 2024] [In Ukrainian: Художній стиль як різновид мови. Підстилі художнього стилю. Жанри художнього стилю. Колорити художнього стилю]
BERT 101. State Of The Art NLP Model Explained. URL: https://huggingface.co/blog/bert-101 [Accessed 15 Nov. 2024]
UberText 2.0. URL: https://lang.org.ua/en/ubertext [Accessed 15 Nov. 2024]
Brown corpus of the Ukrainian language. [In Ukrainian: Браунський корпус української мови]
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Information Technologies and Systems

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The paper is an Open Access under the CC BY-NC-ND 4.0 license - Attribution-NonCommercial-NoDerivatives 4.0 International.