ANALYSIS OF DATA COLLECTION PROBLEMS IN INTEGRATED STRUCTURES AND THEIR IMPACT ON THE ACCURACY OF PREDICTIVE ANALYTICS IN BUILDING BUSINESS PROCESSES
DOI:
https://doi.org/10.15407/economyukr.2025.06.039Keywords:
data collection; integrated structures; predictive analytics; data quality; machine learning; data integration into business processesAbstract
The effectiveness of business processes in the modern world is determined by the quality of big data analytics. These processes represent a sequence of interrelated actions aimed at achieving strategic and operational goals. One of the key obstacles to high-accuracy analytics is the heterogeneity of data sources, incompatibility of formats, delays in updating information, as well as errors in aggregation and processing. These factors provoke data distortion, which negatively affects the reliability of predictive models and can lead to inefficient management. BigData tools allow to conduct detailed business operations analysis, identify bottlenecks, optimize processes, and determine promising development directions. The efficiency of such solutions directly depends on the quality of the source data. In case of errors, inconsistencies or contradictions, predictive models lose their accuracy, which reduces the reliability of analytical conclusions. To minimize the risks associated with information distortion, it is necessary to standardize data collection, verification and processing procedures. The use of modern machine learning algorithms and statistical analysis methods allows to automatically detect errors, clean data and increase forecast accuracy. By optimizing data collection and analysis processes in integrated structures, companies can make informed decisions based on up-to-date information. This helps to increase competitiveness, create in-demand products, formulate effective strategies and quickly adapt to market changes.
The impact of data collection problems in integrated structures on the accuracy of predictive analytics is considered. The key factors leading to data distortion are analyzed, including heterogeneity, incompleteness, outdatedness and integration problems. The need to standardize data collection, cleaning and verification processes to increase forecast reliability is substantiated. Modern machine learning and statistical analysis methods, the efficiency of which depends on data quality, are outlined.
References
Redman, T. (2001). Data Quality: The Field Guide. Digital Press. 241 p. URL: https://books.google.com.ua/books/about/Data_Quality.html?id=iE2qsxDyEEYC&redir_esc=y
Provost, F., Fawcett, T. (2013). Data Science for Business. URL: https://www.researchgate.net/publication/256438799_Data_Science_for_Business
O’Neil, C. (2016). Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy. New York, Crown Publishers. 272 p. https://doi.org/10.5860/crl.78.3.403
Patil, DJ (2011). Building Data Science Teams. O'Reilly Media. 25 p.
Kimball, R., Ross, M. (2013). The Data Warehouse Toolkit. Third edition. Wiley. 601 p. URL: https://ia801609.us.archive.org/14/items/the-data-warehouse-toolkit-kimball/The%20Data%20Warehouse%20Toolkit%20-%20Kimball.pdf
Ventsel, V. (2020). Improving the effectiveness of management decisions in the process of economic and social policy-making based on economic and mathematical modeling. Theory and Practice of Public Administration, 1(68), 100-109. https://doi.org/10.34213/tp.20.01.13 [in Ukrainian].
Shchedrina, O. (2020). System analysis as a tool for making management decisions in business. Modeling and Information System in Economics, 99, 169-183. http://doi.org/10.33111/mise.99.15 [in Ukrainian].
Kravchenko, S., Grishkun, E., Vlasenko, O. (2020). Classification methods for machine learning using the Scikit-Learn library. Scientific notes of Taurida National V.I. Vernadsky University. Series: Technical Sciences. Vol. 31 (70). No. 3. Part 1. https://doi.org/10.32838/TNU-2663-5941/2020.3-1/19 [in Ukrainian].
Temchyshyna, Yu., Kravchenko, I. (2014). Improvement of the accounting and analytical component in the information system for forecasting at a modern production enterprise. Scientific Journal of the National Pedagogical Dragomanov University. Iss. 26. P. 57-63. URL: https://dspace.nuft.edu.ua/handle/123456789/19632 [in Ukrainian].
Turing, A. (1950). Computing Machinery and Intelligence. Mind. Vol. LIX. Iss. 236. P. 433-460. https://doi.org/10.1093/mind/LIX.236.433
Zamlynskyi, V., Shchurovska, A., Zamlynska, O. (2023). Features and characteristics of business intelligence (BI)-systems as a tool for improving the efficiency of company activities. Ukrainian Journal of Applied Economics and Technology. Vol. 8. No. 1. P. 53-61. https://doi.org/10.36887/2415-8453-2023-1-8 [in Ukrainian].
Golestanizadeh, M., Sarvari, H., Parishani, A., Akindele, N., Edwards, D.J. (2025). Probing the Effect of Business Intelligence on the Performance of Construction Projects Through the Mediating Variable of Project Quality Management. Buildings, 15, 621. https://doi.org/10.3390/buildings15040621
Kumar, V., Ashraf, A., Nadeem, W. (2024). AI-powered marketing: What, where, and how? International Journal of Information Management, 77, 102783. https://doi.org/10.1016/j.ijinfomgt.2024.102783
Bousdekis, A., Kerasiotis, A., Kotsias, S., Theodoropoulou, G., Miaoulis, G., Ghazanfarpour, D. (2023). Modelling and Predictive Monitoring of Business Processes under Uncertainty with Reinforcement Learning. Sensors, 23 (15), 6931. https://doi.org/10.3390/s23156931
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Publisher PH "Academperiodyka" of the NAS of Ukraine

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
