ANNOUNCEMENTS
Submission for the Journal in 2025 is open now More
Acceptance of Application Forms for Participation in International Competition
ICMHDS-2026 will be open soon
More
Acceptance of Application Forms for Participation in International Academic Conferences 
PPPMSF-2026 & CIES-2026 will be open soon More
Data Quality Factors for Big Data Analytics in Occupational Health and Risk Management
Lephoto N. 1, Segooa M. A. 1, Motjolopane I. 2, Seaba T. R. 3
 
1 Tshwane University of Technology, South Africa
2 University of Witwatersrand, South Africa
3 Nelson Mandela University, South Africa
 

 

Abstract

Background and Aim of Study: Occupational health and risk management (OHRM) in the South African mining sector remains a critical national priority, where the life or death outcomes can be impacted by poor quality-data usage. Big data analytics (BDA) is increasingly used for hazards predictions and timely decision-making. 
The aim of the study: to explore critical data quality factors that influence the reliability and effectiveness of BDA for decision-making to guide occupational health practitioners and risk managers within South African mining sector.
Material and Methods: The study employed a quantitative survey methodology, informed by the literature review, to identify key data quality factors of BDA impacting OHRM in the South African mining sector. Underpinned by Technological, Organizational and Environmental (TOE) theory and contextual factors within big data quality dimensions and big data sources. Data was collected from 103 OHRM experts determined by the population size of 140.
Results: The results reveal the following factors to have influence on data quality for BDA within SA mining OHRM; Environmental factors with a predictive power of 25.0% (β=0.250) at p=0.014; followed by big data quality dimensions with 24.1% (β=0.241) at p=0.008; then, technological factors with 15.9% (β=0.159) at p=0.027; big data sources with 13.2% (β=0.132) at p=0.026; lastly organisational factors was less significant at p=0.228 with 10.0% (β=0.100).
Conclusions: This study identifies the factors of data quality, highlighting its role in BDA for decision-making within OHRM. These factors can further be used to provide guidance for SA mining OHRM decision makers to target critical data quality improvement areas for enhanced decision making in the sector.

 
 
 

Keywords

big data analytics, data quality, mining safety, occupational health, risk management, South Africa

 
 
  

References

Abburi, C. K. (2024). Optimizing big data quality management for national-scale projects: Strategies and frameworks. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 4811–4815. https://doi.org/10.2139/ssrn.5118713 

Abd Karim, A. T., & Sejati, A. E. (2021). Evaluation of mining safety management system implementation in PT. ANTAM UBPN Sultra. Jurnal Ekonomi, 26(2), 223–238. https://doi.org/10.24912/je.v26i2.747 

Albers, M. J. (2017). Quantitative data analysis – In the graduate curriculum. Journal of Technical Writing and Communication, 47(2), 215–233. https://doi.org/10.1177/0047281617692067 

Alnefaie, A., Albogami, S., Asiri, Y., Ahmad, T., Alotaibi, S. S., Al-Sanea, M. M., & Althobaiti, H. (2022). Chimeric antigen receptor T-cells: An overview of concepts, applications, limitations, and proposed solutions. Frontiers in Bioengineering and Biotechnology, 10, Article 797440. https://doi.org/10.3389/fbioe.2022.797440 

Andrews, R., Wynn, M. T., Vallmuur, K., ter Hofstede, A. H. M., Bosley, E., Elcock, M., & Rashford, S. (2019). Leveraging data quality to better prepare for process mining: An approach illustrated through analysing road trauma pre-hospital retrieval and transport processes in Queensland. International Journal of Environmental Research and Public Health, 16(7), Article 1138. https://doi.org/10.3390/ijerph16071138 

Arikekpar, F., & Bestman, A. E. (2023). Data quality management and responsiveness of indigenous oil and gas companies in rivers state, Nigeria. Journal of Office and Information Management (JOIM), 7(1-2), 301–313. https://www.rsu-joim.org/journal2.html 

Aseeri, M., & Kang, K. (2022). Big data-oriented organizational culture and business performance: A socio-technical approach. Problems and Perspectives in Management, 20(4), 52–66. https://doi.org/10.21511/ppm.20(4).2022.05   

Bag, S., Srivastava, G., Gupta, S., & Taiga, S. (2021). Diffusion of big data analytics innovation in managing natural resources in the African mining industry. Journal of Global Information Management, 30(6), 1–21. https://doi.org/10.4018/JGIM.297074 

Bauer, G. R., Churchill, S. M., Mahendran, M., Walwyn, C., Lizotte, D., & Villa-Rueda, A. A. (2021). Intersectionality in quantitative research: A systematic review of its emergence and applications of theory and methods. SSM – Population Health, 14, Article 100798. https://doi.org/10.1016/j.ssmph.2021.100798 

Bisschoff, R., & Grobbelaar, S. (2022). Evaluation of data-driven decision-making implementation in the mining industry. South African Journal of Industrial Engineering, 33(3), 218–232. https://doi.org/10.7166/33-3-2799 

Brouwer, D. H., & Rees, D. (2020). Can the South African milestones for reducing exposure to respirable crystalline silica and silicosis be achieved and reliably monitored? Frontiers in Public Health, 8, Article 107. https://doi.org/10.3389/fpubh.2020.00107 

Cai, L., & Zhu, Y. (2015). The challenges of data quality and data quality assessment in the big data era. Data Science Journal, 14, Article 2. https://doi.org/10.5334/dsj-2015-002 

Chikosi, L. C., & Mutezo, A. T. (2023). Managerial ownership, board gender diversity, occupational health, and safety risk management in an emerging economy. Journal of Governance and Regulation, 12(1), 230–241. https://doi.org/10.22495/jgrv12i1siart3 

Chukwuere, J. E. (2021). Theoretical and conceptual framework: A critical part of information systems research process and writing. Review of International Geographical Education Online, 11(9), 2678–2683. https://doi.org/10.48047/rigeo.11.09.234 

Cresswell, K., Claire, R., Avsar, T. S., Hudson, T., Purcell, R., Peeters, R., … Colvin, H. (2024). PD137 reviewing the health technology assessment and regulatory policy landscape on acceptability standards for real-world evidence - Initial findings. International Journal of Technology Assessment in Health Care, 40(S1), S147–S147. https://doi.org/10.1017/S026646232400374X 

Declerck, J., Kalra, D., Vander Stichele, R., & Coorevits, P. (2024). Frameworks, dimensions, definitions of aspects, and assessment methods for the appraisal of quality of health data for secondary use: Comprehensive overview of reviews. JMIR Medical Informatics, 12, Article e51560. https://doi.org/10.2196/51560 

Donkor, P., Siabi, E. K., Frimpong, K., Mensah, S. K., Siabi, E. S., & Vuu, C. (2023). Socio-demographic effects on role assignment and associated occupational health and safety issues in artisanal and small-scale gold mining in Ghana. Heliyon, 9(1), Article e13741. https://doi.org/10.1016/j.heliyon.2023.e13741 

Famure, O., Anderson, B. K., Atienza, J., Lena, E. R., & Singh, S. K. (2019). Standardization and alignment of data capture practices to clinical processes in the evaluation of living kidney donor candidates. Healthcare Management Forum, 32(4), 202–207. https://doi.org/10.1177/0840470419843672 

Feng, M., Zheng, J., Ren, J., Hussain, A., Li, X., Xi, Y., & Liu, Q. (2019). Big data analytics and mining for effective visualization and trends forecasting of crime data. IEEE Access, 7, 106111–106123. https://doi.org/10.1109/ACCESS.2019.2930410 

Feng, T., Zhang, X., Tan, L., Su, Y., & Liu, H. (2022). Near-miss organizational learning in nursing within a tertiary hospital: A mixed methods study. BMC Nursing, 21(1), Article 315. https://doi.org/10.1186/s12912-022-01071-1 

Franke, F., & Hiebl, M. R. W. (2022). Big data and decision quality: The role of management accountants’ data analytics skills. International Journal of Accounting & Information Management, 31(1), 93–127. https://doi.org/10.1108/IJAIM-12-2021-0246 

Gheorghe, G. C., Manrique-Hernandez, E. F., & Idrovo, A. J. (2022). Injuries and fatalities in Colombian mining emergencies (2005–2018): A retrospective ecological study. Revista Brasileira de Medicina do Trabalho, 20(4), 591–598. https://doi.org/10.47626/1679-4435-2022-799 

Haas, E. J. (2020). The role of supervisory support on workers’ health and safety performance. Health Communication, 35(3), 364–374. https://doi.org/10.1080/10410236.2018.1563033 

Hariri, R. H., Fredericks, E. M., & Bowers, K. M. (2019). Uncertainty in big data analytics: Survey, opportunities, and challenges. Journal of Big Data, 6(1), 1-16. https://doi.org/10.1186/s40537-019-0206-3 

Hermanus, M. (2007). Occupational health and safety in mining – Status, new developments, and concerns. Journal of the Southern African Institute of Mining and Metallurgy, 107(8), 531–538. https://hdl.handle.net/10520/AJA0038223X_3264 

Johnson, D. S., Sihi, D., & Muzellec, L. (2021). Implementing big data analytics in marketing departments: Mixing organic and administered approaches to increase data-driven decision making. Informatics, 8(4), 66. https://doi.org/10.3390/informatics8040066 

Kivunja, C., & Kuyini, A. B. (2017). Understanding and applying research paradigms in educational contexts. International Journal of Higher Education, 6(5), 26–41. https://doi.org/10.5430/ijhe.v6n5p26 

Krejcie, R. V., & Morgan, D. W. (1970). Determining sample size for research activities. Educational and Psychological Measurement, 30(3), 607–610. https://doi.org/10.1177/001316447003000308 

Kumar, D., & Bhattacharjee, R. M. (2023). Reducing workplace unsafe behaviour using risk classification, profiling, risk tolerance approach. Heliyon, 9(3), Article e13969. https://doi.org/10.1016/j.heliyon.2023.e13969 

Kuo, M. H., Sahama, T., Kushniruk, A. W., Borycki, E. M., & Grunwell, D. K. (2014). Health big data analytics: Current perspectives, challenges and potential solutions. International Journal of Big Data Intelligence, 1(1), 114–126. https://doi.org/10.1504/IJBDI.2014.063835 

Li, H., Huang, W., Zha, Z., & Yang, J. (2021). Application and platform design of geospatial big data. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLIII-B4-2021, 293–300. https://doi.org/10.5194/isprs-archives-XLIII-B4-2021-293-2021 

Lim, S., Saldanha, T. J. V., Malladi, S., & Melville, N. P. (2013). Theories used in information systems research: Insights from complex network analysis. Journal of Information Technology Theory and Application, 14(2), 5–46. https://aisel.aisnet.org/jitta/vol14/iss2/2 

Luo, N., Yu, H., You, Z., Li, Y., Zhou, T., Jiao, Y., Han, N., Liu, C., Jiang, Z., & Qiao, S. (2023). Fuzzy logic- and neural network-based risk assessment model for import and export enterprises: A review. Journal of Data Science and Intelligent Systems, 1(1), 2-11. https://doi.org/10.47852/bonviewJDSIS32021078 

Maroun, W. (2019). Exploring the rationale for integrated report assurance. Accounting, Auditing & Accountability Journal, 32(6), 1826–1854. https://doi.org/10.1108/AAAJ-04-2018-3463 

Mishra, P. C., & Mishra, P. K. (2023). Challenges and opportunities of big data analytics for human resource management in mining and metal industries. Journal of Mines, Metals and Fuels, 71(10), 1747–1753. https://doi.org/10.18311/jmmf/2023/35858 

Montisci, A., Palmieri, V., Vietri, M. T., Sala, S., Maiello, C., Donatelli, F., & Napoli, C. (2022). Big data in cardiac surgery: Real world and perspectives. Journal of Cardiothoracic Surgery, 17(1), Article 277. https://doi.org/10.1186/s13019-022-02025-z 

Moroe, N., Khoza-Shangase, K., Madahana, M., & Nyandoro, O. (2019). A proposed preliminary model for monitoring hearing conservation programmes in the mining sector in South Africa. Journal of the Southern African Institute of Mining and Metallurgy, 119(7), 671-679. https://doi.org/10.17159/2411-9717/18/016/2019 

Muhunzi, D., Kitambala, L., & Mashauri, H. (2023). Big data analytics in the healthcare sector: Opportunities and challenges in developing countries: A literature review. Health Informatics Journal, 30(4). https://doi.org/10.1177/14604582241294217 

Musiałek, M., & Maksymowicz, M. (2024). Remote sensing in mining: A brief overview and examples. Górnictwo Odkrywkowe – Opencast Mining, 65(3), 4–14. https://doi.org/10.5604/01.3001.0055.0491 

Muthelo, L., Mothiba, T. M., Malema, N. R., Mbombi, M. O., & Mphekgwana, P. M. (2022). Exploring occupational health and safety standards compliance in the South African mining industry, Limpopo Province, using principal component analysis. International Journal of Environmental Research and Public Health, 19(16), Article 10241. https://doi.org/10.3390/ijerph191610241 

Nazari, E., Ebnehoseini, Z., Agharezaei, Z., & Tabesh, H. (2020). Knowledge, attitude, and challenges of big data analytics based on IT staffs point of view in a developing country. Frontiers in Health Informatics, 9(1), Article 36. https://doi.org/10.30699/fhi.v9i1.225 

Ntlhakana, L., Nelson, G., Khoza-Shangase, K., & Dorkin, E. (2021). Occupational hearing loss for platinum miners in South Africa: A case study of data sharing practices and ethical challenges in the mining industry. International Journal of Environmental Research and Public Health, 19(1), Article 1. https://doi.org/10.3390/ijerph19010001 

Pypenko, I. S. (2019). Digital product: The essence of the concept and scopes. International Journal of Education and Science, 2(4), 56. https://doi.org/10.26697/ijes.2019.4.41 

Pypenko, I. S., & Melnyk, Yu. B. (2021). Principles of digitalisation of the state economy. International Journal of Education and Science, 4(1), 42-50. https://doi.org/10.26697/ijes.2021.1.5 

Rangineni, S., Bhanushali, A., Suryadevara, M., Venkata, S., & Peddireddy, K. (2023). A review on enhancing data quality for optimal data analytics performance. International Journal of Computer Sciences and Engineering, 11(10), 51–58. https://doi.org/10.26438/ijcse/v11i10.5158 

Rikhotso, O., Morodi, T. J., & Masekameni, D. M. (2022). Health risk management cost items imposed by occupational health and safety regulations: A South African perspective. Safety Science, 150(2), Article 105707. https://doi.org/10.1016/j.ssci.2022.105707 

Sarstedt, M., & Mooi, E. (2018). A concise guide to market research: The process, data, and methods using IBM SPSS Statistics. Springer. https://doi.org/10.1007/978-3-642-12541-6 

Segooa, A. M., & Kalema, B. M. (2024). Big data analytics artefact for outcome-based funding prediction in South African public universities. International Journal of Service Science, Management, Engineering, and Technology, 15(1), 1–27. https://doi.org/10.4018/IJSSMET.334220 

Stojanović, A., Milošević, I., & Nikolić, Đ. (2024). Developing a novel quantitative approach to evaluate the organizational factors affecting occupational health and safety in the mining industry. In XIX International May Conference on Strategic Management – IMCSM24 Proceedings, 20(1), 60–68. https://doi.org/10.5937/IMCSM24006S 

Taber, K. S. (2018). The use of Cronbach’s alpha when developing and reporting research instruments in science education. Research in Science Education, 48(6), 1273–1296. https://doi.org/10.1007/s11165-016-9602-2 

Tylečková, E., & Noskievičová, D. (2020). The role of big data in Industry 4.0 in the mining industry in Serbia. System Safety: Human – Technical Facility – Environment, 2(1), 166–173. https://doi.org/10.2478/czoto-2020-0020 

Ullah, F., Qayyum, S., Thaheem, M. J., Al-Turjman, F., & Sepasgozar, S. M. E. (2021). Risk management in sustainable smart cities governance: A TOE framework. Technological Forecasting and Social Change, 167, Article 120743. https://doi.org/10.1016/j.techfore.2021.120743 

Van Rensburg, H. M. M. J., Gous, A. G. S., Vosloo, J. C., & Van Heerden, M. (2019). Improving data management for environmental reporting in the gold mining industry. South African Journal of Industrial Engineering, 30(3), 163–173. https://doi.org/10.7166/30-3-2236 

Vassakis, K., Petrakis, E., & Kopanakis, I. (2018). Big data analytics: Applications, prospects and challenges. In G. Skourletopoulos, G. Mastorakis, C. Mavromoustakis, C. Dobre, & E. Pallis (Eds.), Mobile Big Data: Lecture Notes on Data Engineering and Communications Technologies, 10 (pp. 3–20). Springer. https://doi.org/10.1007/978-3-319-67925-9_1 

Wang, J., Liu, Y., Li, P., Lin, Z., Sindakis, S., & Aggarwal, S. (2023). Overview of data quality: Examining the dimensions, antecedents, and impacts of data quality. Journal of the Knowledge Economy, 15, 1159–1178. https://doi.org/10.1007/s13132-022-01096-6 

Wardhani, M. K., Setioko, B., & Pandelaki, E. E. (2021). Village spatial planning based on its potential as guidelines for guided and sustainable village development. Tataloka, 23(2), 171–182. https://doi.org/10.14710/tataloka.23.2.171-182 

Yang, L., Birhane, G. E., Zhu, J., & Geng, J. (2021). Mining employees’ safety and the application of information technology in coal mining: Review. Frontiers in Public Health, 9, Article 709987. https://doi.org/10.3389/fpubh.2021.709987 

Zhou, L. J., Cao, Q. G., Yu, K., Wang, L. L., & Wang, H. B. (2018). Research on occupational safety, health management and risk control technology in coal mines. International Journal of Environmental Research and Public Health, 15(5), Article 868. https://doi.org/10.3390/ijerph15050868 

 

 

 

  
 

 

Information about the authors:

Lephoto Nyakallo (Corresponding Author) https://orcid.org/0009-0000-7899-6950This email address is being protected from spambots. You need JavaScript enabled to view it.; Department of Informatics, Tshwane University of Technology, Pretoria, South Africa.

Segooa Mmatshuene Annahttps://orcid.org/0000-0002-4190-8256; Doctor of Computing, Senior Lecturer, Department of Informatics, Tshwane University of Technology, Pretoria, South Africa.

Motjolopane Ignitiahttps://orcid.org/0000-0001-9047-6720; PhD in Information Systems, Associate Professor, Digital Business Wits Business School, University of Witwatersrand, Johannesburg, South Africa.

Seaba Tshinakaho Relebogilehttps://orcid.org/0000-0002-5773-887X; Doctor of Computing in Informatics, Senior Lecturer, Department of IT Management and Governance, Nelson Mandela University, Gqeberha, South Africa

 

 
 
Cite this article as:

APA


Lephoto, N., Segooa, M. A., Motjolopane, I., & Seaba, T. R. (2025). Data quality factors for big data analytics in occupational health and risk management. International Journal of Science Annals, 8(2), 1–11. https://doi.org/10.26697/ijsa.2025.2.5 

Harvard


Lephoto, N., Segooa, M. A., Motjolopane, I., & Seaba, T. R. "Data quality factors for big data analytics in occupational health and risk management." International Journal of Science Annals, [online] 8(2), pp. 1–11. viewed 30 June 2025, https://culturehealth.org/ijsa_archive/ijsa.2025.2.5.pdf

Vancouver


Lephoto N., Segooa M. A., Motjolopane I., & Seaba T. R. Data quality factors for big data analytics in occupational health and risk management. International Journal of Science Annals [Internet]. 2025 [cited 30 June 2025]; 8(2): 1–11. Available from: https://culturehealth.org/ijsa_archive/ijsa.2025.2.5.pdf https://doi.org/10.26697/ijsa.2025.2.5

  © 2018 – 2025 International Journal of Science Annals
DOI: https://doi.org/10.26697/ijsa