Energy-Efficient Design of Distributed Data Warehousing Systems for High-Performance Data Mining Applications in Smart Cities
Keywords:
Smart Cities, Distributed Data Warehouse, Energy Efficiency, Data Mining, Edge Computing, Query OptimizationAbstract
The integration of data mining applications in smart cities relies heavily on large-scale, distributed data warehousing systems. However, these infrastructures often suffer from excessive energy consumption due to redundant data replication, inefficient query processing, and poorly optimized storage. This paper proposes an energy-efficient architectural design for distributed data warehousing tailored to smart cities, aiming to reduce carbon footprints while ensuring high performance in data mining operations. We review existing approaches, identify research gaps, and present a framework integrating edge computing, energy-aware scheduling, and adaptive query optimization. Simulation results demonstrate a 22.4% reduction in energy use without compromising data throughput.
References
Cheng, L., et al. (2023). Hybrid Edge-Cloud Architectures for Smart Cities. IEEE Transactions on Sustainable Computing, 9(1), 24–36.
Ahmed, H., & Salama, M. (2022). Energy-Aware Data Placement in Distributed Warehouses. Journal of Grid Computing, 20(3), 121–136.
Kim, S., & Lee, J. (2021). Adaptive Partitioning for Energy Efficient Warehousing. Future Generation Computer Systems, 118, 44–56.
Zhang, Y., et al. (2020). Intelligent Query Processing in Distributed Systems. IEEE Access, 8, 215934–215944.
Patel, R., & Jain, A. (2019). Pruning Algorithms for Smart Data Warehouses. Information Systems Frontiers, 21(5), 1045–1058.
Grolinger, K., et al. (2018). Green Data Warehousing in Smart Cities. Journal of Big Data, 5(1), 1–15.
Rasch, T., & Zimmermann, J. (2017). Modeling Energy-Aware Routing in Data Center Networks. Simulation Modelling Practice and Theory, 73, 53–70.
Sakr, S., Liu, A., Batista, D. M., & Alomari, M. (2017). A Survey of Large Scale Data Management Approaches in Cloud Environments. Communications of the ACM, 60(2), 60–79.
Kliazovich, D., Bouvry, P., & Khan, S. U. (2013). GreenCloud: A Packet-level Simulator of Energy-aware Cloud Computing Data Centers. Journal of Supercomputing, 62(3), 1263–1283.
Kaur, R., & Chana, I. (2015). Energy Efficient Resource Scheduling in Cloud Data Centers. Cluster Computing, 18(1), 261–274.
Abdullahi, M., Ngadi, M. A., & Zubair, A. (2016). Hybrid Metaheuristic Algorithms for Energy-Efficient Resource Scheduling in Cloud Computing Environments. PLOS ONE, 11(6), e0157550.
Zhou, W., Piramuthu, S., & Ghosh, J. (2018). Energy-efficient Data Mining Algorithms in the IoT Context. Information Systems Frontiers, 20(2), 313–323.
Tariq, M. A., et al. (2020). An Energy-Aware Framework for IoT Data Management in Smart Cities. Sustainable Cities and Society, 54, 101996.
Karmakar, G., et al. (2019). An Energy-Aware Framework for Big Data Analytics in Smart Grids. IEEE Transactions on Industrial Informatics, 15(6), 3571–3581.
Svorobej, S., Endo, P. T., & Masyuko, O. (2020). Energy-aware resource allocation strategies for cloud data centers: a survey. Journal of Network and Computer Applications
Perera, C., Liu, C. H., Jayawardena, S., & Chen, M. (2015). A survey on Internet of Things from industrial market perspective. IEEE Access, 2, 1660–1679.
Kwak, J., Yoo, S., & Kim, S. (2019). Energy-efficient big data analytics for smart cities: A case study on traffic prediction. Sustainable Cities and Society, 48, 101559
Downloads
Published
Issue
Section
License
Copyright (c) 2022 Satoshi Yamamoto (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.