Matthew N.O. Sadiku, Olaniyi David Olaleye, Sarhan M. Musa


A data lake is a centralized storage that allows you to store all your structured and unstructured data at any scale. It is a place to put all the data an organization wants to collect, store, and analyze and turn into insights ad actions. Once data is placed in the data lake, it is available for analysis by everyone in the organization. The purpose of this paper is to provide a brief introduction on data lakes.

Full Text:



“Data lake,” Wikipedia, the free encyclopedia

C. Quix, “ Data lakes: A solution or a new challenge for big data integration,”

P. Russom, Data Lakes: Purposes, Practices, Patterns, and Platforms. TDWI, 2017.

R. Raju, R. Mital, and D. Finkelsztein, “Data lake architecture for air traffic management,” Proceedings of IEEE/AIAA 37th Digital Avionics Systems Conference, Sept. 2018.

P. P. Khine and Z. S. Wang, “Data lake: A new ideology in big data era,” Proceedings of 4th Annual International Conference on Wireless Communication and Sensor Network, 2018.

“What is a data lake?”

A. LaPlante and B. Sharma, Architecting Data Lakes Data Management Architectures for Advanced Business Use Cases. Sebastopol, CA: O’Reilly Media, 2016.

M. R. Llave, “Data lakes in business intelligence: Reporting from the trenches,”

Procedia Computer Science, vol. 138, 2018, pp. 516–524.

“How to build a successful data lake, “ May 17, 2016,

P. Pasupuleti and B. S. Purra, Data Lake Development with Big Data. Packt Publishing, 2015.

R. Marty, The Security Data Lake: Leveraging Big Data Technologies to Build a Common Data Repository for Security. Sebastopol, CA: O’Reilly Media, 2015.

B. Underdahl, Data Lakes for Dummies. John Wiley & Sons Australia, 2016.



  • There are currently no refbacks.

© International Journals of Advanced Research in Computer Science and Software Engineering (IJARCSSE)| All Rights Reserved | Powered by Advance Academic Publisher.