Skip to main content

Posts

Showing posts with the label The Crazy Programmer Data Lake vs Data Warehouse Data Lake vs Data Warehouse The Crazy Programmer

Data Lake vs Data Warehouse Neeraj Mishra The Crazy Programmer

Companies everywhere are handling more data than ever and all these terabytes of data need to be stored somewhere. Should you store the data in a database, a data warehouse, or a data lake? How do you know what is best for your company? Choosing the right data storage solution will depend greatly on how the data is going to be used. While both a data lake and a data warehouse share the goal of the process data queries to facilitate analytics, their functions are different. This post will give you an overview and use cases to understand when to use a data lake or a data warehouse. What is Data Lake? A data lake is a repository that holds raw data, of which the purpose is not yet defined or requires a very high level of flexibility and agility. A data lake allows you to store all data, at its raw format, structured and unstructured in a central repository. You can store the data without having to structure it first. The data lake may not use databases to store the data, using flat fi