DATA WAREHOUSE & DATA LAKE
What is it?
A Data Warehouse is a repository of a pool of data, which is nevertheless a model structured and designed for the analytics phase. A Data Lake is also a repository but it uses the data in its native format, as if it were fluid shapes, not yet filtered or divided into packages. The data will only be transformed when it needs to be analysed by applying, if necessary, a scheme to proceed to the next analysis.
What do you get?
In both cases you get a data package collected at one point. With a Data Lake, the data can be unstructured, semi-structured or structured, and therefore offers massive scalability.
On the contrary, a Data Warehouse structures and stores data in files or folders and it is therefore necessary to assess the size in advance.
A consistent data platform is a company’s most important information asset. The technologies we use are: AWS, Azure, Google Cloud Platform, Oracle, PostgreSQL, Snowflake, SQL Server.