- What is the source of data
- How the data will be extracted from the source
- What will the data format be?
- How often should data be collected?
- How to handle missing data points? what rules must be applied for missing data points
- How the data will be received by the reporting system (ware house). Pull or push approach.
Saturday, 13 December 2025
Data Engineering - Client Interview question regarding data collection.
Subscribe to:
Comments (Atom)
Data Engineering - Client Interview question regarding data collection.
What is the source of data How the data will be extracted from the source What will the data format be? How often should data be collected? ...
-
When you're working with a file that's too large to fit into memory—say, multi-GB logs or datasets—Python gives you several efficien...
-
What is Delta Table ? A Delta Table is a type of table used in Delta Lake , which is an open-source storage layer built on top of Apache ...
-
Using Snowpark to connect to S3 via a Snowflake stage is a powerful pattern for scalable, secure, and flexible data engineering. Here'...