How does Hbase store data internally?
Data Engineer Interview Questions
21,097 data engineer interview questions shared by candidates
What was my previous experience.
Architecture and ETL process of my previous employment with an example of End to end ETL flow. Couple of questions on AWS services. Learning spirit Differences between batch processing and stream processing. Questions on Kappa architecture Many more relavent to my previous experience mentioned in CV
What is Hadoop? What is Spark? And very silly questions like why use a Message Queue and not a Database.
How would you describe the data quality and data integrity
What has Microsoft attempted in the last decade that would be considered mismanaged by the general public?
Can you walk us through the process of designing an end-to-end data pipeline, including data ingestion, transformation, and loading (ETL), and how you would ensure its scalability and reliability?
1. Previous Works 2. Joining of Datasets and Questions in Relation to CDC Logics. 3. SCD-related questions and backfill scenarios 4. Architecture of Abinitio (since my tool was Abinitio)
Suppose I have records like this: ("a-b", "data1", 1) ("a-c", "data2", 1) ("a-b", "data3", 1) How can I group and sum, such that I have the following results when the input is a DataStream? ("a-b", ["data1", "data3"], 2) ("a-c", ["data2"], 1)
How do you manage client expectations when a project goes wrong?
Viewing 1861 - 1870 interview questions