Very generic questions which do not have any specific or single answer like - how would you optimize a hive query? how would you optimize a spark job?
Big Data Intern Interview Questions
1,784 big data intern interview questions shared by candidates
There is two table. one is emp_details another is employee salary Find emp who has salary more than 8000 and who has worked in multiple project. 2. Find prime num. between 1 to 50 in Python/java. 3. Execution of spark job 4.Big-data-ques
First Round Experience He asked me 2 SQL Questions(Questions were based on Join and group byAlso asked to write same code in pyspark) and few basic spark questions like cache and persist, reparation and coalesc, RDD, Dataframe and dataset difference. I cleared first round and HR again scheduled second round Second Round: 2 SQL questions which was based on joins and union, union and union all difference, RDD vs Dataframe vs Dataset, Repartition and coalesce,Spark Architecture, Hive and other project related and basic questions. I cleared second round as well then HR scheduled next round which is HR round. HR round: Introduce yourself, What I know about Impetus, Why changing job and walked me through company info and CTC breakdown. She has some budget constraints or approval constraints so not sure why they called and conducted this whole process.
General and scenario based hadoop and spark questions
Aggregate cumulative statistics with a 1h moving window
Related Hadoop
Wie sehen Sie sich in den 5Jahren?
Can you describe some projects that you did at the university?
Difference between RDD and dataframe Difference between tuple and list Some spark questions Tell me about your project
Difference between RDD and dataframe Difference between tuple and list Some spark questions
Viewing 1761 - 1770 interview questions