What is combinebykey SCD1 logic Different between edge node and data node Where the code will be deployed? (edge node or in cluster) YARN architecture What are all the versions of spark you have worked? Diff btw SchemaRDD and df Different ways to create dataframe what is bundle in oozie? fork action in oozie? distcp command how do you decide number of mappers in sqoop job? what is the optimal number of mappers provided there is no restriction in establishing connection to DB? how to do you pull clob,blob datatype in oracle to HDFS? semi join,anti-join in scala diff between logical plan and physical plan where can we see logical plan?
Ingeniero De Big Data Interview Questions
1,228 ingeniero de big data interview questions shared by candidates
interview questions were mostly from experience and easy.
Q: When did you analyse data?
Why Kubrick?
Design an app that uses more than one data type
Garbage collection & JVM internals. Unique vs primary key. Clustered vs non-clustered indexes.
Can you describe some projects that you did at the university?
Two parts - working with API and an advanced SQL question
Repartiton and colaesce, why we are going for shuffuling if any image processed data are incoming what you choose to process many spark sql questions
What new tech are you interested in?
Viewing 1181 - 1190 interview questions
See Interview Questions for Similar Jobs
Ingeniero De Ciencias De DatosIngeniero De Mineria De DatosIngeniero De DatosCientífico De Minería De DatosCientífico De Datos PrincipalCientífico De Datos LíderIngeniero De Datos De LíderIngeniero De AnalíticaCientífico De DatosSr. científico De DatosIngeniero De Investigación De SoftwareCientífico De Datos SeniorSistemas De Información ComputacionalPasante Científico De DatosCiencias De DatosCientífico De Datos AsociadoDesarrollador Fase De DatosCientífico De La Computación