Data Science Internship Interview Questions

40,355 data science internship interview questions shared by candidates

SoftDev (I chose Python): you have a sorted array of strings interspersed with empty strings (e.g. ["abc","","","","cef","","","dej",""]) and you want to know whether a given string is in this array. How do you do it with O(log n) complexity in the general case? Then how do you find how many times it appears in the array?

Data Scientist Intern

Interviewed at Microsoft

4
May 9, 2017
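
One possible approach to the question above, as a minimal sketch rather than the interviewer's expected answer: run a binary search, and whenever the midpoint is an empty string, step outward to the nearest non-empty entry inside the current window. The function names (`contains`, `count_occurrences`) and the helpers are hypothetical, and the target is assumed to be non-empty. The search is O(log n) when empty runs are short, but degrades toward O(n) when most of the array is empty.

```python
def _nearest_nonempty(arr, lo, hi, mid):
    """Return the index of the non-empty string closest to mid within [lo, hi], or -1."""
    left, right = mid, mid + 1
    while left >= lo or right <= hi:
        if left >= lo:
            if arr[left] != "":
                return left
            left -= 1
        if right <= hi:
            if arr[right] != "":
                return right
            right += 1
    return -1


def _bound(arr, target, find_first):
    """Binary search for the first (or last) index holding target; -1 if absent."""
    lo, hi = 0, len(arr) - 1
    result = -1
    while lo <= hi:
        mid = _nearest_nonempty(arr, lo, hi, (lo + hi) // 2)
        if mid == -1:  # only empty strings remain in the window
            break
        if arr[mid] == target:
            result = mid
            if find_first:
                hi = mid - 1  # keep looking to the left
            else:
                lo = mid + 1  # keep looking to the right
        elif arr[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return result


def contains(arr, target):
    """True if the non-empty target occurs in the sparse sorted array."""
    return _bound(arr, target, find_first=True) != -1


def count_occurrences(arr, target):
    """Count occurrences of target; every non-empty entry between the bounds equals target."""
    first = _bound(arr, target, find_first=True)
    if first == -1:
        return 0
    last = _bound(arr, target, find_first=False)
    # This scan is linear in the width of the matching span, empties included.
    return sum(1 for s in arr[first:last + 1] if s != "")
```

For example, `contains(["abc", "", "", "", "cef", "", "", "dej", ""], "cef")` finds the string on the first probe. Counting relies on the array being sorted, so all copies of the target sit in one span that may still contain empty strings.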

1. How do you use a neural network (NN) to reduce dimensionality?
2. Can you model a time series as a linear regression model?
3. a) Can you use resampling methods like bagging to estimate the maximum of a population? b) Why is bagging a variance reduction scheme?
4. Why is using minibatches to minimize a function computationally more efficient than other methods?
5. The gambler's ruin problem.
6. Assume that some data are missing in a time series. How do you handle that? A: Average out the existing values. Follow-up: okay, so you want to average out the existing values, but how do you define the new time series as a single function? A: Use a characteristic (indicator) function.

Data Scientist

Interviewed at Criteo

3.9
Nov 22, 2018
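
The answer to question 6 can be sketched in code. This is one reading of it, not necessarily what the interviewer had in mind: let the indicator m_t be 1 where x_t is observed and 0 where it is missing, and define the new series as y_t = m_t * x_t + (1 - m_t) * mean, where mean is the average of the observed values. The function name `fill_with_observed_mean` and the use of NaN to mark missing values are assumptions made for illustration.

```python
import math


def fill_with_observed_mean(series):
    """Return (filled, indicator) for a list of floats where NaN marks a missing value."""
    observed = [x for x in series if not math.isnan(x)]
    if not observed:
        raise ValueError("series has no observed values")
    mean = sum(observed) / len(observed)

    # Indicator function: 1 where the value is observed, 0 where it is missing.
    indicator = [0 if math.isnan(x) else 1 for x in series]

    # y_t = m_t * x_t + (1 - m_t) * mean, written as a conditional because
    # 0 * NaN is still NaN in floating-point arithmetic.
    filled = [x if m == 1 else mean for x, m in zip(series, indicator)]
    return filled, indicator
```

Calling it on `[1.0, float("nan"), 3.0, float("nan"), 5.0]` replaces both gaps with the observed mean and returns the mask alongside, so downstream code can still tell which points were imputed.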

