Data scientist interviews at Google are challenging. The questions are difficult, specific to Google, and cover a wide range of topics.
The good news is that the right preparation can help you maximize your chances of landing a job offer. We’ve analyzed 90+ data scientist interview questions reported by real Google candidates to determine which topics come up most frequently.
Below you’ll find our ultimate guide for success, including example questions, links to solutions, and a preparation plan to help you land that Google data scientist role.
- Process and timeline
- Example questions
- Preparation tips
1.1 What interviews to expect
What's the Google data scientist interview process and timeline?
It typically takes three to six weeks and follows these steps:
- Application and referrals
- Recruiter screen (~30 min)
- Technical screen (~45-60 min)
- Onsite interviews (5 interviews, 45 min each)
Note that the exact process varies slightly between positions, as Google data scientists may be working in research, product analysis, or in other areas. Your recruiter will send you information at the beginning of the process, which will detail what interviews you can expect.
Let's look at each of these steps in more detail below:
1.1.1 Application and referrals
Step one: apply. As this guide focuses primarily on practice questions, the application overview section will be brief. Note that the Google data scientist role used to be titled “quantitative analyst,” and some job postings may still retain that term.
The online application requires a resume and basic applicant information, with an optional cover letter. According to Google, there is no guarantee that your cover letter will be considered. In your resume, be sure to highlight leadership experience, include specific data and success metrics for each role, and keep it to one page. Double-check using Google's own hiring guide.
After applying online, you will receive an email containing a pre-screen questionnaire of introductory technical questions before moving on to the full technical screen. If you are prepared for the full technical screen, this preliminary questionnaire will not be difficult. Applicants with referrals or who have been contacted directly by recruiters via LinkedIn may be able to skip the pre-screen online assessment.
1.1.2 Recruiter phone screen
After this, many but not all applicants talk to an HR recruiter on the phone. This call is an opportunity to learn more about the interview process ahead of you as well as how to prepare. Come ready to answer questions about your professional background and why you’re interested in Google. As this is followed by a technical screen, candidates are rarely asked coding or statistical questions at this step.
However, some candidates report passing directly to the technical screen after the initial application. In this case, the technical screen will include a few background questions that would otherwise have been asked in the recruiter call. Be sure to specify with your recruiter ahead of time what type of call you’ll be receiving so that you can come prepared.
1.1.3 Technical screen
After the application and pre-screen questionnaire, you’ll move on to a video call with a hiring manager, recruiter, or one of Google’s data scientists. This will take place over Google Hangouts. In very rare cases, you may undergo two technical screens before the onsite round. If you’re not sure what to expect, check in with your recruiter.
We’ll go into greater detail on the questions themselves later in this article, but in general be prepared for a few background questions, followed by statistical questions and coding. You’ll be coding live in a language of your choice on a shared document. Clear communication is important to Google, so practice talking through your reasoning simply and coherently as you work.
1.1.4 Onsite interviews
The final and toughest stage of the Google data scientist interview process is the onsite portion. Typically this involves five rounds of interviews that last about 45 minutes each, with time for lunch in the Google cafeteria. Other than lunch, you may have little to no breaks between interviews.
You will need to be prepared for many types of questions during the onsite interviews, with an emphasis on applied statistics in business case scenarios. Prepare for a higher level of difficulty than the questions presented during the technical screen.
More specifically, there are four main categories of questions that you’ll have to answer. We’ll give you practice examples later, but here’s a summary:
- Statistics and machine learning questions, where you’ll be tested both on general statistical principles and definitions as well as probability and applied machine learning.
- Coding questions: where you’ll demonstrate both technical skills and statistical problem solving via SQL querying and programming in the language of your choice.
- Product sense questions, where you’ll need to apply your statistical and coding skills to test and drive business and product decisions.
- Behavioral questions, where Google will test your culture fit through your past experiences and current motivations.
After you’ve completed the onsite rounds of interviews, you should receive feedback in a matter of weeks.
[COVID note] It's possible that your onsite interviews will be held virtually instead of in-person, given the COVID-19 pandemic. However, your recruiter should be able to provide you the most up-to-date information on Google's onsite interview procedures. Feel free to ask your recruiter for details once you've been officially invited to participate in the onsite interviews.
1.2 What exactly is Google looking for?
At the end of each session your interviewer will grade your performance using a standardized feedback form that summarizes the attributes Google looks for in a candidate. That form is constantly evolving, but we have listed the main components we know of at the time of writing this article below.
A) Questions asked
In the first section of the form the interviewer fills in the questions they asked you. These questions are then shared with your future interviewers so you don't get asked the same questions twice.
B) Attribute scoring
In the next section, each interviewer will assess you on the four main attributes Google looks for when hiring:
- General cognitive ability. This is often referred to as "GCA" by Googlers. The company wants to hire smart data scientists who can learn and adapt to new situations. Here your interviewer will try to understand how you solve hard problems and how you learn. Thus, it’s important to have a discussion with your interviewer rather than rushing to an underdeveloped answer.
- Role-related knowledge and experience. This is often referred to as "RRK" or "RRKE" internally. The company wants to make sure that you have the right experience, domain expertise and competencies for the position you're applying for. For instance if you're applying for a position titled “Product Analyst, Data Science” then you'll be expected to have in-depth knowledge about Google’s line of products.
- Leadership. Google looks for a particular type of leadership called “emergent leadership.” You'll typically be working in cross-functional teams at Google, and different team members are expected to step up and lead at different times in the lifecycle of a project when their skills are needed.
- Googleyness (i.e. culture fit). The company wants to make sure Google is the right environment for you. Your interviewer will check whether you naturally exhibit the company's values, including comfort with ambiguity, a bias to action, and a collaborative nature.
Depending on the exact job you're applying for, these attributes might be broken down further, but the total number of attributes does not usually exceed six or seven.
In this middle section, Google's interviewers typically repeat the questions they asked you, document your answers in detail, and give you a score for each attribute (e.g. "Poor", "Mixed", "Good", "Excellent").
C) Final recommendation
Finally, interviewers will write a summary of your performance and provide an overall recommendation on whether they think Google should be hiring you or not (e.g. "Strong no hire", "No hire", "Leaning no hire", "Leaning hire", "Hire", "Strong hire").
1.3 What happens behind the scenes
If things go well at your onsite interviews here is what the final steps of the process look like:
- Interviewers submit feedback
- Hiring committee recommendation
- Team matching
- Senior leader and Compensation committee review
- Final executive review (only for senior roles)
- You get an offer
After your onsite, your interviewers will all submit their feedback usually within two to three days. This feedback will then be reviewed by a hiring committee, along with your resume, internal referrals, and any past work you have submitted. At this stage, the hiring committee will make a recommendation on whether Google should hire you or not.
If the hiring committee rules in your favor, you'll usually start your team matching process. In other words, you'll talk to hiring managers and one or several of them will need to be willing to add you to their team in order for you to get an offer from the company.
In parallel, the hiring committee recommendation will be reviewed and validated by a senior manager and a compensation committee who will then decide how much money you are offered. Finally, if you are interviewing for a senior role, a senior Google executive will review a summary of your candidacy and compensation before the offer is sent to you.
As you've probably gathered by now, Google goes to great lengths to avoid hiring the wrong candidates. This hiring process with multiple levels of validations helps them scale their teams while maintaining a high caliber of employees. But it also means that the typical process can spread over many weeks and sometimes months.
Let’s get into the four primary categories of questions you’ll answer during the Google data science interview:
- Statistics and Machine Learning (56%)
- Coding (26%)
- Product Sense (9%)
- Behavioral (9%)
In the sections below, we've put together a high-level overview of each type of question. In addition, we've compiled a selection of real Google data scientist interview questions, according to data from Glassdoor. We've edited the language in some places to improve the clarity or grammar, and we've included a link to a solution when viable.
Use these example questions to start practicing for your interviews. If you're looking for even more practice questions, check out our general data science interview prep guide.
Google’s data scientists have to derive useful insights from large, and potentially complex, datasets. Thus it’s imperative to have a strong understanding of statistics. Out of all the question categories, general statistics and statistical probability come up the most often in all stages of the interview process. Take extra time to study this section.
Review fundamental statistics and how to give concise explanations of statistical terms, with an emphasis on probability. Some general topics that have come up before include p-values, MLE, confidence intervals, and Bayes theorem. In addition to these general topics, you’ll find complete questions to work through below.
Your interviewer will also ask questions specific to machine learning, as Google data scientists must build algorithms that improve and remain accurate over time. General topics that have come up before include regression models, feature selection, and recurrent neural networks.
Let's get to the example questions.
Google data scientist interview questions - Statistics and machine learning
- In what situation would you consider mean over median?
- For sample size n, the margin of error is 3. How many more samples do we need to make the margin of error 0.3?
- What is the assumption of error in linear regression? (Solution)
- Given data from two product campaigns, how could you do an A/B test if we see a 3% increase for one product?
- I have a deck and take one card at random. What is the probability you guess it right?
- Explain a probability distribution that is not normal and how to apply that.
- Given uniform distributions X and Y and the mean 0 and standard deviation 1 for both, what’s the probability of 2X > Y? (Solution)
- There are four people in an elevator and four floors in a building. What’s the probability that each person gets off on a different floor?
- Make an unfair coin fair. (Solution)
- If the labels are known in a clustering project, how would you evaluate the performance of the model?
- Why use feature selection? (Solution)
- If two predictors are highly correlated, what is the effect on the coefficients in the logistic regression? What are the confidence intervals of the coefficients?
- What is the difference between K-mean and EM?
- When using a Gaussian mixture model, how do you know it is applicable?
Google data scientists work with the company's vast datasets to understand and solve real-world problems. So expect Google interviewers to test you on statistical coding, SQL, and some data analysis.
As statistical questions come up the most often, be ready to write functions that solve problems related to statistical analysis and probability. Most candidates report coding with Python, but you may use your preferred programming language.
SQL questions are the second most frequent technical topic, followed by a mixture of data structures and modeling. Google is looking for candidates who know how to use and manipulate important data. So practice running SQL queries quickly and with proper syntax.
Let's get into the interview questions below.
Google data scientist interview questions - Coding
Statistical coding questions
- Write a function to generate N sample from a normal distribution and plot the histogram. (Solution)
- Write code to generate iid draws from distribution X when we only have access to a random number generator.
- Coding in R, multiply all a[i,j] in a i rows j columns dataset.
- Given a list of characters, a list of prior probabilities for each character, and a matrix of probabilities for each character combination, return the optimal sequence for the highest probability.
- How would you find the top 5 highest-selling items from a list of order histories?
- Can you explain how SQL works?
- Given three columns of data, how would you compare the first three to the last three?
- How do you calculate the median for a given column of numbers in a data set?
Google's data scientists must be able to use their technical skills in order to drive concrete business decisions. Through a variety of techniques, data scientists generate insights that are ultimately used to test and improve Google's products as well as the company as a whole.
So come prepared to apply your technical knowledge to business case scenarios. For example, Google tends to ask questions that use statistical A/B testing to compare the performance of their products and services. You should also be prepared for questions about product metrics and how they could be improved. Don’t forget to familiarize yourself with Google’s main products in advance.
Practice using the example questions below.
Google data scientist interview questions - Product sense
- You have a google app and you make a change. How do you test if a metric has increased or not? (Solution)
- How do you detect viruses or inappropriate content on YouTube?
- How would you compare if upgrading the android system produces more searches?
- The outcome of an experiment is that 5% of one group clicks more. Is that a good result?
- How would you remove bias and make inferences from data about two ad campaigns?
- Given there are no metrics being tracked for Google Docs, a product manager comes to you and asks, what are the top five metrics you would implement?
In addition to the question types highlighted above, you can expect to be asked behavioral or "resume" questions about your past work experience and your motivation for applying to Google. Your interviewers are looking for you to demonstrate your “Googleyness” (see section 1.2 B) as well as your ability to communicate clearly.
If you're applying directly to a job posting, be strategic by aligning your answers for behavioral questions with the top qualifications that are listed in the job description. Below you’ll find the real behavioral interview questions reported by data scientist candidates, as well as a few extras reported by Google software engineer candidates.
Practice using the example questions below.
Google data scientist interview questions - Behavioral
- Why Google?
- How do you sort your priorities when engaged in multitasking?
- Describe a past project you worked on.
- In what direction do you see your career moving?
- Do you prefer working in small or large teams?
What’s your favorite Google product and why? (software engineer question)
Tell me about a time you had to handle a project that was late. (software engineer question)
- Tell me about a time you used data to make a critical decision. (software engineer question)
Now that you know what questions to expect, let's focus on how to prepare. Here are the four preparation steps we recommend to help you get an offer as a Google data scientist.
3.1 Learn about Google's culture
Most candidates fail to do this. But before investing tens of hours preparing for an interview at Google, you should take some time to make sure it's actually the right company for you.
Google is prestigious and it's therefore tempting to assume that you should apply, without considering things more carefully. However, it's important to remember that prestige alone won't make you happy in your day-to-day work. What will make you happy is what you’ll actually be doing as well as the people you'll be working with.
If you know data scientists who work at Google or used to work there, talk to them to understand what the culture is like. In addition, we would recommend reading the following resources:
- Google's mission statement (by Google)
- Google's values (by Google, archived 2016)
- Google strategy teardown (by CBS Insights)
3.2 Practice by yourself
As mentioned above, you'll encounter four main types of interview questions at Google: statistics and machine learning, coding, product sense, and behavioral. To get an idea of real-life problems that Google data scientists have to tackle on the job, take a look at The Unofficial Google Data Science Blog.
For statistics and machine learning interview questions, we'd recommend brushing up on the fundamentals using Google’s own technical development guides. Brilliant.org offers online courses designed around statistical probability and other useful topics, some of which are free. Search for specific questions and answers around statistics, machine learning, data analysis, and others on StackExchange. Finally, you can post your own questions and discuss topics likely to come up in your interview on Reddit’s statistics and machine learning threads.
For coding interview questions, start with the video below that shows a step-by-step method for answering coding questions. It is aimed toward Amazon software development but may be useful for any type of coding.
Practice the method using example questions such as those in section 2.1, or those relative to similar Google positions (e.g. Google software engineer coding questions).
Also, practice SQL and programming questions with medium and hard level examples on leetcode, and explore Google's Tech Dev resource library for more questions. For extra help with SQL, read this analysis of the 3 "types" of SQL problems. Note that in the onsite rounds you’ll likely have to code on a whiteboard without being able to execute it, so practice writing through problems on paper.
For product sense interview questions, you're dealing with problems that are similar to what product managers at Google would work on. As a result, we'd recommend studying our product management guides on metric, favorite product, product improvement, and estimation questions. These guides will equip you with a method for answering the majority of the product / business sense questions you're likely to encounter as a data scientist candidate. Again, study up on Google’s main products so you’re ready to dive into your reasoning about them.
For behavioral interview questions, we recommend learning our step-by-step method for answering behavioral questions. You can then use that method to practice answering the example questions provided in section 2.4 above.
Finally, a great way to practice all of these different types of questions is to interview yourself out loud. This may sound strange, but it will significantly improve the way you communicate your answers during an interview. Play the role of both the candidate and the interviewer, asking questions and answering them, just like two people would in an interview. Trust us, it works.
3.3 Practice with peers
Practicing by yourself will only take you so far. One of the main challenges of data scientist interviews at Google is communicating your different answers in a way that's easy to understand.
As a result, we strongly recommend practicing with a peer interviewing you. If possible, a great place to start is to practice with friends. This can be especially helpful if your friend has experience with data scientist interviews, or is at least familiar with the process. You can also find peers to practice with on our free mock interview platform.
3.4 Practice with ex-interviewers
Practicing with peers can be a great help, and it's usually free. But at some point, you'll start noticing that the feedback you are getting from peers isn't helping you that much anymore. Once you reach that stage, we recommend practicing with ex-interviewers from top tech companies.
If you know a data scientist or someone who has experience running interviews at Google or another big tech company, then that's fantastic. But for most of us, it's tough to find the right connections to make this happen. And it might also be difficult to practice multiple hours with that person unless you know them really well.
This problem is exactly why we're preparing to launch a coaching service for data scientists, where you can get one-on-one practice and feedback from Google ex-interviewers. Drop-in your email address below to get notified when we launch!
If you have any questions about Google data scientist interviews, do not hesitate to ask them in the comments below. All questions are good questions, so go ahead!