Differences Between a Data Scientist, Data Analyst, and Data Engineer

December 3, 2018 By Pascal Schmidt Personal Project

Often times, there is some confusion about the different job titles in the data industry. After all, Data Scientist, Data Analyst, and Data Engineer sound pretty similar. Nevertheless, there is a big difference in the work and skill these three job titles do and need. So what is actually the difference between a Data Scientist, Data Analyst, and Data Engineer?

To answer this question, we went on Indeed’s job website and scraped data from Vancouver, Toronto, and Montreal for the terms Data Scientist, Data Analyst, and Data Engineer. We made a separate blog post last time where we analyzed what is needed to become a Data Scientist.

We wanted to know if there is a difference between a Data Analyst, Data Scientist, and Data Engineer in:

  • the level of education someone has to acquire to enter different data professions
  • the major one should pick in college and which qualifies best for certain positions
  • what technologies you should learn to become a Data Scientist, Data Analyst, or Data Engineer
  • what hard skills different data professions prefer (programming, machine learning)
  • what soft skills different data professions prefer (creativity, teamwork)
  • how many years of experience one need to enter certain fields

In the end we looked at who is hiring for a Data Scientist, a Data Analyst, or a Data Engineering position. We also looked at the job titles. So, what kind of job title would a Data Scientist have in comparison to a Data Analyst or in comparison to a Data Scientist. This is very interesting because it highlights the different job titles which make the distinctions between these positions very apparent. Also after reading this post, you know exactly what kind of job titles refer to the different professions and then in turn know what their requirements are (technologies, skills, etc.).

Let’s jump into the analysis!

What Level of Education is Required for Different Data Professions?

First, we wanted to know what the differences are for the level of education someone needs to pursue to get a job. Naturally, we would think that becoming a Data Scientist requires the highest level of education because the technical analysis can be sometimes very complex and has to be precise. Let’s see what Indeed says.

Difference of education between data analyst and data scientist

 

As we suspected, the highest degree, a PhD, is required for the Data Science position. The graph above shows that more than double the number of PhDs are required for becoming a Data Scientist in comparison to a Data Analyst or Data Engineering position.

The percentage of bachelor degrees is highest in the Data Engineering profession. I guess this is because a lot of these jobs require computer science degrees. It is often sufficient to get a bachelor’s in Computer Science whereas, for Statistics, Masters are preferred or even PhDs. But enough of the talk. Let’s just find out what these majors are.

What Major Should you Pick to Become a Data Scientist, Data Analyst, or Data Engineer?

We looked for Statistics, Mathematics, Computer Science, any type of Engineering, Finance, Economics, Operations Research, and Physics degree. In the end, we kept degrees in the graph that were mentioned at 5% of the time or more.

differences of majors between data analyst and data scientist

 

The graph above is pretty interesting. First, we see that the Data Engineering profession only requires Computer Science and engineering degrees (with most of them being software engineering). This makes a lot of sense considering that a Data Engineer is responsible for the construction, maintenance, and testing of databases and other large-scale systems. Data Engineers are usually dealing with a huge amount of data. All of which has to be properly stored and made easily accessible for Data Scientists or Data Analysts to analyze. Such skills are not acquired by a Statistics, Economics, or Finance degree.

Another interesting observation is that Economics and Finance are only preferred for Data Analysts. This is because a lot of Data Analysts are Business Analysts or Financial Analysts who are working in the finance sectors e.g. banks.

The proportion of Statistics is highest in the Data Scientist bar with around 17%.

What Technology Skills are Required for Data Analysts, Data Engineers, or Data Scientists?

 

difference of technologies between data scientist and data analyst

SQL skills are required for every position and a must-have skill. Version control (VC) like git or Bitbucket are also necessary for all three professions.

What is interesting is that Data Analysts are the only ones who are using Excel extensively and SAS to a certain extent. R is mostly used by Data Scientists and Data Engineers are the only ones who use java extensively.

Scala is a necessary skill for Data Engineers and appears 17% of the time. Scala is used in combination with Spark to deal with huge amounts of data. I am surprised that a Data Scientist also has to know Scala. Usually, I would say R, Python, SQL, and Version Control are the most important technologies to know for a Data Scientist.

What Hard Skills do Data People Need?

difference between data analyst and data scientist

The most important hard skills across all three job titles are good programming skills and a good knowledge of data structures and algorithms. In this case, it is a mix of data structures and algorithms and algorithms like random forest or support vector machines that you would need for machine learning.

Modeling is mostly required for Data Analysts and experimental design only appears in the Data Scientist profession. This is because of the statistics skills a Data Scientist need to have. Whereas the other professions are relying less on statistics.

What Soft Skills are Important?

 

difference between data analyst and data scientist

This graph shows basically the same for all three professions. Problem-solving skills, creativity, and communication skills are the most important soft skills for Data Analysts, Data Engineers, and Data Scientists. But really, this graph would probably be the same for every office job.

How Many Years of Experience are Required for Data Analysts, Data Scientists, and Data Engineers?

 

difference of years of experience data analyst vs. data scientist

It looks like Data Science people really need time to build their skill set because it requires the most experience. This matches our first graph where Data Scientists mostly need PhDs.

Employment Options for Data Analysts, Data Scientists, and Data Engineers

difference between data analyst and data scientist
differences between data scientist and data analyst
difference between data analyst and data scientist

Amazon is hiring a lot of Data Engineers!

What Kind of Job Titles can you Expect for different Data Professions?

This one is really important and interesting. Data Engineers are not always called “Data Engineers” and Data Analysts are also not always called “Data Analyst”. Whenever you hear a job title that fits the one of a Data Analyst but is not specifically called “Data Analyst”, then you can easily identify the job’s required skill set by knowing if the job belongs to the Data Analyst, Data Engineering, or Data Scientist profession.

difference between data analyst and data scientist
difference between data analyst and data scientist
difference between data analyst and data scientist

Data Analysts are most often called Business Analysts and Financial Analysts. That’s why this job title likes to employ people who majored in Economics or Finance. Data Analyst comes in third place which is pretty funny to me.

Data Engineering workers are all called engineers. Either Software Engineers, Data Engineers, DevOps Engineers, or even Sales Engineers.

The graph for the Data Scientist looks pretty boring.

What are the Differences Between a Data Scientist, a Data Analyst, and a Data Engineer?

In short, a Data Scientist is mostly involved in the statistical analysis of data. They have to know the underlying assumptions of models and have to assess whether a method can be applied to a particular data set. This job needs a ton of experience because every single data set out there looks different.

Because every data set is different, a Data Scientist can’t just apply the previous statistical method to the current data. A ton of experience is required with data to know what can be done and what is just bad practice. No one acquires these skills in school. A lot of work experience in the industry is required to become a good Data Scientist.

A Data Engineer is responsible to set up a system for storing and distributing data. Data Engineering people are very well paid because the skill set is very hard to acquire.

A Data Analyst works a lot with Excel and it is more of a reporting type of position. Heavy statistics and mathematics is not needed by Data Analysts. They usually do a lot of exploratory analysis and then present their data findings to the company.

I hope you enjoyed reading about the differences between a Data Analyst, a Data Scientist, and a Data Engineer. Let me know in the comments below which type of position you would like to pursue the most!

If you are interested in the code, then check it out on my Github.

Additional Resources

  • Here is another article from Toptal that explains well what it takes to be hired as a Data Scientist.

Post your comment