Welcome To My Data Blog

Hi, I am Pascal

Thanks for checking out my blog. Here you can find blog posts about R, Python, statistics, and R Shiny. Enjoy exploring, and feel free to leave comments or message me directly at pascal.sfu.ca.

 

I also created a website from scratch with Shiny at https://pascal-schmidt-ds.com, where you can find my interactive resume along with some posts and personal projects. It is still under construction but will be finalized soon.

Blog Posts

Scrape Wikipedia using NodeJs

Wikipedia is a data-rich website that contains a large amount of information. This data can be used to make informed decisions or to train bots or neural networks. In this post, we are going to scrape Wikipedia using Node.js. We are going to target this page from Wikipedia. You can also read Web Scraping with Nodejs… Read More

Scraping Netflix Data with Python

As we all know, Netflix is an OTT platform where you can watch unlimited shows and movies. Literally unlimited! You can scrape Netflix to collect episode names, cast, ratings, similar shows, plan pricing, etc. Using this data, you can analyze what users are watching these days, which also helps with sentiment analysis. I will be using Python for scraping… Read More

Simplifying Parts Of A Shiny App by Creating Functions

Until now, the Shiny app we created has had all of its HTML in the UI. On top of that, we have repeated code that we can move into functions, which makes the app smaller and lets us test the functions we are creating. Here are Part 1 and Part 2 of the app. The UI Shiny Part Right… Read More

Building a Google Analytics Dashboard With R Shiny From Scratch – Part 2

Last week we started building the Google Analytics Dashboard for my blog. This week we are continuing by adding some functionality to the UI with a bit of JavaScript. I am going through the complete application on YouTube. Part 1 is on YouTube and my GitHub. We added panels with visualizations to the page that can be deleted when… Read More

Building a Google Analytics Dashboard With R Shiny From Scratch – Part 1

I participated in the R Shiny 2021 contest and published an application similar to the Google Analytics dashboard app. For that, I used the Google Analytics API and the Google Search Console API to pull my own data from my blog directly into the application. The application uses the shinyauthr library because the dashboard can be adjusted for each user who has… Read More

Vectorization, Purrr, and Mutate

Sometimes, R is a bit too intuitive, and the other day I wondered what was wrong with my code. The problem was vectorized functions within a mutate statement. I usually use the paste function and the ifelse function within mutate, so the vectorization is already automatic. However, for a specific task at work, I was working with a non… Read More

How I won an In-Class Kaggle Competition

During my 4th year as an undergraduate student in statistics, I took a Big Data class where we had to compete in a Kaggle competition. This competition was about imputing missing values in a time series weather data set acquired from 76 different weather stations in the United Kingdom. The data set consisted of 1,880,892 rows and 11 columns. In… Read More

The Question That Got Me My First Data Analyst Job

In this blog post, we will be exploring how I answered a question that got me my first job as a data analyst. It was an open-ended question and was supposed to be answered in a casual/theoretical way. The question was “How would you collect all cities in Qatar and how I would make sure in which municipality the cities… Read More

Data Science Jobs