Home

Welcome To My Data Blog

Hi I' am Pascal

Hi I' am Pascal

Thanks for checking out my blog. You can find all kinds of blog posts about R, Python, statistics, and R Shiny on here. Enjoy exploring and feel free to leave comments or message me directly at pascal.sfu.ca.  

 

I also created a website from scratch with Shiny at https://pascal-schmidt-ds.com where you can find my interactive resume and also some posts and personal projects. It is still under construction but will be finalized soon. 

Blog Posts

#TidyTuesday – Which Univeristies Have Had the Most Increase in Tuition Costs?

In this blog post, I will be going over my first tidytuesday data set. This data set is about universities in the United States and their tuition. In total, there are three different data sets and they can be explored from many different angles. I decided to look at universities how the tuition cost of universities has been developed over… Read More

Tidyr’s pivot_longer() and pivot_wider() Examples From the #TidyTuesday Challenge

In today’s blog post, we will be talking about tidyrs pivot_wider() and pivot_longer() functions in the context of the brain injuries #TidyTuesday challenge. The two functions above have been developed to make it easier for R users/programmers to have a better mental model about the transformations on the data frames. Unlike gather and spread, pivot_longer() and pivot_wider() are the opposites… Read More

Two Sum LeetCode – Four Algorithms Implemented in R

In this post, we will be going over the two sum LeetCode question in R. We will be going to cover a total of 4 algorithms with different data structures. In the end, we will see which algorithm performs best with a ggplot2 visualization. Algorithms & Data Structures we are using: Double for loop Base R built-in function which() Named… Read More

Build a Histogram in R From Scratch – Resembling the hist() Function in R

In this tutorial, we will be covering how to create a histogram in R from scratch without the base hist() function and without geom_histogram() or any other plotting library. We will do this by only using the plot() and lines() functions in base R. For our histogram, we will be developing two functions. One function creates the desired intervals or… Read More

Analyzing Web Data with Google Search Console and searchConsoleR in R Studio

In my last blog post, we investigated my website with the data from Google Analytics and the Google Analytics API. Today, we will be using Google’s search console and its API to pull data into R Studio and then analyze it. If you are interested in my analysis with the Google Analytics API and the googleAnalyticsR package, then check out… Read More

Analyzing My Website With Google Analytics, R and googleAnalyticsR

Today, we will be going over some basic analytics of my data science blog (thatdatatho.com). We will be looking at my page views,  what articles generate the most traffic and a few other things. I have been blogging now for a bit over two years and I started the blog to become a better data scientist and to share my… Read More

R Shiny App of Pokemon Data Set

Recently, I looked through my university folder and found a project that I have created in my STAT 240 class. Then I thought about this picture here. Therefore, I decided to publish my first ever Shiny app where I analyzed a Pokemon data set. If you want a better layout, then visit the visualizations on the Shiny server. Code: ###############… Read More

Multiple Ways of Doing Vectorization in R – Speeding up For Loops

Today, we will be talking about vectorization in R. This topic is especially important to R because for loops do not run very fast. So instead of using the great Rcpp library, which requires C++, we can take advantage of vectorization. What we will be covering: What is vectorization?  Base R implementation of a vectorized function  Vectorization with base::vectorize()  Vectorization… Read More

Data Science Jobs