StatOfMind


A melting pot of statistics, machine learning and data visualization.

The biggest liars in US politics

Who lies the most in US politics? Most Americans, and anyone who follows US politics, will be aware of the tremendous changes and volatility that have struck the US political landscape in the past year. The ascent of Donald Trump from a billionaire entertainer to a fully fledged presidential candidate,...


Data science with Docker

Using Docker to facilitate your data science pipelines

Until recently, and like many other fellow data scientists I have talked to, I built data science pipelines on my local machine or a remote host while relying on virtual environments. In doing so, I ensured some degree of replicability by keeping...


Player and roster similarity in the NBA

Recently, professional sports associations and teams have made big strides towards leveraging data to inform both personnel and on-the-field decision making. While the four major leagues (NBA, NFL, MLB, NHL) vary in terms of where they are in that process, most people would argue that the NBA is at the...


Tracking Social Issues and Topics in Presidential Speeches

Scraping presidential transcripts

To begin, we must scrape the content of all presidential speeches recorded in American history. To do that, I’ll rely on the very handy BeautifulSoup library, and eventually store all data in a pandas DataFrame that will be persisted in a pickle file. # import required libraries...


Cloning a graph in Python

If you have ever played around with Algorithms & Data Structures, then you have most likely heard of Leetcode.com, which contains a number of famous (or infamous) technical questions. One of my favorites there is the graph clone question, which can be briefly stated as: Clone an undirected...