Aaron Hume in The Startup · Difference Between ARIMA and ARCH Models for Time Series Analysis: · Auto Regressive Integrated Moving Average (ARIMA) models and a similar concept known as Auto Regressive Conditional Heteroskedasticity… · 4 min read · Dec 6, 2020
Aaron Hume in The Startup · Your Very Own Recommender System: What Shall We Eat? · Recommender Systems rely on the concept of similarity or proximity of your data. This relates to 2D coordinates on a grid, and the… · 5 min read · Dec 6, 2020
Aaron Hume in The Startup · Using Synthetic Data for Imbalanced Classes in a Classification Model: · Sometimes for a classification problem we will have a clear and important distinction for our target classes. Even here though, we can… · 4 min read · Dec 6, 2020
Aaron Hume · Classifying Text Content with TF-IDF · In Natural Language Processing (NLP), word vectorization offers several options for assigning numerical values to text data. It is this… · 4 min read · Dec 5, 2020
Aaron Hume in The Startup · Ames Housing Prices Reconsidered Part 2: Insights With PCA · In Part 1 we went over how to encode categorical features that are either nominal or ordinal. Completing this process gave us a DataFrame… · 5 min read · Dec 5, 2020
Aaron Hume · Ames Housing Prices Reconsidered Part 1: Simple Encoding · The Ames, Iowa housing prices dataset (Ames data) offers a great opportunity to explore the tools available to us as Data Scientists… · 4 min read · Dec 5, 2020