Published in The Startup. Difference Between ARIMA and ARCH Models for Time Series Analysis: Auto Regressive Integrated Moving Average (ARIMA) models and a similar concept known as Auto Regressive Conditional Heteroskedasticity… (Dec 6, 2020)
Published in The Startup. Your Very Own Recommender System: What Shall We Eat? Recommender Systems rely on the concept of similarity or proximity of your data. This relates to 2D coordinates on a grid, and the… (Dec 6, 2020)
Published in The Startup. Using Synthetic Data for Imbalanced Classes in a Classification Model: Sometimes for a classification problem we will have a clear and important distinction for our target classes. Even here though, we can… (Dec 6, 2020)
Classifying Text Content with TF-IDF. In Natural Language Processing (NLP), word vectorization offers several options for assigning numerical values to text data. It is this… (Dec 5, 2020)
Published in The Startup. Ames Housing Prices Reconsidered Part 2: Insights With PCA. In Part 1 we went over how to encode categorical features that are either nominal or ordinal. Completing this process gave us a DataFrame… (Dec 5, 2020)
Ames Housing Prices Reconsidered Part 1: Simple Encoding. The Ames, Iowa housing prices dataset (Ames data) offers a great opportunity to explore the tools available to us as Data Scientists… (Dec 5, 2020)