Published inTowards Data ScienceRunning dbt using Gitlab CI/CDThe easiest way to deploy, run and schedule dbt for freeMay 2, 20225May 2, 20225
A Dockerized dbt WorkflowHow to setup dbt using docker containers with an RPC server for DagsterApr 30, 20221Apr 30, 20221
Published inTowards Data ScienceAbstracting Data Loading with Airflow DAG FactoriesCreating an abstraction layer for improved scalability and usability for loading Google Sheets data with AirflowNov 30, 20212Nov 30, 20212
Published inTowards Data ScienceData lake in S3 from MongoDBUsing Python to upload MongoDB data to AWS S3 to build a data lakeOct 17, 20212Oct 17, 20212
Published inTowards Data ScienceHow we use Airflow for SQL alerts and Slack notificationsUsing Airflow to get notified over Slack when things don’t go as planned while loading dataJul 12, 2021Jul 12, 2021
Published inTowards Data ScienceHow to Build a Custom Estimator for scikit-learnImplementing a custom ensemble model with under-sampling for imbalanced dataFeb 21, 20212Feb 21, 20212
Bayesian Weight LossIn a previous post, I discussed the concept of linear regressions in the realm of Bayesian statistics. I will do something similar in this…Feb 12, 2021Feb 12, 2021
Published inTowards Data ScienceCrazy Simple Anomaly Detection for Customer SuccessThe topic of anomaly detection is fascinating. There is a vast number of methods that can be used, from simple statistics to more complex…Feb 9, 2021Feb 9, 2021
Published inThe StartupBayesian CAPM Beta EstimationContinuing with my (mostly) healthy obsession with Bayesian statistics (see my previous article), in this article, I’ll use a linear…Feb 8, 2021Feb 8, 2021
Published inThe StartupBayesian A/B Testing WhatsApp MessagesA/B testing from a Bayesian perspective to determine the best WhatsApp message to send to customersNov 17, 20201Nov 17, 20201