Numpy Exercises 1-50

First 50 numpy exercises This is a set of exercises collected by Rougier. All credits to Rougier for curating this list. I am simply trying to solve it for practice and hoping it serves as a reference for others. I am surprised I didn’t come across it before. View this post in a Jupyter Notebook This is intended to serve as a stepping stone to becoming a better Data Scientist / Machine Learning Researcher....

October 29, 2023 · 28 min · Kaushik Moudgalya

Demystifying the mathematics behind PCA

Demystifying the mathematics behind PCA We all know PCA and we all love PCA. Our friend that helps us deal with the curse of dimensionality. All data scientists have probably used PCA. I thought I knew PCA. Until I was asked to explain the mathematics behind PCA in an interview and all I could murmur was that it somehow maximizes the variance of the new features. The interviewer was even kind enough to throw me a hint about projections....

October 7, 2023 · 23 min · Kaushik Moudgalya

NHL Data Science Project

Implemented a complete Data Science pipeline: data collection, tidying data, creation of synthetic features, basic and advanced interactive visualizations using plotly, tracking models through CometML, deploying models through a REST API using Docker and Flask.

December 23, 2021 · 1 min · Kaushik Moudgalya

Crop Harvest Classification

Given meteorological and satellite data, predicted land as either crop or non-crop land. Used techniques such as AutoML (Light AutoML and PyCaret) as well as blending and stacking to reduce bias and generalize better. Check out the doc link for a more detailed overview of the project.

November 30, 2021 · 1 min · Kaushik Moudgalya

Weather Events Classification

Classified events as either standard background conditions / tropical cyclones or atmospheric rivers. Used techniques such as SMOTE and SMOTE Tomek to fix class imbalance, hyperparameter tuning using HalvingRandomGridSearchCV, manual feature engineering, and a plethora of sklearn classification algorithms. Check out the doc link for a more detailed overview of the project.

October 15, 2021 · 1 min · Kaushik Moudgalya

Anime Project

My current magnum opus. Consecutively scraped a ton of images for the top 100 anime followed by scraping even more than a ton of images of the top 10 characters in each anime. The goal was to try to identify anime using a ML model and when successful, we planned to identify the characters in the image as well. WIP.

July 23, 2021 · 1 min · Kaushik Moudgalya

Whitepaper: Ethically mitigating biases in DS / ML

The paper provides techniques and a checklist to prevent bias from creeping into Machine Learning models. (Co-author)

January 30, 2021 · 1 min · Kaushik Moudgalya

DataRobot Ideation Challenge

This challenge was a competition hosted within Deloitte, on a platform called DLabs with cash prizes for podium finishes. It was a Kaggle style competition with public and private leaderboards. Our team managed to place Second.

October 30, 2020 · 1 min · Kaushik Moudgalya

Zomato Ratings Prediction Challenge

View this project on Github This challenge was a competition hosted within Deloitte, on a platform called DLabs with cash prizes for podium finishes. It was a Kaggle style competition with public and private leaderboards. Our team managed to place Third. I worked on this competition with Navaneesh Gangala. I was responsible for cleaning data, validating the integrity of the cleaned data, engineering features, training models, using AutoML and ensembling models, etc....

July 11, 2020 · 1 min · Kaushik Moudgalya

DL Paper Notes

View this project on Github My notes on a few Deep Learning papers. Hopefully, I improve my understanding of Deep Learning in the process (also helps when I want to revisit the paper) while making it easier for other people to read papers.

September 22, 2019 · 1 min · Kaushik Moudgalya