# learning-from-imbalanced-classes **Repository Path**: ldu_1_gch_liu/learning-from-imbalanced-classes ## Basic Information - **Project Name**: learning-from-imbalanced-classes - **Description**: Learning From Imbalanced Classes - **Primary Language**: HTML - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 1 - **Created**: 2020-03-21 - **Last Updated**: 2022-01-22 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # learning-from-imbalanced-classes This repo corresponds to a blog post I wrote discussing how to learn from data with imbalanced classes. The blog post is here: http://www.svds.com/learning-imbalanced-classes/. In this directory you'll find two Python Jupyter notebooks illustrating two points made in that blog post. `Gaussians.ipynb` is an interactive notebook that allows you to play with varied sampling from two Gaussian distributions to see what logistic regression does with the points as the mixtures are varied. `ImbalancedClasses.ipynb` illustrates a method called **blagging** (basically, downsampled bagging) on a domain called [Glass](https://archive.ics.uci.edu/ml/datasets/Glass+Identification) from the UCI Repository. It goes through steadily more imbalanced versions of the domain, testing different algorithms and showing the results.