Amex Scorecard Modelling

Amex Scorecard Modelling

PythonXGBoostPandasScikit-learn
2024-03-15

A comprehensive credit risk model utilizing the AMEX dataset to predict default probabilities with high accuracy.

The Problem

Predicting credit default is crucial for banks. Using the AMEX dataset, I explored how to build a robust scorecard model.

My Approach

  1. Data Cleaning: Handling missing values using KNN imputation.
  2. Feature Engineering: created aggregated features based on customer transaction history.
  3. Modeling: Used XGBoost and CatBoost.

Results

We achieved a Gini score of 0.78, which puts this model in the top 10% of submissions.