This repository contains the Jupyter notebook `catboost_hds.ipynb`, which demonstrates how to build a gradient-boosted decision-tree model with CatBoost to predict click-through outcomes from an ADM (Adaptive Decision Manager) historical dataset.

The notebook walks through every stage of a typical machine-learning workflow (data ingestion, cleaning, feature engineering, model training, evaluation, and explainability) while showcasing advanced CatBoost capabilities such as the following (a short code sketch follows the list):
- Automatic handling of categorical features without manual encoding
- Built‑in text processing via custom tokenizers
- Built‑in evaluation metrics (Logloss, AUC) and visualisation utilities
- Efficient SHAP value computation for model interpretability
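For orientation, here is a minimal, hypothetical sketch of how these pieces fit together. This is not the notebook code: the toy values are made up, column names follow the dataset table further down, and the exact tokenizer/dictionary/calcer option names should be verified against the CatBoost text-processing docs for your version.

```python
import pandas as pd
from catboost import CatBoostClassifier, Pool

# Toy stand-in for the real export; column names follow the dataset table below.
df = pd.DataFrame({
    "channel":          ["Web", "Mobile", "Web", "Email"],
    "country":          ["NL", "US", "US", "NL"],
    "propensity":       [0.12, 0.40, 0.33, 0.05],
    "Meta_keywords":    ["loan,credit", "travel,miles", "credit,card", "travel"],
    "Decision_Outcome": [0, 1, 1, 0],
})

pool = Pool(
    df.drop(columns=["Decision_Outcome"]),
    df["Decision_Outcome"],
    cat_features=["channel", "country"],   # used as-is, no manual encoding
    text_features=["Meta_keywords"],       # tokenised by CatBoost itself
)

model = CatBoostClassifier(
    iterations=50,
    loss_function="Logloss",   # built-in objective
    eval_metric="AUC",         # built-in metric
    # Comma tokenizer for the text column; this exact wiring is an assumption --
    # check the option names against the CatBoost text-processing documentation.
    tokenizers=[{"tokenizer_id": "Comma", "delimiter": ","}],
    dictionaries=[{"dictionary_id": "Word",
                   "tokenizer_id": "Comma",
                   "occurrence_lower_bound": "1"}],
    feature_calcers=["BoW"],
    verbose=False,
)
model.fit(pool)

# Native SHAP computation: one row per sample, one column per feature,
# plus the expected-value (bias) term in the last column.
shap_values = model.get_feature_importance(pool, type="ShapValues")
```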
Repository layout:

```text
.
├── catboost_hds.ipynb    # Main notebook
├── Web_ClickThrough.zip  # Raw dataset
└── README.md             # You are here 🎉
```
The notebook expects a compressed file called `Web_ClickThrough.zip` containing the Click-Through & Decision Outcomes export produced by the Pega Customer Decision Hub (or a structurally similar table). Each row captures:
| Column Group | Examples | Notes |
|---|---|---|
| Interaction identifiers | `Decision_InteractionID`, `Context_Treatment` | Used to remove duplicates |
| Contextual features | `channel`, `language`, `country` | Categorical |
| Numerical metrics | `propensity`, `score`, `timeOnPage` | Numeric |
| Textual payloads | `Meta_keywords`, `Search_terms` | Parsed with a comma tokenizer |
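As a sketch of how such an export might be loaded: pandas can read a zipped CSV directly, and the identifier columns listed above drive de-duplication. This assumes the archive holds a single CSV file; adjust the path and column list to your export.

```python
import pandas as pd

# Read the zipped CSV directly; assumes the archive contains exactly one file.
df = pd.read_csv("Web_ClickThrough.zip", compression="zip")

# Remove duplicate interactions using the identifier columns from the table above.
df = df.drop_duplicates(subset=["Decision_InteractionID", "Context_Treatment"])
```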
Feel free to swap in your own dataset; just ensure the target column is named `Decision_Outcome` (binary: Clicked = 1, No-click = 0), as in the sketch below.
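Continuing the loading sketch above: if your export stores the outcome as strings, a mapping like this produces the expected binary target. The literal label strings here are assumptions; match them to your own export.

```python
# Map outcome labels to the binary target; "Clicked" is an assumed label value.
df["Decision_Outcome"] = (df["Decision_Outcome"] == "Clicked").astype(int)
```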
References:

- Prokhorenkova, L., et al. (2018). CatBoost: unbiased boosting with categorical features. NeurIPS.
- Lundberg, S. M., & Lee, S.-I. (2017). A Unified Approach to Interpreting Model Predictions (SHAP). NeurIPS.
This notebook is released under the MIT License. The sample data is provided for demonstration purposes only; verify licensing before using it in production.
Enjoy and happy boosting! 🚀