PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
-
Updated
Mar 30, 2025 - Jupyter Notebook
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
🔍 AI-powered diagnosis for Scikit-learn models: Detect overfitting, data leakage, class imbalance & more with LLM-generated insights
This repository commits to the application of biostatistics knowledge on clinical, randomized trials and observational studies.
Tool for evaluating atmospheric carbon dioxide concentrations as simulated by Earth system models
Sub-package of spatstat containing functionality for parametric modelling and inference
Polyhedron is a domain-driven Python optimization modeling framework that turns business entities into transparent MILP/MIQP models, with built-in diagnostics, scenario analysis, and robust solver interoperability.
This project uses the Reaction Time Survey dataset to develop a linear regression model for accurately predicting student reaction times based on various predictors. Tech: R (RStudio)
Approximation Bayesian Computation: Population Monte Carlo in MATLAB and Python
Tool for evaluating atmospheric carbon dioxide concentrations as simulated by Earth system models
Global challenge to create Species Distribution Model to predict occurrence of frog species, Litoria fallax, in Australia.
This repository contains some of the time series analysis, diagnostics and forecasting projects I have done.
Explain why your model fails — not just how accurate it is.
time series analysis in R use cases
Objective of this project is to perform predictive assesment on the Gross Domestic Product of India through an inferential analysis of various socio-economic factors to find out which predictors contribute most to the GDP. Various models are compared and Stepwise Regression model is implemented which resulted in 5.7% Test MSE.
A comprehensive explanation of Generalized Linear Models (GLMs) with Python examples, covering Binomial, Gamma, and Gaussian families, model diagnostics, formula interface, and alternative estimation approaches using statsmodels.
Detailed implementation of various time series analysis models and concepts on real datasets.
Date cleaning and preprocessing | Data wrangling | Data visualization | Summary statistics | Kaplan Meier | Cox Proportional Hazards Regression| Stratification | Report Writing | Real world data
Predicting wage in the uswage dataset (Linear Regression). Model Selection, Model Diagnostics etc.
Add a description, image, and links to the model-diagnostics topic page so that developers can more easily learn about it.
To associate your repository with the model-diagnostics topic, visit your repo's landing page and select "manage topics."