Skip to content

Sanjay-00/AutoCAM

Repository files navigation

AutoCAM - CIBIL Report Analyser

The Problem

At Shriram Finance, credit analysts prepare a CAM (Credit Appraisal Memo) for every customer with an exposure above Rs. 25 lakh. A critical section of the CAM requires manually listing all active loans from the customer's CIBIL report, including sanction amount, outstanding balance, EMI, overdue, DPD, and lender details.

This is straightforward for a customer with 3-4 loans. But CIBIL reports with 50-100+ loan accounts make this a slow, error-prone, and repetitive task, done 6-7 times per month per branch.

The Solution

AutoCAM eliminates this manual effort entirely.

Upload a CIBIL PDF and get a structured, formatted Excel file in seconds.

  • Extracts all loan accounts automatically (active and closed)
  • Covers both CRIF High Mark and TransUnion CIBIL report formats
  • Validates extracted data against the report's own summary totals
  • Outputs an Excel file in the exact format required for the CAM
  • Filter by Active / Closed accounts directly in the app or excel

Live demo: autocam-cibil.streamlit.app

Screenshots

Upload screen

Upload UI

After extraction - borrower profile and key metrics

Metrics

Account table with Active / Closed filter and Excel download

Table and Download

Impact

Before After
30-60 min manual data entry per CAM Under 1 minute
Risk of transcription errors Validated against report totals
Only feasible for small CIBIL reports Handles 100+ account reports
Done atleast 6-7 times/month per branch Same frequency, fraction of the effort

How it works

  1. Upload a digital CIBIL PDF (CRIF High Mark or TransUnion)
  2. Click Extract Data
  3. Review the dashboard - borrower name, CIBIL score, account metrics
  4. Download the pre-formatted Excel file ready for the CAM

If rule-based extraction doesn't pass validation, the app automatically falls back to a Gemini LLM for correction, without any action needed from the user.

Tech Stack

Streamlit · PyMuPDF · openpyxl · LangChain + Google Gemini · pandas

Future Work

  • LLM-powered credit analysis: auto-generate risk observations and key points from the extracted CIBIL data
  • Support for scanned/image-based PDFs via OCR
  • Multi-borrower batch processing

Limitations

  • Scanned (image-based) PDFs are not supported, only digital CIBIL reports with extractable text
  • TransUnion reports have no LLM fallback

About

Extracts structured loan account data from CRIF High Mark and TransUnion CIBIL PDFs and generates a formatted Excel file for credit analysts at Shriram Finance.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages