(RAG + LangChain + FAISS + Kaggle API)This project implements a Retrieval-Augmented Generation (RAG) system that answers natural language questions about movies using their plot summaries.
βI built a RAG-based movie knowledge assistant using LangChain and FAISS, powered by Kaggle data and deployed in Google Colab.β
This project implements a Retrieval-Augmented Generation (RAG) system that answers natural language questions about movies using their plot summaries.
π Open the notebook in Colab: Click here to run the project
- Python
- LangChain
- FAISS (Vector Database)
- Sentence-Transformers
- OpenAI LLM
- KaggleHub API
- Google Colab
Wikipedia Movie Plots Dataset from Kaggle
- Movie plot summaries are converted into vector embeddings
- FAISS stores and retrieves semantically similar plots
- Retrieved context is passed to an LLM
- The LLM generates grounded, accurate answers
- Retrieval-Augmented Generation (RAG)
- Vector databases
- Prompt engineering
- Semantic search
- Large Language Models (LLMs)
- What is the plot of Inception?
- Which movies are about artificial intelligence?
- Tell me about mafia-related movies
- Open the Google Colab notebook
- Upload your
kaggle.jsonAPI key - Run cells sequentially
- Enter natural language movie questions
This project shows how RAG systems can significantly improve LLM reliability by grounding responses in real data. It is scalable, efficient, and suitable for real-world applications.
movie-rag-langchain-faiss/ β βββ notebook/
β βββ movie_rag_colab.ipynb
β βββ README.md
βββ requirements.txt
Run the Colab notebook for an interactive demo.