Pinned Project

RAG notebook deep dive

LangChain-powered RAG notebook with FAISS, FLAN-T5, MiniLM embeddings, PDF ingestion, and conversational memory.

JUPYTER NOTEBOOK
Repository
Default branch
main
Last pushed
Dec 08, 2025

Technologies Used

  • LangChain: Orchestrates the document loaders, text splitter, vector store, and LLM into a single pipeline.
  • Hugging Face (Transformers & Embeddings): google/flan-t5-large for generation; sentence-transformers/all-MiniLM-L6-v2 for embeddings.
  • FAISS: In-memory vector index for fast semantic search over document chunks.
  • PyMuPDF: Parses uploaded PDFs into text.
  • PyTorch: Backend for the Hugging Face models.
  • ipywidgets: Interactive UI inside the notebook.

Notebook Flow

  1. Setup: Installs dependencies, detects GPU.
  2. Data Ingestion: Uploads a PDF, extracts its text with PyMuPDF, and splits it into chunks with RecursiveCharacterTextSplitter.
  3. Vector Database: Embeds chunks and stores them in FAISS.
  4. Model Initialization: Loads flan-t5-large and wraps it in a text2text-generation pipeline (FLAN-T5 is an encoder-decoder model).
  5. Basic RAG: Retrieves context and answers single-turn questions.
  6. Conversational Memory: Rewrites follow-up questions into standalone queries so retrieval stays accurate across chat turns.
  7. Final Interface: Chat UI showing responses plus history.
Need more detail?
Happy to walk through the implementation or roadmap.